AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

In Japan, the robot isn't coming for your job; it's filling the one nobody wants | TechCrunch
Robotics

In Japan, the robot isn't coming for your job; it's filling the one nobody wants | TechCrunch

Driven by labor shortages, Japan is pushing physical AI from pilot projects into real-world deployment.

TechCrunch - AI · 9 min ·
Machine Learning

[P] bitnet-edge: Ternary-weight CNNs ({-1,0,+1}) on MNIST and CIFAR-10, deployed to ESP32-S3 with zero multiplications

I built a pipeline that takes ternary-quantized CNNs from PyTorch training all the way to bare-metal inference on an ESP32-S3 microcontro...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·

All Content

[2512.09069] KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification
Machine Learning

[2512.09069] KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification

The paper presents KD-OCT, a novel knowledge distillation framework that enhances the efficiency of deep learning models for classifying ...

arXiv - Machine Learning · 4 min ·
[2509.21500] Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
Llms

[2509.21500] Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

This article presents a novel approach to reward modeling in large language models (LLMs) using rubric-based methods to mitigate reward o...

arXiv - Machine Learning · 4 min ·
[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference
Machine Learning

[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference

The paper introduces ClearFairy, an AI assistant designed to enhance decision-making in creative workflows by structuring reasoning and i...

arXiv - AI · 3 min ·
[2508.19982] Diffusion Language Models Know the Answer Before Decoding
Llms

[2508.19982] Diffusion Language Models Know the Answer Before Decoding

The paper discusses Diffusion Language Models (DLMs) and introduces a new decoding method called Prophet, which allows for faster inferen...

arXiv - AI · 4 min ·
[2507.17691] CASCADE: LLM-Powered JavaScript Deobfuscator at Google
Llms

[2507.17691] CASCADE: LLM-Powered JavaScript Deobfuscator at Google

The paper presents CASCADE, a novel LLM-powered JavaScript deobfuscator developed by Google, which enhances code comprehension and analys...

arXiv - Machine Learning · 3 min ·
[2507.02376] On the Inference (In-)Security of Vertical Federated Learning: Efficient Auditing against Inference Tampering Attack
Machine Learning

[2507.02376] On the Inference (In-)Security of Vertical Federated Learning: Efficient Auditing against Inference Tampering Attack

This paper introduces a novel attack and auditing framework for Vertical Federated Learning (VFL), addressing vulnerabilities in inferenc...

arXiv - AI · 4 min ·
[2506.06060] Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models
Llms

[2506.06060] Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models

This article discusses the privacy risks associated with federated fine-tuning of large language models, highlighting methods for extract...

arXiv - AI · 4 min ·
[2505.03801] Large Language Model Compression with Global Rank and Sparsity Optimization
Llms

[2505.03801] Large Language Model Compression with Global Rank and Sparsity Optimization

This paper presents a novel two-stage method for compressing large language models (LLMs) by optimizing global rank and sparsity, address...

arXiv - Machine Learning · 4 min ·
[2407.15738] Parallel Split Learning with Global Sampling
Machine Learning

[2407.15738] Parallel Split Learning with Global Sampling

The paper presents a novel server-driven sampling strategy for distributed deep learning, enhancing scalability and accuracy in resource-...

arXiv - Machine Learning · 3 min ·
[2401.12455] Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management
Machine Learning

[2401.12455] Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management

This article presents a novel framework for managing transportation infrastructure using multi-agent deep reinforcement learning, address...

arXiv - Machine Learning · 4 min ·
[2503.14499] Measuring AI Ability to Complete Long Software Tasks
Machine Learning

[2503.14499] Measuring AI Ability to Complete Long Software Tasks

The paper introduces a new metric to evaluate AI's ability to complete long software tasks, revealing significant advancements in AI capa...

arXiv - Machine Learning · 4 min ·
[2602.22207] Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
Llms

[2602.22207] Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

The paper presents an automated framework for translating benchmarks and datasets for multilingual Large Language Model evaluation, addre...

arXiv - Machine Learning · 3 min ·
[2602.22145] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models
Llms

[2602.22145] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models

This article explores the phenomenon of 'Cultural Ghosting' in large language models (LLMs), highlighting the systematic erasure of cultu...

arXiv - AI · 4 min ·
[2602.22144] NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors
Llms

[2602.22144] NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

The paper presents NoLan, a framework aimed at reducing object hallucinations in Large Vision-Language Models (LVLMs) by dynamically supp...

arXiv - AI · 4 min ·
[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code
Llms

[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code

This paper presents a novel method for enhancing LLM-based unit test generation by eliminating covered code, addressing challenges in tes...

arXiv - Machine Learning · 4 min ·
[2602.21841] Resilient Federated Chain: Transforming Blockchain Consensus into an Active Defense Layer for Federated Learning
Machine Learning

[2602.21841] Resilient Federated Chain: Transforming Blockchain Consensus into an Active Defense Layer for Federated Learning

The paper presents the Resilient Federated Chain (RFC), a blockchain-enabled framework designed to enhance the security of Federated Lear...

arXiv - AI · 4 min ·
[2602.21845] xai-cola: A Python library for sparsifying counterfactual explanations
Ai Infrastructure

[2602.21845] xai-cola: A Python library for sparsifying counterfactual explanations

The article introduces xai-cola, an open-source Python library designed to sparsify counterfactual explanations, enhancing interpretabili...

arXiv - Machine Learning · 3 min ·
[2602.21798] Excitation: Momentum For Experts
Machine Learning

[2602.21798] Excitation: Momentum For Experts

The paper introduces Excitation, a novel optimization framework aimed at enhancing learning in sparse architectures like Mixture-of-Exper...

arXiv - Machine Learning · 3 min ·
[2602.21652] Sparsity Induction for Accurate Post-Training Pruning of Large Language Models
Llms

[2602.21652] Sparsity Induction for Accurate Post-Training Pruning of Large Language Models

The paper presents a novel method called Sparsity Induction for enhancing post-training pruning of large language models, addressing chal...

arXiv - AI · 3 min ·
[2602.21647] Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration
Machine Learning

[2602.21647] Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration

This paper presents an optimized cascaded Nepali-English speech-to-text translation system that mitigates structural noise from ASR, enha...

arXiv - Machine Learning · 4 min ·
Previous Page 78 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime