AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Robotics

In Japan, the robot isn't coming for your job; it's filling the one nobody wants | TechCrunch

Driven by labor shortages, Japan is pushing physical AI from pilot projects into real-world deployment.

TechCrunch - AI · 9 min · about 2 hours ago

Machine Learning

[P] bitnet-edge: Ternary-weight CNNs ({-1,0,+1}) on MNIST and CIFAR-10, deployed to ESP32-S3 with zero multiplications

I built a pipeline that takes ternary-quantized CNNs from PyTorch training all the way to bare-metal inference on an ESP32-S3 microcontro...

Reddit - Machine Learning · 1 min · about 4 hours ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 9 hours ago

All Content

Machine Learning

[2512.09069] KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification

The paper presents KD-OCT, a novel knowledge distillation framework that enhances the efficiency of deep learning models for classifying ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.21500] Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

This article presents a novel approach to reward modeling in large language models (LLMs) using rubric-based methods to mitigate reward o...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference

The paper introduces ClearFairy, an AI assistant designed to enhance decision-making in creative workflows by structuring reasoning and i...

arXiv - AI · 3 min · about 1 month ago

Llms

[2508.19982] Diffusion Language Models Know the Answer Before Decoding

The paper discusses Diffusion Language Models (DLMs) and introduces a new decoding method called Prophet, which allows for faster inferen...

arXiv - AI · 4 min · about 1 month ago

Llms

[2507.17691] CASCADE: LLM-Powered JavaScript Deobfuscator at Google

The paper presents CASCADE, a novel LLM-powered JavaScript deobfuscator developed by Google, which enhances code comprehension and analys...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2507.02376] On the Inference (In-)Security of Vertical Federated Learning: Efficient Auditing against Inference Tampering Attack

This paper introduces a novel attack and auditing framework for Vertical Federated Learning (VFL), addressing vulnerabilities in inferenc...

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.06060] Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models

This article discusses the privacy risks associated with federated fine-tuning of large language models, highlighting methods for extract...

arXiv - AI · 4 min · about 1 month ago

Llms

[2505.03801] Large Language Model Compression with Global Rank and Sparsity Optimization

This paper presents a novel two-stage method for compressing large language models (LLMs) by optimizing global rank and sparsity, address...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2407.15738] Parallel Split Learning with Global Sampling

The paper presents a novel server-driven sampling strategy for distributed deep learning, enhancing scalability and accuracy in resource-...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2401.12455] Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management

This article presents a novel framework for managing transportation infrastructure using multi-agent deep reinforcement learning, address...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2503.14499] Measuring AI Ability to Complete Long Software Tasks

The paper introduces a new metric to evaluate AI's ability to complete long software tasks, revealing significant advancements in AI capa...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22207] Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

The paper presents an automated framework for translating benchmarks and datasets for multilingual Large Language Model evaluation, addre...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.22145] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models

This article explores the phenomenon of 'Cultural Ghosting' in large language models (LLMs), highlighting the systematic erasure of cultu...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.22144] NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

The paper presents NoLan, a framework aimed at reducing object hallucinations in Large Vision-Language Models (LVLMs) by dynamically supp...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code

This paper presents a novel method for enhancing LLM-based unit test generation by eliminating covered code, addressing challenges in tes...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21841] Resilient Federated Chain: Transforming Blockchain Consensus into an Active Defense Layer for Federated Learning

The paper presents the Resilient Federated Chain (RFC), a blockchain-enabled framework designed to enhance the security of Federated Lear...

arXiv - AI · 4 min · about 1 month ago

Ai Infrastructure

[2602.21845] xai-cola: A Python library for sparsifying counterfactual explanations

The article introduces xai-cola, an open-source Python library designed to sparsify counterfactual explanations, enhancing interpretabili...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.21798] Excitation: Momentum For Experts

The paper introduces Excitation, a novel optimization framework aimed at enhancing learning in sparse architectures like Mixture-of-Exper...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.21652] Sparsity Induction for Accurate Post-Training Pruning of Large Language Models

The paper presents a novel method called Sparsity Induction for enhancing post-training pruning of large language models, addressing chal...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.21647] Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration

This paper presents an optimized cascaded Nepali-English speech-to-text translation system that mitigates structural noise from ASR, enha...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 78 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

In Japan, the robot isn't coming for your job; it's filling the one nobody wants | TechCrunch

[P] bitnet-edge: Ternary-weight CNNs ({-1,0,+1}) on MNIST and CIFAR-10, deployed to ESP32-S3 with zero multiplications

UMKC Announces New Master of Science in Artificial Intelligence

All Content

[2512.09069] KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification

[2509.21500] Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference

[2508.19982] Diffusion Language Models Know the Answer Before Decoding

[2507.17691] CASCADE: LLM-Powered JavaScript Deobfuscator at Google

[2507.02376] On the Inference (In-)Security of Vertical Federated Learning: Efficient Auditing against Inference Tampering Attack

[2506.06060] Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models

[2505.03801] Large Language Model Compression with Global Rank and Sparsity Optimization

[2407.15738] Parallel Split Learning with Global Sampling

[2401.12455] Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management

[2503.14499] Measuring AI Ability to Complete Long Software Tasks

[2602.22207] Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

[2602.22145] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models

[2602.22144] NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors

[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code

[2602.21841] Resilient Federated Chain: Transforming Blockchain Consensus into an Active Defense Layer for Federated Learning

[2602.21845] xai-cola: A Python library for sparsifying counterfactual explanations

[2602.21798] Excitation: Momentum For Experts

[2602.21652] Sparsity Induction for Accurate Post-Training Pruning of Large Language Models

[2602.21647] Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration

Related Topics

Stay updated with AI News