AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO written from scratch in PyTorch - updates! [P]

So, yesterday run was a success and I did get an avg rollout length of about 64 tokens as attached in the image! This was with quality_re...

Reddit - Machine Learning · 1 min · 14 minutes ago

Llms

[2603.10652] Are Video Reasoning Models Ready to Go Outside?

Abstract page for arXiv paper 2603.10652: Are Video Reasoning Models Ready to Go Outside?

arXiv - AI · 4 min · about 3 hours ago

Machine Learning

[2602.00181] CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

Abstract page for arXiv paper 2602.00181: CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

arXiv - AI · 4 min · about 3 hours ago

All Content

Machine Learning

[2508.11460] Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models

This article evaluates uncertainty estimates in binary classification models, comparing six probabilistic machine learning algorithms to ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

The paper presents AI-Paging, a framework for optimizing AI-as-a-Service by enabling network providers to manage model selection and exec...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration

This paper presents a framework for high-fidelity network management in Federated AI-as-a-Service, focusing on cross-domain orchestration...

arXiv - AI · 4 min · about 2 months ago

Llms

[2505.11824] Latent Veracity Inference for Identifying Errors in Stepwise Reasoning

This paper presents a novel method for identifying errors in stepwise reasoning using latent veracity inference, enhancing the reliabilit...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2505.11695] Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization

The paper introduces Qronos, a novel post-training quantization algorithm that enhances neural network performance by correcting quantiza...

arXiv - AI · 4 min · about 2 months ago

Ai Infrastructure

[2602.15249] Artificial Intelligence Specialization in the European Union: Underexplored Role of the Periphery at NUTS-3 Level

This study analyzes AI research production across European regions at the NUTS-3 level, highlighting the specialization of peripheral reg...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15241] GenAI for Systems: Recurring Challenges and Design Principles from Software to Silicon

This paper explores the integration of Generative AI in computing systems, identifying recurring challenges and design principles across ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

NeuroLifting introduces a novel approach for inference in large-scale Markov Random Fields (MRFs) using Graph Neural Networks, achieving ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

The paper introduces OpaqueToolsBench, a benchmark for evaluating Large Language Model (LLM) agents' performance with opaque tools, propo...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.15756] A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference

This paper discusses the limitations of layerwise approximate verification in neural inference, presenting a counterexample that challeng...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15751] Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml

This article presents a novel approach to implementing low-latency machine learning on radiation-hard FPGAs, demonstrating its applicatio...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Infrastructure

[2602.15707] Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU

This article presents a novel real-time conversational assistant that utilizes audio and IMU data to guide users through procedural tasks...

arXiv - Machine Learning · 4 min · about 2 months ago

Robotics

[2602.15061] Safe-SDL:Establishing Safety Boundaries and Control Mechanisms for AI-Driven Self-Driving Laboratories

The paper presents Safe-SDL, a framework for ensuring safety in AI-driven Self-Driving Laboratories, addressing the critical 'Syntax-to-S...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15055] Beyond Context Sharing: A Unified Agent Communication Protocol (ACP) for Secure, Federated, and Autonomous Agent-to-Agent (A2A) Orchestration

The paper introduces the Agent Communication Protocol (ACP), a framework for secure and efficient agent-to-agent orchestration, addressin...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15521] ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

The paper presents ExpertWeaver, a framework that enhances the conversion of dense LLMs into sparse Mixture-of-Experts (MoE) models using...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15472] Fluids You Can Trust: Property-Preserving Operator Learning for Incompressible Flows

This article introduces a novel operator learning method for incompressible flows, enhancing computational efficiency while preserving es...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15036] Transforming Computational Lithography with AC and AI -- Faster, More Accurate, and Energy-efficient

This article discusses the integration of accelerated computing (AC) and artificial intelligence (AI) in computational lithography, highl...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.15470] The Skeletal Trap: Mapping Spatial Inequality and Ghost Stops in Ankara's Transit Network

This article explores Ankara's public transport crisis, attributing it to structural issues rather than mere inefficiencies. It highlight...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15423] GaiaFlow: Semantic-Guided Diffusion Tuning for Carbon-Frugal Search

GaiaFlow presents a novel framework for carbon-efficient search, employing semantic-guided diffusion tuning to balance retrieval accuracy...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.15785] This human study did not involve human subjects: Validating LLM simulations as behavioral evidence

This article discusses the use of large language models (LLMs) as synthetic participants in social science experiments, evaluating their ...

arXiv - AI · 4 min · about 2 months ago

Previous Page 152 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO written from scratch in PyTorch - updates! [P]

[2603.10652] Are Video Reasoning Models Ready to Go Outside?

[2602.00181] CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

All Content

[2508.11460] Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models

[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration

[2505.11824] Latent Veracity Inference for Identifying Errors in Stepwise Reasoning

[2505.11695] Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization

[2602.15249] Artificial Intelligence Specialization in the European Union: Underexplored Role of the Periphery at NUTS-3 Level

[2602.15241] GenAI for Systems: Recurring Challenges and Design Principles from Software to Silicon

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

[2602.15756] A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference

[2602.15751] Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml

[2602.15707] Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU

[2602.15061] Safe-SDL:Establishing Safety Boundaries and Control Mechanisms for AI-Driven Self-Driving Laboratories

[2602.15055] Beyond Context Sharing: A Unified Agent Communication Protocol (ACP) for Secure, Federated, and Autonomous Agent-to-Agent (A2A) Orchestration

[2602.15521] ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

[2602.15472] Fluids You Can Trust: Property-Preserving Operator Learning for Incompressible Flows

[2602.15036] Transforming Computational Lithography with AC and AI -- Faster, More Accurate, and Energy-efficient

[2602.15470] The Skeletal Trap: Mapping Spatial Inequality and Ghost Stops in Ankara's Transit Network

[2602.15423] GaiaFlow: Semantic-Guided Diffusion Tuning for Carbon-Frugal Search

[2602.15785] This human study did not involve human subjects: Validating LLM simulations as behavioral evidence

Related Topics

Stay updated with AI News