AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Ai Infrastructure

Nvidia Built the A.I. Era. Now It Has to Defend It.

Nvidia has played a significant role in the development of the A.I. era and now faces the challenge of maintaining its position in this e...

AI Events · 1 min · about 1 hour ago

Ai Infrastructure

Mythos SI (Structured Intelligence): Technical Evidence, Coordinated Criticism, and What the Pattern Actually Shows

Perplexity just ran a structural analysis on the criticism campaign against my work. What it found: synchronized language across posts, n...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

All Content

Machine Learning

[2602.08032] Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models

The paper presents Horizon Imagination (HI), an innovative on-policy imagination process for reinforcement learning using diffusion-based...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.15491] The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs

The paper presents Shape-Gain Decomposition for Neural Audio Codecs, enhancing bitrate-distortion performance and reducing complexity by ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.00240] Green-NAS: A Global-Scale Multi-Objective Neural Architecture Search for Robust and Efficient Edge-Native Weather Forecasting

Green-NAS presents a multi-objective neural architecture search framework aimed at optimizing weather forecasting models for low-resource...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Infrastructure

[2602.15377] Orchestration-Free Customer Service Automation: A Privacy-Preserving and Flowchart-Guided Framework

This paper presents an orchestration-free framework for customer service automation, utilizing Task-Oriented Flowcharts (TOFs) to enhance...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2601.01016] Improving Variational Autoencoder using Random Fourier Transformation: An Aviation Safety Anomaly Detection Case-Study

This study explores enhancements to Variational Autoencoders (VAEs) using Random Fourier Transformation (RFT) for anomaly detection in av...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2512.04189] BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training

The paper presents BEP, a novel Binary Error Propagation algorithm for training Binary Neural Networks (BNNs) that enables efficient back...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2512.01389] Syndrome-Flow Consistency Model Achieves One-step Denoising Error Correction Codes

The paper presents the Error Correction Syndrome-Flow Consistency Model (ECCFM), which enhances one-step denoising error correction codes...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15353] NeuroSymActive: Differentiable Neural-Symbolic Reasoning with Active Exploration for Knowledge Graph Question Answering

The paper presents NeuroSymActive, a novel framework for Knowledge Graph Question Answering that integrates differentiable neural-symboli...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15318] Sparrow: Text-Anchored Window Attention with Visual-Semantic Glimpsing for Speculative Decoding in Video LLMs

The paper introduces Sparrow, a novel framework designed to enhance speculative decoding in Video Large Language Models (Vid-LLMs) by opt...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2508.11460] Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models

This article evaluates uncertainty estimates in binary classification models, comparing six probabilistic machine learning algorithms to ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

The paper presents AI-Paging, a framework for optimizing AI-as-a-Service by enabling network providers to manage model selection and exec...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration

This paper presents a framework for high-fidelity network management in Federated AI-as-a-Service, focusing on cross-domain orchestration...

arXiv - AI · 4 min · about 2 months ago

Llms

[2505.11824] Latent Veracity Inference for Identifying Errors in Stepwise Reasoning

This paper presents a novel method for identifying errors in stepwise reasoning using latent veracity inference, enhancing the reliabilit...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2505.11695] Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization

The paper introduces Qronos, a novel post-training quantization algorithm that enhances neural network performance by correcting quantiza...

arXiv - AI · 4 min · about 2 months ago

Ai Infrastructure

[2602.15249] Artificial Intelligence Specialization in the European Union: Underexplored Role of the Periphery at NUTS-3 Level

This study analyzes AI research production across European regions at the NUTS-3 level, highlighting the specialization of peripheral reg...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15241] GenAI for Systems: Recurring Challenges and Design Principles from Software to Silicon

This paper explores the integration of Generative AI in computing systems, identifying recurring challenges and design principles across ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

NeuroLifting introduces a novel approach for inference in large-scale Markov Random Fields (MRFs) using Graph Neural Networks, achieving ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

The paper introduces OpaqueToolsBench, a benchmark for evaluating Large Language Model (LLM) agents' performance with opaque tools, propo...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.15756] A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference

This paper discusses the limitations of layerwise approximate verification in neural inference, presenting a counterexample that challeng...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15751] Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml

This article presents a novel approach to implementing low-latency machine learning on radiation-hard FPGAs, demonstrating its applicatio...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 148 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

Nvidia Built the A.I. Era. Now It Has to Defend It.

Mythos SI (Structured Intelligence): Technical Evidence, Coordinated Criticism, and What the Pattern Actually Shows

All Content

[2602.08032] Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models

[2602.15491] The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs

[2602.00240] Green-NAS: A Global-Scale Multi-Objective Neural Architecture Search for Robust and Efficient Edge-Native Weather Forecasting

[2602.15377] Orchestration-Free Customer Service Automation: A Privacy-Preserving and Flowchart-Guided Framework

[2601.01016] Improving Variational Autoencoder using Random Fourier Transformation: An Aviation Safety Anomaly Detection Case-Study

[2512.04189] BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training

[2512.01389] Syndrome-Flow Consistency Model Achieves One-step Denoising Error Correction Codes

[2602.15353] NeuroSymActive: Differentiable Neural-Symbolic Reasoning with Active Exploration for Knowledge Graph Question Answering

[2602.15318] Sparrow: Text-Anchored Window Attention with Visual-Semantic Glimpsing for Speculative Decoding in Video LLMs

[2508.11460] Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models

[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration

[2505.11824] Latent Veracity Inference for Identifying Errors in Stepwise Reasoning

[2505.11695] Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization

[2602.15249] Artificial Intelligence Specialization in the European Union: Underexplored Role of the Periphery at NUTS-3 Level

[2602.15241] GenAI for Systems: Recurring Challenges and Design Principles from Software to Silicon

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

[2602.15756] A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference

[2602.15751] Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml

Related Topics

Stay updated with AI News