AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

GPT-5.4-mini produces shorter, terser outputs by default. Vanilla accuracy dropped from 69.5% to 47.2% across 12 tasks (1,800 evals). The...

Reddit - Machine Learning · 1 min · about 1 hour ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 2 hours ago

Machine Learning

[R] First open-source implementation of Hebbian fast-weight write-back for the BDH architecture

The BDH (Dragon Hatchling) paper (arXiv:2509.26507) describes a Hebbian synaptic plasticity mechanism where model weights update during i...

Reddit - Machine Learning · 1 min · about 4 hours ago

All Content

Machine Learning

“AI” is a description, not the thing itself. Are we missing a word?

We keep talking about “AI” as if it were the name of an entity. But artificial intelligence is not the entity. It is a description. Intel...

Reddit - Artificial Intelligence · 1 min · 4 days ago

Machine Learning

[R] Ternary neural networks as a path to more efficient AI - is (+1, 0, -1) weight quantization getting serious research attention?

I've been reading about ternary weight quantization in neural networks and wanted to get a sence of how seriously the ML research communi...

Reddit - Machine Learning · 1 min · 4 days ago

Machine Learning

[2603.16146] Deep Adaptive Model-Based Design of Experiments

Abstract page for arXiv paper 2603.16146: Deep Adaptive Model-Based Design of Experiments

arXiv - Machine Learning · 3 min · 4 days ago

Llms

[2603.13606] NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL

Abstract page for arXiv paper 2603.13606: NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL

arXiv - AI · 4 min · 4 days ago

Robotics

[2511.20008] Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network

Abstract page for arXiv paper 2511.20008: Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network

arXiv - AI · 3 min · 4 days ago

Machine Learning

[2512.07697] Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks

Abstract page for arXiv paper 2512.07697: Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks

arXiv - Machine Learning · 3 min · 4 days ago

Machine Learning

[2511.04568] Riesz Regression As Direct Density Ratio Estimation

Abstract page for arXiv paper 2511.04568: Riesz Regression As Direct Density Ratio Estimation

arXiv - Machine Learning · 3 min · 4 days ago

Machine Learning

[2508.10149] Prediction-Powered Inference with Inverse Probability Weighting

Abstract page for arXiv paper 2508.10149: Prediction-Powered Inference with Inverse Probability Weighting

arXiv - Machine Learning · 3 min · 4 days ago

Machine Learning

[2509.06027] DreamAudio: Customized Text-to-Audio Generation with Diffusion Models

Abstract page for arXiv paper 2509.06027: DreamAudio: Customized Text-to-Audio Generation with Diffusion Models

arXiv - AI · 4 min · 4 days ago

Machine Learning

[2502.10001] EmbBERT: Attention Under 2 MB Memory

Abstract page for arXiv paper 2502.10001: EmbBERT: Attention Under 2 MB Memory

arXiv - Machine Learning · 4 min · 4 days ago

Llms

[2412.08686] LatentQA: Teaching LLMs to Decode Activations Into Natural Language

Abstract page for arXiv paper 2412.08686: LatentQA: Teaching LLMs to Decode Activations Into Natural Language

arXiv - Machine Learning · 4 min · 4 days ago

Ai Infrastructure

[2410.06112] SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing

Abstract page for arXiv paper 2410.06112: SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2403.16125] Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design

Abstract page for arXiv paper 2403.16125: Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2202.05775] Inference of Multiscale Gaussian Graphical Model

Abstract page for arXiv paper 2202.05775: Inference of Multiscale Gaussian Graphical Model

arXiv - Machine Learning · 4 min · 4 days ago

Ai Infrastructure

[2603.11858] Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data

Abstract page for arXiv paper 2603.11858: Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited...

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2601.14026] Universal Approximation Theorem for Input-Connected Multilayer Perceptrons

Abstract page for arXiv paper 2601.14026: Universal Approximation Theorem for Input-Connected Multilayer Perceptrons

arXiv - Machine Learning · 3 min · 4 days ago

Llms

[2603.23485] Failure of contextual invariance in gender inference with large language models

Abstract page for arXiv paper 2603.23485: Failure of contextual invariance in gender inference with large language models

arXiv - AI · 3 min · 4 days ago

Machine Learning

[2511.16105] Data-Efficient and Robust Trajectory Generation through Pathlet Dictionary Learning

Abstract page for arXiv paper 2511.16105: Data-Efficient and Robust Trajectory Generation through Pathlet Dictionary Learning

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2510.12996] CSI-4CAST: A Hybrid Deep Learning Model for CSI Prediction with Comprehensive Robustness and Generalization Testing

Abstract page for arXiv paper 2510.12996: CSI-4CAST: A Hybrid Deep Learning Model for CSI Prediction with Comprehensive Robustness and Ge...

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2510.08294] Counterfactual Identifiability via Dynamic Optimal Transport

Abstract page for arXiv paper 2510.08294: Counterfactual Identifiability via Dynamic Optimal Transport

arXiv - AI · 3 min · 4 days ago

Previous Page 7 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

UMKC Announces New Master of Science in Artificial Intelligence

[R] First open-source implementation of Hebbian fast-weight write-back for the BDH architecture

All Content

“AI” is a description, not the thing itself. Are we missing a word?

[R] Ternary neural networks as a path to more efficient AI - is (+1, 0, -1) weight quantization getting serious research attention?

[2603.16146] Deep Adaptive Model-Based Design of Experiments

[2603.13606] NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL

[2511.20008] Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network

[2512.07697] Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks

[2511.04568] Riesz Regression As Direct Density Ratio Estimation

[2508.10149] Prediction-Powered Inference with Inverse Probability Weighting

[2509.06027] DreamAudio: Customized Text-to-Audio Generation with Diffusion Models

[2502.10001] EmbBERT: Attention Under 2 MB Memory

[2412.08686] LatentQA: Teaching LLMs to Decode Activations Into Natural Language

[2410.06112] SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing

[2403.16125] Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design

[2202.05775] Inference of Multiscale Gaussian Graphical Model

[2603.11858] Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data

[2601.14026] Universal Approximation Theorem for Input-Connected Multilayer Perceptrons

[2603.23485] Failure of contextual invariance in gender inference with large language models

[2511.16105] Data-Efficient and Robust Trajectory Generation through Pathlet Dictionary Learning

[2510.12996] CSI-4CAST: A Hybrid Deep Learning Model for CSI Prediction with Comprehensive Robustness and Generalization Testing

[2510.08294] Counterfactual Identifiability via Dynamic Optimal Transport

Related Topics

Stay updated with AI News