Machine Learning

ML algorithms, training, and inference


Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 24 minutes ago

LLMs

Depth-first pruning seems to transfer from GPT-2 to Llama (unexpectedly well)

TL;DR: Removing the right transformer layers (instead of shrinking all layers) gives smaller, faster models with minimal quality loss — a...

Reddit - Artificial Intelligence · 1 min · 38 minutes ago

Machine Learning

If frontier AI labs have unlimited shovels, what's stopping them from building everything?

I found myself explaining AI tokens to my mom over the weekend. At first I related them to building bricks: blocks of data the model uses...

Reddit - Artificial Intelligence · 1 min · 38 minutes ago

All Content

Machine Learning

[2603.26258] ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction

Abstract page for arXiv paper 2603.26258: ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction

arXiv - AI · 3 min · 1 day ago

LLMs

[2603.26246] Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Abstract page for arXiv paper 2603.26246: Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

arXiv - AI · 3 min · 1 day ago

Machine Learning

[2603.26217] On associative neural networks for sparse patterns with huge capacities

Abstract page for arXiv paper 2603.26217: On associative neural networks for sparse patterns with huge capacities

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.26127] Finding Distributed Object-Centric Properties in Self-Supervised Transformers

Abstract page for arXiv paper 2603.26127: Finding Distributed Object-Centric Properties in Self-Supervised Transformers

arXiv - AI · 4 min · 1 day ago

Machine Learning

[2603.26098] A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

Abstract page for arXiv paper 2603.26098: A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

arXiv - AI · 3 min · 1 day ago

Machine Learning

[2603.26071] MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality

Abstract page for arXiv paper 2603.26071: MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Predic...

arXiv - Machine Learning · 4 min · 1 day ago

Machine Learning

[2603.26092] CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection

Abstract page for arXiv paper 2603.26092: CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Obje...

arXiv - Machine Learning · 4 min · 1 day ago

Machine Learning

[2603.26048] Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression

Abstract page for arXiv paper 2603.26048: Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.25880] Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment

Abstract page for arXiv paper 2603.25880: Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment

arXiv - AI · 4 min · 1 day ago

Machine Learning

[2603.25948] Globalized Adversarial Regret Optimization: Robust Decisions with Uncalibrated Predictions

Abstract page for arXiv paper 2603.25948: Globalized Adversarial Regret Optimization: Robust Decisions with Uncalibrated Predictions

arXiv - Machine Learning · 4 min · 1 day ago

LLMs

[2603.25937] Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

Abstract page for arXiv paper 2603.25937: Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.25860] On the Expressive Power of Contextual Relations in Transformers

Abstract page for arXiv paper 2603.25860: On the Expressive Power of Contextual Relations in Transformers

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.25832] A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System

Abstract page for arXiv paper 2603.25832: A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System

arXiv - Machine Learning · 4 min · 1 day ago

Machine Learning

[2603.25821] Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

Abstract page for arXiv paper 2603.25821: Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

arXiv - AI · 4 min · 1 day ago

LLMs

[2603.25810] ExVerus: Verus Proof Repair via Counterexample Reasoning

Abstract page for arXiv paper 2603.25810: ExVerus: Verus Proof Repair via Counterexample Reasoning

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.25803] Do All Vision Transformers Need Registers? A Cross-Architectural Reassessment

Abstract page for arXiv paper 2603.25803: Do All Vision Transformers Need Registers? A Cross-Architectural Reassessment

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.25793] Vision Transformers and Graph Neural Networks for Charged Particle Tracking in the ATLAS Muon Spectrometer

Abstract page for arXiv paper 2603.25793: Vision Transformers and Graph Neural Networks for Charged Particle Tracking in the ATLAS Muon S...

arXiv - Machine Learning · 4 min · 1 day ago

Machine Learning

[2603.25796] Beyond identifiability: Learning causal representations with few environments and finite samples

Abstract page for arXiv paper 2603.25796: Beyond identifiability: Learning causal representations with few environments and finite samples

arXiv - AI · 3 min · 1 day ago

LLMs

[2603.25780] A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation

Abstract page for arXiv paper 2603.25780: A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2603.25776] SAHMM-VAE: A Source-Wise Adaptive Hidden Markov Prior Variational Autoencoder for Unsupervised Blind Source Separation

Abstract page for arXiv paper 2603.25776: SAHMM-VAE: A Source-Wise Adaptive Hidden Markov Prior Variational Autoencoder for Unsupervised ...

arXiv - Machine Learning · 4 min · 1 day ago

