Machine Learning

ML algorithms, training, and inference

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

Depth-first pruning seems to transfer from GPT-2 to Llama (unexpectedly well)

TL;DR: Removing the right transformer layers (instead of shrinking all layers) gives smaller, faster models with minimal quality loss — a...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

If frontier AI labs have unlimited shovels, what's stopping them from building everything?

I found myself explaining AI tokens to my mom over the weekend. At first I related them to building bricks: blocks of data the model uses...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.26258] ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction
Machine Learning

[2603.26258] ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction

Abstract page for arXiv paper 2603.26258: ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction

arXiv - AI · 3 min ·
[2603.26246] Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR
Llms

[2603.26246] Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Abstract page for arXiv paper 2603.26246: Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

arXiv - AI · 3 min ·
[2603.26217] On associative neural networks for sparse patterns with huge capacities
Machine Learning

[2603.26217] On associative neural networks for sparse patterns with huge capacities

Abstract page for arXiv paper 2603.26217: On associative neural networks for sparse patterns with huge capacities

arXiv - Machine Learning · 3 min ·
[2603.26127] Finding Distributed Object-Centric Properties in Self-Supervised Transformers
Machine Learning

[2603.26127] Finding Distributed Object-Centric Properties in Self-Supervised Transformers

Abstract page for arXiv paper 2603.26127: Finding Distributed Object-Centric Properties in Self-Supervised Transformers

arXiv - AI · 4 min ·
[2603.26098] A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning
Machine Learning

[2603.26098] A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

Abstract page for arXiv paper 2603.26098: A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

arXiv - AI · 3 min ·
[2603.26071] MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality
Machine Learning

[2603.26071] MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality

Abstract page for arXiv paper 2603.26071: MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Predic...

arXiv - Machine Learning · 4 min ·
[2603.26092] CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection
Machine Learning

[2603.26092] CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection

Abstract page for arXiv paper 2603.26092: CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Obje...

arXiv - Machine Learning · 4 min ·
[2603.26048] Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression
Machine Learning

[2603.26048] Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression

Abstract page for arXiv paper 2603.26048: Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression

arXiv - Machine Learning · 3 min ·
[2603.25880] Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment
Machine Learning

[2603.25880] Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment

Abstract page for arXiv paper 2603.25880: Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment

arXiv - AI · 4 min ·
[2603.25948] Globalized Adversarial Regret Optimization: Robust Decisions with Uncalibrated Predictions
Machine Learning

[2603.25948] Globalized Adversarial Regret Optimization: Robust Decisions with Uncalibrated Predictions

Abstract page for arXiv paper 2603.25948: Globalized Adversarial Regret Optimization: Robust Decisions with Uncalibrated Predictions

arXiv - Machine Learning · 4 min ·
[2603.25937] Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned
Llms

[2603.25937] Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

Abstract page for arXiv paper 2603.25937: Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

arXiv - Machine Learning · 3 min ·
[2603.25860] On the Expressive Power of Contextual Relations in Transformers
Machine Learning

[2603.25860] On the Expressive Power of Contextual Relations in Transformers

Abstract page for arXiv paper 2603.25860: On the Expressive Power of Contextual Relations in Transformers

arXiv - Machine Learning · 3 min ·
[2603.25832] A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System
Machine Learning

[2603.25832] A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System

Abstract page for arXiv paper 2603.25832: A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System

arXiv - Machine Learning · 4 min ·
[2603.25821] Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI
Machine Learning

[2603.25821] Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

Abstract page for arXiv paper 2603.25821: Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

arXiv - AI · 4 min ·
[2603.25810] ExVerus: Verus Proof Repair via Counterexample Reasoning
Llms

[2603.25810] ExVerus: Verus Proof Repair via Counterexample Reasoning

Abstract page for arXiv paper 2603.25810: ExVerus: Verus Proof Repair via Counterexample Reasoning

arXiv - Machine Learning · 3 min ·
[2603.25803] Do All Vision Transformers Need Registers? A Cross-Architectural Reassessment
Machine Learning

[2603.25803] Do All Vision Transformers Need Registers? A Cross-Architectural Reassessment

Abstract page for arXiv paper 2603.25803: Do All Vision Transformers Need Registers? A Cross-Architectural Reassessment

arXiv - Machine Learning · 3 min ·
[2603.25793] Vision Transformers and Graph Neural Networks for Charged Particle Tracking in the ATLAS Muon Spectrometer
Machine Learning

[2603.25793] Vision Transformers and Graph Neural Networks for Charged Particle Tracking in the ATLAS Muon Spectrometer

Abstract page for arXiv paper 2603.25793: Vision Transformers and Graph Neural Networks for Charged Particle Tracking in the ATLAS Muon S...

arXiv - Machine Learning · 4 min ·
[2603.25796] Beyond identifiability: Learning causal representations with few environments and finite samples
Machine Learning

[2603.25796] Beyond identifiability: Learning causal representations with few environments and finite samples

Abstract page for arXiv paper 2603.25796: Beyond identifiability: Learning causal representations with few environments and finite samples

arXiv - AI · 3 min ·
[2603.25780] A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation
Llms

[2603.25780] A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation

Abstract page for arXiv paper 2603.25780: A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation

arXiv - Machine Learning · 3 min ·
[2603.25776] SAHMM-VAE: A Source-Wise Adaptive Hidden Markov Prior Variational Autoencoder for Unsupervised Blind Source Separation
Machine Learning

[2603.25776] SAHMM-VAE: A Source-Wise Adaptive Hidden Markov Prior Variational Autoencoder for Unsupervised Blind Source Separation

Abstract page for arXiv paper 2603.25776: SAHMM-VAE: A Source-Wise Adaptive Hidden Markov Prior Variational Autoencoder for Unsupervised ...

arXiv - Machine Learning · 4 min ·
Previous Page 37 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime