Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

I tried building a memory-first AI… and ended up discovering smaller models can beat larger ones

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Artificial Intelligence · 1 min ·
Llms

[D] Howcome Muon is only being used for Transformers?

Muon has quickly been adopted in LLM training, yet we don't see it being talked about in other contexts. Searches for Muon on ConvNets tu...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] Run Karpathy's Autoresearch for $0.44 instead of $24 — Open-source parallel evolution pipeline on SageMaker Spot

TL;DR: I built an open-source pipeline that runs Karpathy's autoresearch on SageMaker Spot instances — 25 autonomous ML experiments for $...

Reddit - Machine Learning · 1 min ·

All Content

[2603.24400] Neural Network Models for Contextual Regression
Machine Learning

[2603.24400] Neural Network Models for Contextual Regression

Abstract page for arXiv paper 2603.24400: Neural Network Models for Contextual Regression

arXiv - Machine Learning · 3 min ·
[2603.24396] Exploring How Fair Model Representations Relate to Fair Recommendations
Machine Learning

[2603.24396] Exploring How Fair Model Representations Relate to Fair Recommendations

Abstract page for arXiv paper 2603.24396: Exploring How Fair Model Representations Relate to Fair Recommendations

arXiv - Machine Learning · 3 min ·
[2603.24392] Federated fairness-aware classification under differential privacy
Machine Learning

[2603.24392] Federated fairness-aware classification under differential privacy

Abstract page for arXiv paper 2603.24392: Federated fairness-aware classification under differential privacy

arXiv - Machine Learning · 3 min ·
[2603.24369] Adaptive decision-making for stochastic service network design
Machine Learning

[2603.24369] Adaptive decision-making for stochastic service network design

Abstract page for arXiv paper 2603.24369: Adaptive decision-making for stochastic service network design

arXiv - Machine Learning · 4 min ·
[2603.24323] Connecting Meteorite Spectra to Lunar Surface Composition Using Hyperspectral Imaging and Machine Learning
Machine Learning

[2603.24323] Connecting Meteorite Spectra to Lunar Surface Composition Using Hyperspectral Imaging and Machine Learning

Abstract page for arXiv paper 2603.24323: Connecting Meteorite Spectra to Lunar Surface Composition Using Hyperspectral Imaging and Machi...

arXiv - Machine Learning · 4 min ·
[2603.24304] CGRL: Causal-Guided Representation Learning for Graph Out-of-Distribution Generalization
Machine Learning

[2603.24304] CGRL: Causal-Guided Representation Learning for Graph Out-of-Distribution Generalization

Abstract page for arXiv paper 2603.24304: CGRL: Causal-Guided Representation Learning for Graph Out-of-Distribution Generalization

arXiv - Machine Learning · 3 min ·
[2603.24239] DVM: Real-Time Kernel Generation for Dynamic AI Models
Machine Learning

[2603.24239] DVM: Real-Time Kernel Generation for Dynamic AI Models

Abstract page for arXiv paper 2603.24239: DVM: Real-Time Kernel Generation for Dynamic AI Models

arXiv - Machine Learning · 4 min ·
[2603.24226] UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking
Llms

[2603.24226] UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking

Abstract page for arXiv paper 2603.24226: UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking

arXiv - Machine Learning · 4 min ·
[2603.24209] HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer
Machine Learning

[2603.24209] HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer

Abstract page for arXiv paper 2603.24209: HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directi...

arXiv - Machine Learning · 4 min ·
[2603.24196] Quantum Neural Physics: Solving Partial Differential Equations on Quantum Simulators using Quantum Convolutional Neural Networks
Machine Learning

[2603.24196] Quantum Neural Physics: Solving Partial Differential Equations on Quantum Simulators using Quantum Convolutional Neural Networks

Abstract page for arXiv paper 2603.24196: Quantum Neural Physics: Solving Partial Differential Equations on Quantum Simulators using Quan...

arXiv - Machine Learning · 4 min ·
[2603.24167] Walma: Learning to See Memory Corruption in WebAssembly
Machine Learning

[2603.24167] Walma: Learning to See Memory Corruption in WebAssembly

Abstract page for arXiv paper 2603.24167: Walma: Learning to See Memory Corruption in WebAssembly

arXiv - Machine Learning · 3 min ·
[2603.24150] A visual observation on the geometry of UMAP projections of the difference vectors of antonym and synonym word pair embeddings
Machine Learning

[2603.24150] A visual observation on the geometry of UMAP projections of the difference vectors of antonym and synonym word pair embeddings

Abstract page for arXiv paper 2603.24150: A visual observation on the geometry of UMAP projections of the difference vectors of antonym a...

arXiv - Machine Learning · 3 min ·
[2603.24139] Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection
Machine Learning

[2603.24139] Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection

Abstract page for arXiv paper 2603.24139: Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection

arXiv - Machine Learning · 4 min ·
[2603.24111] Toward a Multi-Layer ML-Based Security Framework for Industrial IoT
Machine Learning

[2603.24111] Toward a Multi-Layer ML-Based Security Framework for Industrial IoT

Abstract page for arXiv paper 2603.24111: Toward a Multi-Layer ML-Based Security Framework for Industrial IoT

arXiv - Machine Learning · 4 min ·
[2603.24083] Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning
Machine Learning

[2603.24083] Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning

Abstract page for arXiv paper 2603.24083: Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning

arXiv - Machine Learning · 4 min ·
[2603.24016] COVTrack++: Learning Open-Vocabulary Multi-Object Tracking from Continuous Videos via a Synergistic Paradigm
Machine Learning

[2603.24016] COVTrack++: Learning Open-Vocabulary Multi-Object Tracking from Continuous Videos via a Synergistic Paradigm

Abstract page for arXiv paper 2603.24016: COVTrack++: Learning Open-Vocabulary Multi-Object Tracking from Continuous Videos via a Synergi...

arXiv - Machine Learning · 4 min ·
[2603.24054] Hierarchical Spatial-Temporal Graph-Enhanced Model for Map-Matching
Machine Learning

[2603.24054] Hierarchical Spatial-Temporal Graph-Enhanced Model for Map-Matching

Abstract page for arXiv paper 2603.24054: Hierarchical Spatial-Temporal Graph-Enhanced Model for Map-Matching

arXiv - Machine Learning · 4 min ·
[2603.24041] Minimal Sufficient Representations for Self-interpretable Deep Neural Networks
Machine Learning

[2603.24041] Minimal Sufficient Representations for Self-interpretable Deep Neural Networks

Abstract page for arXiv paper 2603.24041: Minimal Sufficient Representations for Self-interpretable Deep Neural Networks

arXiv - Machine Learning · 3 min ·
[2603.23974] Machine vision with small numbers of detected photons per inference
Machine Learning

[2603.23974] Machine vision with small numbers of detected photons per inference

Abstract page for arXiv paper 2603.23974: Machine vision with small numbers of detected photons per inference

arXiv - Machine Learning · 4 min ·
[2603.23971] The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More
Llms

[2603.23971] The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More

Abstract page for arXiv paper 2603.23971: The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More

arXiv - Machine Learning · 4 min ·
Previous Page 34 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime