Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

[D] Make. Big. Batch. Size.

It's something between vent and learning. I tried training RWKV v6 model by my own code on my RTX 4050. I trained over 50k steps on batch...

Reddit - Machine Learning · 1 min ·
Machine Learning

AI Tools That Can’t Prove What They Did Will Hit a Wall

Most AI products are still judged like answer machines. People ask whether the model is smart, fast, creative, cheap, or good at sounding...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[P] PhAIL (phail.ai) – an open benchmark for robot AI on real hardware. Best model: 5% of human throughput, needs help every 4 minutes.

I spent the last year trying to answer a simple question: how good are VLA models on real commercial tasks? Not demos, not simulation, no...

Reddit - Machine Learning · 1 min ·

All Content

[2511.07436] Analysing Environmental Efficiency in AI for X-Ray Diagnosis
Llms

[2511.07436] Analysing Environmental Efficiency in AI for X-Ray Diagnosis

Abstract page for arXiv paper 2511.07436: Analysing Environmental Efficiency in AI for X-Ray Diagnosis

arXiv - AI · 4 min ·
[2601.02856] Electricity Price Forecasting: Bridging Linear Models, Neural Networks and Online Learning
Machine Learning

[2601.02856] Electricity Price Forecasting: Bridging Linear Models, Neural Networks and Online Learning

Abstract page for arXiv paper 2601.02856: Electricity Price Forecasting: Bridging Linear Models, Neural Networks and Online Learning

arXiv - Machine Learning · 4 min ·
[2601.00428] Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classification Predictability Gap
Machine Learning

[2601.00428] Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classification Predictability Gap

Abstract page for arXiv paper 2601.00428: Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classific...

arXiv - Machine Learning · 4 min ·
[2510.18087] Planned Diffusion
Llms

[2510.18087] Planned Diffusion

Abstract page for arXiv paper 2510.18087: Planned Diffusion

arXiv - AI · 4 min ·
[2509.23768] From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
Llms

[2509.23768] From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Abstract page for arXiv paper 2509.23768: From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

arXiv - AI · 3 min ·
[2512.18951] Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models
Llms

[2512.18951] Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models

Abstract page for arXiv paper 2512.18951: Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models

arXiv - Machine Learning · 3 min ·
[2509.03345] Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning
Llms

[2509.03345] Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning

Abstract page for arXiv paper 2509.03345: Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive ...

arXiv - AI · 4 min ·
[2512.10152] Rethinking Bivariate Causal Discovery Through the Lens of Exchangeability
Machine Learning

[2512.10152] Rethinking Bivariate Causal Discovery Through the Lens of Exchangeability

Abstract page for arXiv paper 2512.10152: Rethinking Bivariate Causal Discovery Through the Lens of Exchangeability

arXiv - Machine Learning · 4 min ·
[2512.01906] Delays in Spiking Neural Networks: A State Space Model Approach
Machine Learning

[2512.01906] Delays in Spiking Neural Networks: A State Space Model Approach

Abstract page for arXiv paper 2512.01906: Delays in Spiking Neural Networks: A State Space Model Approach

arXiv - Machine Learning · 4 min ·
[2504.15780] TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Llms

[2504.15780] TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Abstract page for arXiv paper 2504.15780: TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

arXiv - AI · 4 min ·
[2503.03361] Concepts Learned Visually by Infants Can Contribute to Visual Learning and Understanding in AI Models
Machine Learning

[2503.03361] Concepts Learned Visually by Infants Can Contribute to Visual Learning and Understanding in AI Models

Abstract page for arXiv paper 2503.03361: Concepts Learned Visually by Infants Can Contribute to Visual Learning and Understanding in AI ...

arXiv - AI · 4 min ·
[2512.01678] Morphling: Fast, Fused, and Flexible GNN Training at Scale
Machine Learning

[2512.01678] Morphling: Fast, Fused, and Flexible GNN Training at Scale

Abstract page for arXiv paper 2512.01678: Morphling: Fast, Fused, and Flexible GNN Training at Scale

arXiv - Machine Learning · 4 min ·
[2511.22344] Cleaning the Pool: Progressive Filtering of Unlabeled Pools in Deep Active Learning
Machine Learning

[2511.22344] Cleaning the Pool: Progressive Filtering of Unlabeled Pools in Deep Active Learning

Abstract page for arXiv paper 2511.22344: Cleaning the Pool: Progressive Filtering of Unlabeled Pools in Deep Active Learning

arXiv - Machine Learning · 4 min ·
[2410.20894] Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots
Machine Learning

[2410.20894] Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots

Abstract page for arXiv paper 2410.20894: Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Deto...

arXiv - Machine Learning · 4 min ·
[2511.16992] FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models
Llms

[2511.16992] FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models

Abstract page for arXiv paper 2511.16992: FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models

arXiv - Machine Learning · 4 min ·
[2511.14961] Graph Memory: A Structured and Interpretable Framework for Modality-Agnostic Embedding-Based Inference
Machine Learning

[2511.14961] Graph Memory: A Structured and Interpretable Framework for Modality-Agnostic Embedding-Based Inference

Abstract page for arXiv paper 2511.14961: Graph Memory: A Structured and Interpretable Framework for Modality-Agnostic Embedding-Based In...

arXiv - Machine Learning · 4 min ·
[2603.25741] Vega: Learning to Drive with Natural Language Instructions
Machine Learning

[2603.25741] Vega: Learning to Drive with Natural Language Instructions

Abstract page for arXiv paper 2603.25741: Vega: Learning to Drive with Natural Language Instructions

arXiv - AI · 3 min ·
[2510.13772] Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs
Machine Learning

[2510.13772] Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs

Abstract page for arXiv paper 2510.13772: Tensor Gaussian Processes: Efficient Solvers for Nonlinear PDEs

arXiv - Machine Learning · 4 min ·
[2603.25730] PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference
Machine Learning

[2603.25730] PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Abstract page for arXiv paper 2603.25730: PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

arXiv - AI · 4 min ·
[2510.12453] Time-Correlated Video Bridge Matching
Machine Learning

[2510.12453] Time-Correlated Video Bridge Matching

Abstract page for arXiv paper 2510.12453: Time-Correlated Video Bridge Matching

arXiv - Machine Learning · 3 min ·
Previous Page 73 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime