Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[R] Are there ML approaches for prioritizing and routing “important” signals across complex systems?

I’ve been reading more about attention mechanisms in transformers and how they effectively learn to weight and prioritize relevant inputs...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

[P] I trained a language model from scratch for a low resource language and got it running fully on-device on Android (no GPU, demo)

Hi Everybody! I just wanted to share an update on a project I’ve been working on called BULaMU, a family of language models trained (20M,...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

[R] Structure Over Scale: Memory-First Reasoning and Depth-Pruned Efficiency in Magnus and Seed Architecture Auto-Discovery

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Machine Learning · 1 min · about 1 hour ago

All Content

Machine Learning

[2508.02330] A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

Abstract page for arXiv paper 2508.02330: A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2507.21037] When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject Motor Imagery Decoding

Abstract page for arXiv paper 2507.21037: When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject ...

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2507.07580] COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

Abstract page for arXiv paper 2507.07580: COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2506.06482] TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

Abstract page for arXiv paper 2506.06482: TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2506.06303] Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Abstract page for arXiv paper 2506.06303: Reward Is Enough: LLMs Are In-Context Reinforcement Learners

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2506.04831] EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health Records

Abstract page for arXiv paper 2506.04831: EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health ...

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2505.22785] Navigating the Latent Space Dynamics of Neural Models

Abstract page for arXiv paper 2505.22785: Navigating the Latent Space Dynamics of Neural Models

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2505.16950] Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

Abstract page for arXiv paper 2505.16950: Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2505.15516] Explainable embeddings with Distance Explainer

Abstract page for arXiv paper 2505.15516: Explainable embeddings with Distance Explainer

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2502.01521] Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

Abstract page for arXiv paper 2502.01521: Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2409.11847] An efficient wavelet-based physics-informed neural network for multiscale problems

Abstract page for arXiv paper 2409.11847: An efficient wavelet-based physics-informed neural network for multiscale problems

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2406.01969] Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

Abstract page for arXiv paper 2406.01969: Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2210.11039] Entire Space Counterfactual Learning for Reliable Content Recommendations

Abstract page for arXiv paper 2210.11039: Entire Space Counterfactual Learning for Reliable Content Recommendations

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2603.24567] Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

Abstract page for arXiv paper 2603.24567: Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2603.24481] Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

Abstract page for arXiv paper 2603.24481: Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical...

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2603.24436] Enes Causal Discovery

Abstract page for arXiv paper 2603.24436: Enes Causal Discovery

arXiv - Machine Learning · 3 min · 5 days ago

Llms

[2603.24472] Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Abstract page for arXiv paper 2603.24472: Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2603.24477] Composer 2 Technical Report

Abstract page for arXiv paper 2603.24477: Composer 2 Technical Report

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2603.24400] Neural Network Models for Contextual Regression

Abstract page for arXiv paper 2603.24400: Neural Network Models for Contextual Regression

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2603.24396] Exploring How Fair Model Representations Relate to Fair Recommendations

Abstract page for arXiv paper 2603.24396: Exploring How Fair Model Representations Relate to Fair Recommendations

arXiv - Machine Learning · 3 min · 5 days ago

Previous Page 33 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[R] Are there ML approaches for prioritizing and routing “important” signals across complex systems?

[P] I trained a language model from scratch for a low resource language and got it running fully on-device on Android (no GPU, demo)

[R] Structure Over Scale: Memory-First Reasoning and Depth-Pruned Efficiency in Magnus and Seed Architecture Auto-Discovery

All Content

[2508.02330] A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

[2507.21037] When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject Motor Imagery Decoding

[2507.07580] COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

[2506.06482] TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

[2506.06303] Reward Is Enough: LLMs Are In-Context Reinforcement Learners

[2506.04831] EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health Records

[2505.22785] Navigating the Latent Space Dynamics of Neural Models

[2505.16950] Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

[2505.15516] Explainable embeddings with Distance Explainer

[2502.01521] Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

[2409.11847] An efficient wavelet-based physics-informed neural network for multiscale problems

[2406.01969] Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

[2210.11039] Entire Space Counterfactual Learning for Reliable Content Recommendations

[2603.24567] Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

[2603.24481] Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

[2603.24436] Enes Causal Discovery

[2603.24472] Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

[2603.24477] Composer 2 Technical Report

[2603.24400] Neural Network Models for Contextual Regression

[2603.24396] Exploring How Fair Model Representations Relate to Fair Recommendations

Related Topics

Stay updated with AI News