Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP

I put together a small educational repo that implements distributed training parallelism from scratch in PyTorch: https://github.com/shre...

Reddit - Artificial Intelligence · 1 min · 26 minutes ago

Llms

Claude cannot be trusted to perform complex engineering tasks

AMD’s AI director just analyzed 6,852 Claude Code sessions, 234,760 tool calls, and 17,871 thinking blocks. Her conclusion: “Claude canno...

Reddit - Artificial Intelligence · 1 min · 26 minutes ago

Machine Learning

Training an AI to play Resident Evil Requiem using Behavior Cloning + HG-DAgge [P]

Code of Project: https://github.com/paulo101977/notebooks-rl/tree/main/re_requiem I’ve been working on training an agent to play a segmen...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Machine Learning

[2508.02330] A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

Abstract page for arXiv paper 2508.02330: A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

arXiv - Machine Learning · 4 min · 17 days ago

Llms

[2507.21037] When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject Motor Imagery Decoding

Abstract page for arXiv paper 2507.21037: When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject ...

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2507.07580] COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

Abstract page for arXiv paper 2507.07580: COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2506.06482] TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

Abstract page for arXiv paper 2506.06482: TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

arXiv - Machine Learning · 4 min · 17 days ago

Llms

[2506.06303] Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Abstract page for arXiv paper 2506.06303: Reward Is Enough: LLMs Are In-Context Reinforcement Learners

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2506.04831] EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health Records

Abstract page for arXiv paper 2506.04831: EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health ...

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2505.22785] Navigating the Latent Space Dynamics of Neural Models

Abstract page for arXiv paper 2505.22785: Navigating the Latent Space Dynamics of Neural Models

arXiv - Machine Learning · 4 min · 17 days ago

Llms

[2505.16950] Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

Abstract page for arXiv paper 2505.16950: Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2505.15516] Explainable embeddings with Distance Explainer

Abstract page for arXiv paper 2505.15516: Explainable embeddings with Distance Explainer

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2502.01521] Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

Abstract page for arXiv paper 2502.01521: Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

arXiv - Machine Learning · 3 min · 17 days ago

Machine Learning

[2409.11847] An efficient wavelet-based physics-informed neural network for multiscale problems

Abstract page for arXiv paper 2409.11847: An efficient wavelet-based physics-informed neural network for multiscale problems

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2406.01969] Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

Abstract page for arXiv paper 2406.01969: Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2210.11039] Entire Space Counterfactual Learning for Reliable Content Recommendations

Abstract page for arXiv paper 2210.11039: Entire Space Counterfactual Learning for Reliable Content Recommendations

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2603.24567] Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

Abstract page for arXiv paper 2603.24567: Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

arXiv - Machine Learning · 3 min · 17 days ago

Machine Learning

[2603.24481] Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

Abstract page for arXiv paper 2603.24481: Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical...

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2603.24436] Enes Causal Discovery

Abstract page for arXiv paper 2603.24436: Enes Causal Discovery

arXiv - AI · 3 min · 17 days ago

Llms

[2603.24472] Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Abstract page for arXiv paper 2603.24472: Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

arXiv - Machine Learning · 3 min · 17 days ago

Machine Learning

[2603.24477] Composer 2 Technical Report

Abstract page for arXiv paper 2603.24477: Composer 2 Technical Report

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2603.24400] Neural Network Models for Contextual Regression

Abstract page for arXiv paper 2603.24400: Neural Network Models for Contextual Regression

arXiv - Machine Learning · 3 min · 17 days ago

Machine Learning

[2603.24396] Exploring How Fair Model Representations Relate to Fair Recommendations

Abstract page for arXiv paper 2603.24396: Exploring How Fair Model Representations Relate to Fair Recommendations

arXiv - Machine Learning · 3 min · 17 days ago

Previous Page 176 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP

Claude cannot be trusted to perform complex engineering tasks

Training an AI to play Resident Evil Requiem using Behavior Cloning + HG-DAgge [P]

All Content

[2508.02330] A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

[2507.21037] When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject Motor Imagery Decoding

[2507.07580] COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation

[2506.06482] TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

[2506.06303] Reward Is Enough: LLMs Are In-Context Reinforcement Learners

[2506.04831] EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health Records

[2505.22785] Navigating the Latent Space Dynamics of Neural Models

[2505.16950] Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

[2505.15516] Explainable embeddings with Distance Explainer

[2502.01521] Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

[2409.11847] An efficient wavelet-based physics-informed neural network for multiscale problems

[2406.01969] Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

[2210.11039] Entire Space Counterfactual Learning for Reliable Content Recommendations

[2603.24567] Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

[2603.24481] Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

[2603.24436] Enes Causal Discovery

[2603.24472] Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

[2603.24477] Composer 2 Technical Report

[2603.24400] Neural Network Models for Contextual Regression

[2603.24396] Exploring How Fair Model Representations Relate to Fair Recommendations

Related Topics

Stay updated with AI News