Machine Learning

ML algorithms, training, and inference


Top This Week

LLMs

[P] I built an autonomous ML agent that runs experiments on tabular data indefinitely - inspired by Karpathy's AutoResearch

Inspired by Andrej Karpathy's AutoResearch, I built a system where Claude Code acts as an autonomous ML researcher on tabular binary clas...

Reddit - Machine Learning · 1 min · about 2 hours ago

Machine Learning

[D] Data curation and targeted replacement as a pre-training alignment and controllability method

Hi, r/MachineLearning: has much research been done in large-scale training scenarios where undesirable data has been replaced before trai...

Reddit - Machine Learning · 1 min · about 2 hours ago

LLMs

[R] BraiNN: An Experimental Neural Architecture with Working Memory, Relational Reasoning, and Adaptive Learning

BraiNN is a compact research‑...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

LLMs

[2603.25040] Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Abstract page for arXiv paper 2603.25040: Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

arXiv - Machine Learning · 5 min · 3 days ago

LLMs

[2603.24601] FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Human Context Recognition

Abstract page for arXiv paper 2603.24601: FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Hum...

arXiv - Machine Learning · 3 min · 3 days ago

Machine Learning

[2603.24602] MuViS: Multimodal Virtual Sensing Benchmark

Abstract page for arXiv paper 2603.24602: MuViS: Multimodal Virtual Sensing Benchmark

arXiv - AI · 3 min · 3 days ago

LLMs

[2603.25033] Epistemic Compression: The Case for Deliberate Ignorance in High-Stakes AI

Abstract page for arXiv paper 2603.25033: Epistemic Compression: The Case for Deliberate Ignorance in High-Stakes AI

arXiv - Machine Learning · 3 min · 3 days ago

Machine Learning

[2603.24599] A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications

Abstract page for arXiv paper 2603.24599: A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications

arXiv - AI · 3 min · 3 days ago

LLMs

[2603.24596] X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs

Abstract page for arXiv paper 2603.24596: X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs

arXiv - AI · 3 min · 3 days ago

Machine Learning

[2603.25009] A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

Abstract page for arXiv paper 2603.25009: A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

arXiv - Machine Learning · 4 min · 3 days ago

LLMs

[2603.24595] Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

Abstract page for arXiv paper 2603.24595: Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2402.05122] History of generative Artificial Intelligence (AI) chatbots: past, present, and future development

Abstract page for arXiv paper 2402.05122: History of generative Artificial Intelligence (AI) chatbots: past, present, and future development

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.25737] Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

Abstract page for arXiv paper 2603.25737: Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

arXiv - AI · 3 min · 3 days ago

Machine Learning

[2603.24916] Once-for-All Channel Mixers (HYPERTINYPW): Generative Compression for TinyML

Abstract page for arXiv paper 2603.24916: Once-for-All Channel Mixers (HYPERTINYPW): Generative Compression for TinyML

arXiv - Machine Learning · 4 min · 3 days ago

LLMs

[2603.24883] Learning to Staff: Offline Reinforcement Learning and Fine-Tuned LLMs for Warehouse Staffing Optimization

Abstract page for arXiv paper 2603.24883: Learning to Staff: Offline Reinforcement Learning and Fine-Tuned LLMs for Warehouse Staffing Op...

arXiv - Machine Learning · 4 min · 3 days ago

Machine Learning

[2603.25720] R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning

Abstract page for arXiv paper 2603.25720: R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning

arXiv - AI · 3 min · 3 days ago

LLMs

[2603.24844] Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Abstract page for arXiv paper 2603.24844: Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.25719] Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?

Abstract page for arXiv paper 2603.25719: Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardw...

arXiv - Machine Learning · 4 min · 3 days ago

Machine Learning

[2603.25551] Voxtral TTS

Abstract page for arXiv paper 2603.25551: Voxtral TTS

arXiv - AI · 5 min · 3 days ago

LLMs

[2603.25633] Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?

Abstract page for arXiv paper 2603.25633: Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment P...

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.24828] A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproducibility Study

Abstract page for arXiv paper 2603.24828: A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproduci...

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.25415] Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation

Abstract page for arXiv paper 2603.25415: Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.24790] Local learning for stable backpropagation-free neural network training towards physical learning

Abstract page for arXiv paper 2603.24790: Local learning for stable backpropagation-free neural network training towards physical learning

arXiv - Machine Learning · 3 min · 3 days ago

