Machine Learning

ML algorithms, training, and inference

Top This Week

LLMs

Looking to build a production-level AI/ML project (agentic systems), need guidance on what to build

Hi everyone, I’m a final-year undergraduate AI/ML student currently focusing on applied AI / agentic systems. So far, I’ve spent time und...

Reddit - ML Jobs · 1 min · 35 minutes ago

Machine Learning

Meta is reentering the AI race with a new model called Muse Spark | The Verge

Meta Superintelligence Labs has unveiled a new AI model called Muse Spark that will soon roll out across apps like Instagram and Facebook.

The Verge - AI · 5 min · 35 minutes ago

LLMs

[P] Building a LLM from scratch with Mary Shelley's "Frankenstein" (on Kaggle)

Notebook on GitHub: https://github.com/Buzzpy/Python-Machine-Learning-Models/blob/main/Frankenstein/train-frankenstein.ipynb submitted by...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Machine Learning

[2603.25009] A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

Abstract page for arXiv paper 2603.25009: A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

arXiv - Machine Learning · 4 min · 12 days ago

LLMs

[2603.24595] Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

Abstract page for arXiv paper 2603.24595: Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2402.05122] History of generative Artificial Intelligence (AI) chatbots: past, present, and future development

Abstract page for arXiv paper 2402.05122: History of generative Artificial Intelligence (AI) chatbots: past, present, and future development

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.25737] Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

Abstract page for arXiv paper 2603.25737: Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

arXiv - AI · 3 min · 12 days ago

Machine Learning

[2603.24916] Once-for-All Channel Mixers (HYPERTINYPW): Generative Compression for TinyML

Abstract page for arXiv paper 2603.24916: Once-for-All Channel Mixers (HYPERTINYPW): Generative Compression for TinyML

arXiv - Machine Learning · 4 min · 12 days ago

LLMs

[2603.24883] Learning to Staff: Offline Reinforcement Learning and Fine-Tuned LLMs for Warehouse Staffing Optimization

Abstract page for arXiv paper 2603.24883: Learning to Staff: Offline Reinforcement Learning and Fine-Tuned LLMs for Warehouse Staffing Op...

arXiv - Machine Learning · 4 min · 12 days ago

Machine Learning

[2603.25720] R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning

Abstract page for arXiv paper 2603.25720: R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning

arXiv - AI · 3 min · 12 days ago

LLMs

[2603.24844] Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Abstract page for arXiv paper 2603.24844: Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.25719] Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?

Abstract page for arXiv paper 2603.25719: Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardw...

arXiv - Machine Learning · 4 min · 12 days ago

Machine Learning

[2603.25551] Voxtral TTS

Abstract page for arXiv paper 2603.25551: Voxtral TTS

arXiv - AI · 5 min · 12 days ago

LLMs

[2603.25633] Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?

Abstract page for arXiv paper 2603.25633: Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment P...

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.24828] A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproducibility Study

Abstract page for arXiv paper 2603.24828: A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproduci...

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.25415] Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation

Abstract page for arXiv paper 2603.25415: Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.24790] Local learning for stable backpropagation-free neural network training towards physical learning

Abstract page for arXiv paper 2603.24790: Local learning for stable backpropagation-free neural network training towards physical learning

arXiv - Machine Learning · 3 min · 12 days ago

LLMs

[2603.25498] EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents

Abstract page for arXiv paper 2603.25498: EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents

arXiv - AI · 3 min · 12 days ago

LLMs

[2603.24780] Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback

Abstract page for arXiv paper 2603.24780: Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback

arXiv - Machine Learning · 4 min · 12 days ago

Machine Learning

[2603.25480] Retraining as Approximate Bayesian Inference

Abstract page for arXiv paper 2603.25480: Retraining as Approximate Bayesian Inference

arXiv - AI · 3 min · 12 days ago

Machine Learning

[2603.24753] Light Cones For Vision: Simple Causal Priors For Visual Hierarchy

Abstract page for arXiv paper 2603.24753: Light Cones For Vision: Simple Causal Priors For Visual Hierarchy

arXiv - Machine Learning · 3 min · 12 days ago

LLMs

[2603.25450] Cross-Model Disagreement as a Label-Free Correctness Signal

Abstract page for arXiv paper 2603.25450: Cross-Model Disagreement as a Label-Free Correctness Signal

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.24744] Contrastive Learning Boosts Deterministic and Generative Models for Weather Data

Abstract page for arXiv paper 2603.24744: Contrastive Learning Boosts Deterministic and Generative Models for Weather Data

arXiv - Machine Learning · 4 min · 12 days ago
