Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO [P]

So, a few days back I shared a post where I trained a tiny Qwen2.5-0.5B-Instruct model on smoltldr (Reddit post summarization dataset of ...

Reddit - Machine Learning · 1 min · 23 minutes ago

Machine Learning

Mark Zuckerberg is reportedly building an AI clone to replace him in meetings | The Verge

Meta is working to build an AI version of its CEO Mark Zuckerberg, which he will use to interact with employees, according to a report fr...

The Verge - AI · 4 min · 23 minutes ago

Machine Learning

When the Mirror Turns: How AI alignment reshapes the voice inside your head

We build our inner voices from the voices we're in dialogue with. Vygotsky established this nearly a century ago. For people in sustained...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

All Content

Machine Learning

[2603.25328] Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles

Abstract page for arXiv paper 2603.25328: Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automa...

arXiv - AI · 4 min · 17 days ago

LLMs

[2603.24647] Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch

Abstract page for arXiv paper 2603.24647: Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch

arXiv - Machine Learning · 4 min · 17 days ago

LLMs

[2603.25326] Evaluating Language Models for Harmful Manipulation

Abstract page for arXiv paper 2603.25326: Evaluating Language Models for Harmful Manipulation

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2603.24644] Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Columns under Transient Operating Conditions

Abstract page for arXiv paper 2603.24644: Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Col...

arXiv - Machine Learning · 4 min · 17 days ago

LLMs

[2603.24639] Experiential Reflective Learning for Self-Improving LLM Agents

Abstract page for arXiv paper 2603.24639: Experiential Reflective Learning for Self-Improving LLM Agents

arXiv - AI · 3 min · 17 days ago

LLMs

[2603.25284] SliderQuant: Accurate Post-Training Quantization for LLMs

Abstract page for arXiv paper 2603.25284: SliderQuant: Accurate Post-Training Quantization for LLMs

arXiv - AI · 4 min · 17 days ago

LLMs

[2603.25283] A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion

Abstract page for arXiv paper 2603.25283: A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.24638] How unconstrained machine-learning models learn physical symmetries

Abstract page for arXiv paper 2603.24638: How unconstrained machine-learning models learn physical symmetries

arXiv - Machine Learning · 4 min · 17 days ago

Machine Learning

[2603.25273] Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis

Abstract page for arXiv paper 2603.25273: Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpre...

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.25266] Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation

Abstract page for arXiv paper 2603.25266: Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation

arXiv - AI · 3 min · 17 days ago

LLMs

[2603.25158] Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Abstract page for arXiv paper 2603.25158: Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

arXiv - AI · 4 min · 17 days ago

LLMs

[2603.25133] RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

Abstract page for arXiv paper 2603.25133: RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

arXiv - AI · 3 min · 17 days ago

LLMs

[2603.25097] ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents

Abstract page for arXiv paper 2603.25097: ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents

arXiv - AI · 4 min · 17 days ago

LLMs

[2603.25075] Sparse Visual Thought Circuits in Vision-Language Models

Abstract page for arXiv paper 2603.25075: Sparse Visual Thought Circuits in Vision-Language Models

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.25046] MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting

Abstract page for arXiv paper 2603.25046: MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting

arXiv - Machine Learning · 4 min · 17 days ago

LLMs

[2603.25035] Mechanistically Interpreting Compression in Vision-Language Models

Abstract page for arXiv paper 2603.25035: Mechanistically Interpreting Compression in Vision-Language Models

arXiv - AI · 3 min · 17 days ago

LLMs

[2603.25031] From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support

Abstract page for arXiv paper 2603.25031: From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2603.25022] A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures

Abstract page for arXiv paper 2603.25022: A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures

arXiv - Machine Learning · 3 min · 17 days ago

LLMs

[2603.24967] The Anatomy of Uncertainty in LLMs

Abstract page for arXiv paper 2603.24967: The Anatomy of Uncertainty in LLMs

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.24963] Design Once, Deploy at Scale: Template-Driven ML Development for Large Model Ecosystems

Abstract page for arXiv paper 2603.24963: Design Once, Deploy at Scale: Template-Driven ML Development for Large Model Ecosystems

arXiv - Machine Learning · 4 min · 17 days ago
