Machine Learning

ML algorithms, training, and inference

Top This Week

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations
Llms

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

Abstract page for arXiv paper 2604.01676: GPA: Learning GUI Process Automation from Demonstrations

arXiv - AI · 3 min ·
[2604.01413] Adaptive Stopping for Multi-Turn LLM Reasoning
Llms

[2604.01413] Adaptive Stopping for Multi-Turn LLM Reasoning

Abstract page for arXiv paper 2604.01413: Adaptive Stopping for Multi-Turn LLM Reasoning

arXiv - AI · 4 min ·
[2603.13842] Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving
Machine Learning

[2603.13842] Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving

Abstract page for arXiv paper 2603.13842: Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement L...

arXiv - AI · 4 min ·

All Content

[2603.24648] Energy-Efficient Hierarchical Federated Anomaly Detection for the Internet of Underwater Things via Selective Cooperative Aggregation
Machine Learning

[2603.24648] Energy-Efficient Hierarchical Federated Anomaly Detection for the Internet of Underwater Things via Selective Cooperative Aggregation

Abstract page for arXiv paper 2603.24648: Energy-Efficient Hierarchical Federated Anomaly Detection for the Internet of Underwater Things...

arXiv - Machine Learning · 4 min ·
[2603.24641] Learning Mesh-Free Discrete Differential Operators with Self-Supervised Graph Neural Networks
Machine Learning

[2603.24641] Learning Mesh-Free Discrete Differential Operators with Self-Supervised Graph Neural Networks

Abstract page for arXiv paper 2603.24641: Learning Mesh-Free Discrete Differential Operators with Self-Supervised Graph Neural Networks

arXiv - Machine Learning · 3 min ·
[2603.25328] Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles
Machine Learning

[2603.25328] Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles

Abstract page for arXiv paper 2603.25328: Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automa...

arXiv - AI · 4 min ·
[2603.24647] Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch
Llms

[2603.24647] Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch

Abstract page for arXiv paper 2603.24647: Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch

arXiv - Machine Learning · 4 min ·
[2603.25326] Evaluating Language Models for Harmful Manipulation
Llms

[2603.25326] Evaluating Language Models for Harmful Manipulation

Abstract page for arXiv paper 2603.25326: Evaluating Language Models for Harmful Manipulation

arXiv - AI · 4 min ·
[2603.24644] Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Columns under Transient Operating Conditions
Machine Learning

[2603.24644] Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Columns under Transient Operating Conditions

Abstract page for arXiv paper 2603.24644: Physics-Informed Neural Network Digital Twin for Dynamic Tray-Wise Modeling of Distillation Col...

arXiv - Machine Learning · 4 min ·
[2603.24639] Experiential Reflective Learning for Self-Improving LLM Agents
Llms

[2603.24639] Experiential Reflective Learning for Self-Improving LLM Agents

Abstract page for arXiv paper 2603.24639: Experiential Reflective Learning for Self-Improving LLM Agents

arXiv - AI · 3 min ·
[2603.25284] SliderQuant: Accurate Post-Training Quantization for LLMs
Llms

[2603.25284] SliderQuant: Accurate Post-Training Quantization for LLMs

Abstract page for arXiv paper 2603.25284: SliderQuant: Accurate Post-Training Quantization for LLMs

arXiv - AI · 4 min ·
[2603.25283] A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion
Llms

[2603.25283] A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion

Abstract page for arXiv paper 2603.25283: A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion

arXiv - AI · 3 min ·
[2603.24638] How unconstrained machine-learning models learn physical symmetries
Machine Learning

[2603.24638] How unconstrained machine-learning models learn physical symmetries

Abstract page for arXiv paper 2603.24638: How unconstrained machine-learning models learn physical symmetries

arXiv - Machine Learning · 4 min ·
[2603.25273] Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis
Machine Learning

[2603.25273] Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis

Abstract page for arXiv paper 2603.25273: Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpre...

arXiv - AI · 3 min ·
[2603.25266] Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation
Machine Learning

[2603.25266] Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation

Abstract page for arXiv paper 2603.25266: Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation

arXiv - AI · 3 min ·
[2603.25158] Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
Llms

[2603.25158] Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Abstract page for arXiv paper 2603.25158: Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

arXiv - AI · 4 min ·
[2603.25133] RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following
Llms

[2603.25133] RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

Abstract page for arXiv paper 2603.25133: RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

arXiv - AI · 3 min ·
[2603.25097] ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents
Llms

[2603.25097] ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents

Abstract page for arXiv paper 2603.25097: ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents

arXiv - AI · 4 min ·
[2603.25075] Sparse Visual Thought Circuits in Vision-Language Models
Llms

[2603.25075] Sparse Visual Thought Circuits in Vision-Language Models

Abstract page for arXiv paper 2603.25075: Sparse Visual Thought Circuits in Vision-Language Models

arXiv - AI · 3 min ·
[2603.25046] MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting
Machine Learning

[2603.25046] MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting

Abstract page for arXiv paper 2603.25046: MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting

arXiv - Machine Learning · 4 min ·
[2603.25035] Mechanistically Interpreting Compression in Vision-Language Models
Llms

[2603.25035] Mechanistically Interpreting Compression in Vision-Language Models

Abstract page for arXiv paper 2603.25035: Mechanistically Interpreting Compression in Vision-Language Models

arXiv - AI · 3 min ·
[2603.25031] From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support
Llms

[2603.25031] From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support

Abstract page for arXiv paper 2603.25031: From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support

arXiv - AI · 4 min ·
[2603.25022] A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures
Machine Learning

[2603.25022] A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures

Abstract page for arXiv paper 2603.25022: A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures

arXiv - Machine Learning · 3 min ·
Previous Page 132 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime