Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

Claude on Claude

The Story of Anthropic’s Latest Controversies Regarding the Business of Its Prized Creation… As Told by the Thing Itself. Editor’s note: ...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

This OpenClaw paper shows why agent safety is an execution problem, not just a model problem

Paper: https://arxiv.org/abs/2604.04759 This OpenClaw paper is one of the clearest signals so far that agent risk is architectural, not j...

Reddit - Artificial Intelligence · 1 min ·
Llms

"Authoritarian Parents In Rationalist Clothes": a piece I wrote in December about alignment

Posted today in light of the Claude Mythos model card release. Originally I wrote this for r/ControlProblem but realized it was getting o...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.23539] PLDR-LLMs Reason At Self-Organized Criticality
Llms

[2603.23539] PLDR-LLMs Reason At Self-Organized Criticality

Abstract page for arXiv paper 2603.23539: PLDR-LLMs Reason At Self-Organized Criticality

arXiv - Machine Learning · 3 min ·
[2603.23534] Not All Pretraining are Created Equal: Threshold Tuning and Class Weighting for Imbalanced Polarization Tasks in Low-Resource Settings
Machine Learning

[2603.23534] Not All Pretraining are Created Equal: Threshold Tuning and Class Weighting for Imbalanced Polarization Tasks in Low-Resource Settings

Abstract page for arXiv paper 2603.23534: Not All Pretraining are Created Equal: Threshold Tuning and Class Weighting for Imbalanced Pola...

arXiv - Machine Learning · 3 min ·
[2603.23530] Did You Forget What I Asked? Prospective Memory Failures in Large Language Models
Llms

[2603.23530] Did You Forget What I Asked? Prospective Memory Failures in Large Language Models

Abstract page for arXiv paper 2603.23530: Did You Forget What I Asked? Prospective Memory Failures in Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.23514] DepthCharge: A Domain-Agnostic Framework for Measuring Depth-Dependent Knowledge in Large Language Models
Llms

[2603.23514] DepthCharge: A Domain-Agnostic Framework for Measuring Depth-Dependent Knowledge in Large Language Models

Abstract page for arXiv paper 2603.23514: DepthCharge: A Domain-Agnostic Framework for Measuring Depth-Dependent Knowledge in Large Langu...

arXiv - Machine Learning · 4 min ·
[2603.23507] Beyond Masks: Efficient, Flexible Diffusion Language Models via Deletion-Insertion Processes
Llms

[2603.23507] Beyond Masks: Efficient, Flexible Diffusion Language Models via Deletion-Insertion Processes

Abstract page for arXiv paper 2603.23507: Beyond Masks: Efficient, Flexible Diffusion Language Models via Deletion-Insertion Processes

arXiv - Machine Learning · 4 min ·
[2603.24594] Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method
Machine Learning

[2603.24594] Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method

Abstract page for arXiv paper 2603.24594: Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method

arXiv - Machine Learning · 4 min ·
[2603.24587] DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving
Machine Learning

[2603.24587] DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving

Abstract page for arXiv paper 2603.24587: DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving

arXiv - Machine Learning · 3 min ·
[2603.24562] Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction
Llms

[2603.24562] Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction

Abstract page for arXiv paper 2603.24562: Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction

arXiv - Machine Learning · 4 min ·
[2603.24533] UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
Llms

[2603.24533] UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Abstract page for arXiv paper 2603.24533: UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

arXiv - Machine Learning · 4 min ·
[2603.24524] No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions
Machine Learning

[2603.24524] No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions

Abstract page for arXiv paper 2603.24524: No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertaint...

arXiv - Machine Learning · 4 min ·
[2603.24518] TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models
Llms

[2603.24518] TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models

Abstract page for arXiv paper 2603.24518: TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models

arXiv - Machine Learning · 4 min ·
[2603.24517] AVO: Agentic Variation Operators for Autonomous Evolutionary Search
Llms

[2603.24517] AVO: Agentic Variation Operators for Autonomous Evolutionary Search

Abstract page for arXiv paper 2603.24517: AVO: Agentic Variation Operators for Autonomous Evolutionary Search

arXiv - Machine Learning · 4 min ·
[2603.24503] Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Network Modeling
Machine Learning

[2603.24503] Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Network Modeling

Abstract page for arXiv paper 2603.24503: Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Networ...

arXiv - Machine Learning · 3 min ·
[2603.24500] Project and Generate: Divergence-Free Neural Operators for Incompressible Flows
Machine Learning

[2603.24500] Project and Generate: Divergence-Free Neural Operators for Incompressible Flows

Abstract page for arXiv paper 2603.24500: Project and Generate: Divergence-Free Neural Operators for Incompressible Flows

arXiv - Machine Learning · 3 min ·
[2603.24475] Conformalized Transfer Learning for Li-ion Battery State of Health Forecasting under Manufacturing and Usage Variability
Machine Learning

[2603.24475] Conformalized Transfer Learning for Li-ion Battery State of Health Forecasting under Manufacturing and Usage Variability

Abstract page for arXiv paper 2603.24475: Conformalized Transfer Learning for Li-ion Battery State of Health Forecasting under Manufactur...

arXiv - Machine Learning · 3 min ·
[2603.24431] Learning Response-Statistic Shifts and Parametric Roll Episodes from Wave--Vessel Time Series via LSTM Functional Models
Machine Learning

[2603.24431] Learning Response-Statistic Shifts and Parametric Roll Episodes from Wave--Vessel Time Series via LSTM Functional Models

Abstract page for arXiv paper 2603.24431: Learning Response-Statistic Shifts and Parametric Roll Episodes from Wave--Vessel Time Series v...

arXiv - Machine Learning · 4 min ·
[2603.24428] Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching
Machine Learning

[2603.24428] Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching

Abstract page for arXiv paper 2603.24428: Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Ma...

arXiv - Machine Learning · 4 min ·
[2603.24384] On the Use of Bagging for Local Intrinsic Dimensionality Estimation
Machine Learning

[2603.24384] On the Use of Bagging for Local Intrinsic Dimensionality Estimation

Abstract page for arXiv paper 2603.24384: On the Use of Bagging for Local Intrinsic Dimensionality Estimation

arXiv - Machine Learning · 4 min ·
[2603.24382] MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization
Llms

[2603.24382] MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization

Abstract page for arXiv paper 2603.24382: MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization

arXiv - Machine Learning · 3 min ·
[2603.24324] Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning
Llms

[2603.24324] Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2603.24324: Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforce...

arXiv - AI · 4 min ·
Previous Page 141 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime