Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Coherence Without Convergence: A New Protocol for Multi-Agent AI

Opening For the past year, most progress in multi-agent AI has followed a familiar pattern: Add more agents. Add more coordination. Watch...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Week 6 AIPass update - answering the top questions from last post (file conflicts, remote models, scale)

Followup to last post with answers to the top questions from the comments. Appreciate everyone who jumped in. The most common one by a mi...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

Honest ChatGPT vs Claude comparison after using both daily for a month

got tired of reading comparisons that were obvisously written by people who tested each tool for 20 minutes so i ran both at $20/month fo...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

All Content

Machine Learning

[2512.03336] Single-Round Scalable Analytic Federated Learning

Abstract page for arXiv paper 2512.03336: Single-Round Scalable Analytic Federated Learning

arXiv - Machine Learning · 3 min · 16 days ago

Machine Learning

[2511.19413] UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

Abstract page for arXiv paper 2511.19413: UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2511.14262] Object-Centric World Models for Causality-Aware Reinforcement Learning

Abstract page for arXiv paper 2511.14262: Object-Centric World Models for Causality-Aware Reinforcement Learning

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2511.04124] Decomposable Neuro Symbolic Regression

Abstract page for arXiv paper 2511.04124: Decomposable Neuro Symbolic Regression

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2510.10102] PANTHER: Generative Pretraining Beyond Language for Sequential User Behavior Modeling

Abstract page for arXiv paper 2510.10102: PANTHER: Generative Pretraining Beyond Language for Sequential User Behavior Modeling

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2510.08992] Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search

Abstract page for arXiv paper 2510.08992: Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2510.06162] TabPFN-Wide: Continued Pre-Training for Extreme Feature Counts

Abstract page for arXiv paper 2510.06162: TabPFN-Wide: Continued Pre-Training for Extreme Feature Counts

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2510.05825] Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling

Abstract page for arXiv paper 2510.05825: Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2510.04618] Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Abstract page for arXiv paper 2510.04618: Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2510.03904] LLM as an Algorithmist: Enhancing Anomaly Detectors via Programmatic Synthesis

Abstract page for arXiv paper 2510.03904: LLM as an Algorithmist: Enhancing Anomaly Detectors via Programmatic Synthesis

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2510.01349] To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking

Abstract page for arXiv paper 2510.01349: To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2509.22381] Enhancing Credit Risk Prediction: A Multi-stage Ensemble Pipeline

Abstract page for arXiv paper 2509.22381: Enhancing Credit Risk Prediction: A Multi-stage Ensemble Pipeline

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2509.19601] Learning Genetic Circuit Modules with Neural Networks: Full Version

Abstract page for arXiv paper 2509.19601: Learning Genetic Circuit Modules with Neural Networks: Full Version

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2509.13007] ReTrack: Data Unlearning in Diffusion Models through Redirecting the Denoising Trajectory

Abstract page for arXiv paper 2509.13007: ReTrack: Data Unlearning in Diffusion Models through Redirecting the Denoising Trajectory

arXiv - Machine Learning · 3 min · 16 days ago

Machine Learning

[2509.17889] GaussianPSL: Soft partitioning for complex PSL problem

Abstract page for arXiv paper 2509.17889: GaussianPSL: Soft partitioning for complex PSL problem

arXiv - Machine Learning · 3 min · 16 days ago

Machine Learning

[2509.12573] No Need for Learning to Defer? A Training Free Deferral Framework to Multiple Experts through Conformal Prediction

Abstract page for arXiv paper 2509.12573: No Need for Learning to Defer? A Training Free Deferral Framework to Multiple Experts through C...

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2509.04959] On the Normalization of Confusion Matrices: Methods and Geometric Interpretations

Abstract page for arXiv paper 2509.04959: On the Normalization of Confusion Matrices: Methods and Geometric Interpretations

arXiv - Machine Learning · 3 min · 16 days ago

Machine Learning

[2509.03417] Initialization Schemes for Kolmogorov-Arnold Networks: An Empirical Study

Abstract page for arXiv paper 2509.03417: Initialization Schemes for Kolmogorov-Arnold Networks: An Empirical Study

arXiv - Machine Learning · 3 min · 16 days ago

Machine Learning

[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

Abstract page for arXiv paper 2508.13773: PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series F...

arXiv - Machine Learning · 3 min · 16 days ago

Llms

[2508.04329] Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

Abstract page for arXiv paper 2508.04329: Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

arXiv - Machine Learning · 4 min · 16 days ago

Previous Page 197 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Coherence Without Convergence: A New Protocol for Multi-Agent AI

Week 6 AIPass update - answering the top questions from last post (file conflicts, remote models, scale)

Honest ChatGPT vs Claude comparison after using both daily for a month

All Content

[2512.03336] Single-Round Scalable Analytic Federated Learning

[2511.19413] UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

[2511.14262] Object-Centric World Models for Causality-Aware Reinforcement Learning

[2511.04124] Decomposable Neuro Symbolic Regression

[2510.10102] PANTHER: Generative Pretraining Beyond Language for Sequential User Behavior Modeling

[2510.08992] Constraints-of-Thought: A Framework for Constrained Reasoning in Language-Model-Guided Search

[2510.06162] TabPFN-Wide: Continued Pre-Training for Extreme Feature Counts

[2510.05825] Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling

[2510.04618] Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

[2510.03904] LLM as an Algorithmist: Enhancing Anomaly Detectors via Programmatic Synthesis

[2510.01349] To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking

[2509.22381] Enhancing Credit Risk Prediction: A Multi-stage Ensemble Pipeline

[2509.19601] Learning Genetic Circuit Modules with Neural Networks: Full Version

[2509.13007] ReTrack: Data Unlearning in Diffusion Models through Redirecting the Denoising Trajectory

[2509.17889] GaussianPSL: Soft partitioning for complex PSL problem

[2509.12573] No Need for Learning to Defer? A Training Free Deferral Framework to Multiple Experts through Conformal Prediction

[2509.04959] On the Normalization of Confusion Matrices: Methods and Geometric Interpretations

[2509.03417] Initialization Schemes for Kolmogorov-Arnold Networks: An Empirical Study

[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

[2508.04329] Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

Related Topics

Stay updated with AI News