Machine Learning

ML algorithms, training, and inference


Top This Week

Machine Learning

[D] ICML Reviewer Acknowledgement

Hi, I'm a little confused about the ICML discussion period. Has the period for reviewers to acknowledge responses already ended? One of th...

Reddit - Machine Learning · 1 min · about 1 hour ago

LLMs

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone, I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Machine Learning

[D] ICML reviewer making up false claim in acknowledgement, what to do?

In a rebuttal acknowledgement we received, the reviewer made up a claim that our method performs worse than baselines with some hyperpara...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

LLMs

[2603.23871] HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

Abstract page for arXiv paper 2603.23871: HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

arXiv - Machine Learning · 3 min · 9 days ago

LLMs

[2603.23867] Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

Abstract page for arXiv paper 2603.23867: Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

arXiv - Machine Learning · 4 min · 9 days ago

Machine Learning

[2603.23862] Deep Convolutional Neural Networks for predicting highest priority functional group in organic molecules

Abstract page for arXiv paper 2603.23862: Deep Convolutional Neural Networks for predicting highest priority functional group in organic ...

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23861] An Invariant Compiler for Neural ODEs in AI-Accelerated Scientific Simulation

Abstract page for arXiv paper 2603.23861: An Invariant Compiler for Neural ODEs in AI-Accelerated Scientific Simulation

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23860] Why the Maximum Second Derivative of Activations Matters for Adversarial Robustness

Abstract page for arXiv paper 2603.23860: Why the Maximum Second Derivative of Activations Matters for Adversarial Robustness

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23854] Symbolic--KAN: Kolmogorov-Arnold Networks with Discrete Symbolic Structure for Interpretable Learning

Abstract page for arXiv paper 2603.23854: Symbolic--KAN: Kolmogorov-Arnold Networks with Discrete Symbolic Structure for Interpretable Le...

arXiv - Machine Learning · 4 min · 9 days ago

LLMs

[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Abstract page for arXiv paper 2603.23831: Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23823] Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers

Abstract page for arXiv paper 2603.23823: Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transfo...

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23805] Deep Neural Regression Collapse

Abstract page for arXiv paper 2603.23805: Deep Neural Regression Collapse

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23799] Resolving gradient pathology in physics-informed epidemiological models

Abstract page for arXiv paper 2603.23799: Resolving gradient pathology in physics-informed epidemiological models

arXiv - Machine Learning · 4 min · 9 days ago

Machine Learning

[2603.23792] Manifold Generalization Provably Proceeds Memorization in Diffusion Models

Abstract page for arXiv paper 2603.23792: Manifold Generalization Provably Proceeds Memorization in Diffusion Models

arXiv - Machine Learning · 4 min · 9 days ago

Machine Learning

[2603.23784] Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Arithmetic

Abstract page for arXiv paper 2603.23784: Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Ari...

arXiv - Machine Learning · 4 min · 9 days ago

LLMs

[2603.23783] Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

Abstract page for arXiv paper 2603.23783: Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation ...

arXiv - AI · 4 min · 9 days ago

LLMs

[2603.23780] Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

Abstract page for arXiv paper 2603.23780: Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23746] Kronecker-Structured Nonparametric Spatiotemporal Point Processes

Abstract page for arXiv paper 2603.23746: Kronecker-Structured Nonparametric Spatiotemporal Point Processes

arXiv - Machine Learning · 3 min · 9 days ago

Machine Learning

[2603.23719] CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records

Abstract page for arXiv paper 2603.23719: CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electroni...

arXiv - Machine Learning · 4 min · 9 days ago

Machine Learning

[2603.23658] Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection

Abstract page for arXiv paper 2603.23658: Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection

arXiv - Machine Learning · 4 min · 9 days ago

LLMs

[2603.23629] Steering Code LLMs with Activation Directions for Language and Library Control

Abstract page for arXiv paper 2603.23629: Steering Code LLMs with Activation Directions for Language and Library Control

arXiv - Machine Learning · 3 min · 9 days ago

LLMs

[2603.23626] A Theory of LLM Information Susceptibility

Abstract page for arXiv paper 2603.23626: A Theory of LLM Information Susceptibility

arXiv - Machine Learning · 4 min · 9 days ago

Machine Learning

[2603.23584] LineMVGNN: Anti-Money Laundering with Line-Graph-Assisted Multi-View Graph Neural Networks

Abstract page for arXiv paper 2603.23584: LineMVGNN: Anti-Money Laundering with Line-Graph-Assisted Multi-View Graph Neural Networks

arXiv - Machine Learning · 4 min · 9 days ago
