Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Machine Learning

Elon Musk testifies that xAI trained Grok on OpenAI models | TechCrunch

"Distillation" is a hot topic as frontier labs try to prevent smaller competitors from copying their models.

TechCrunch - AI · 4 min · 3 minutes ago

Llms

This startup’s new mechanistic interpretability tool lets you debug LLMs | MIT Technology Review

Goodfire wants to make training AI models more like good old-fashioned software engineering.

MIT Technology Review · 7 min · about 2 hours ago

Machine Learning

[R] Joint Embedding Variational Bayes (TMLR ’26)

Disclosure: first author. The paper was just published in TMLR, and I figured it might be of interest to some people here. It is fairly d...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

Llms

[2601.05656] HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation

Abstract page for arXiv paper 2601.05656: HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2512.13168] Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Abstract page for arXiv paper 2512.13168: Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

arXiv - AI · 4 min · 23 days ago

Llms

[2511.14130] PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval

Abstract page for arXiv paper 2511.14130: PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval

arXiv - AI · 4 min · 23 days ago

Llms

[2510.09901] Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics

Abstract page for arXiv paper 2510.09901: Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2508.02900] Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game

Abstract page for arXiv paper 2508.02900: Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game

arXiv - AI · 4 min · 23 days ago

Llms

[2502.13388] Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

Abstract page for arXiv paper 2502.13388: Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2411.06498] Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible

Abstract page for arXiv paper 2411.06498: Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2604.04924] Your Pre-trained Diffusion Model Secretly Knows Restoration

Abstract page for arXiv paper 2604.04924: Your Pre-trained Diffusion Model Secretly Knows Restoration

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2604.04906] How AI Aggregation Affects Knowledge

Abstract page for arXiv paper 2604.04906: How AI Aggregation Affects Knowledge

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04917] Vero: An Open RL Recipe for General Visual Reasoning

Abstract page for arXiv paper 2604.04917: Vero: An Open RL Recipe for General Visual Reasoning

arXiv - AI · 4 min · 23 days ago

Machine Learning

[2604.04895] Agentic Federated Learning: The Future of Distributed Training Orchestration

Abstract page for arXiv paper 2604.04895: Agentic Federated Learning: The Future of Distributed Training Orchestration

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2604.04901] FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Abstract page for arXiv paper 2604.04901: FileGram: Grounding Agent Personalization in File-System Behavioral Traces

arXiv - AI · 4 min · 23 days ago

Machine Learning

[2604.04891] Muon Dynamics as a Spectral Wasserstein Flow

Abstract page for arXiv paper 2604.04891: Muon Dynamics as a Spectral Wasserstein Flow

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04852] Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework

Abstract page for arXiv paper 2604.04852: Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Promp...

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04825] Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

Abstract page for arXiv paper 2604.04825: Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04815] LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

Abstract page for arXiv paper 2604.04815: LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04743] Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

Abstract page for arXiv paper 2604.04743: Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04741] Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emerging Evidence

Abstract page for arXiv paper 2604.04741: Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emer...

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04733] Discovering Failure Modes in Vision-Language Models using RL

Abstract page for arXiv paper 2604.04733: Discovering Failure Modes in Vision-Language Models using RL

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04732] Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs

Abstract page for arXiv paper 2604.04732: Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs

arXiv - AI · 3 min · 23 days ago

Previous Page 294 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Elon Musk testifies that xAI trained Grok on OpenAI models | TechCrunch

This startup’s new mechanistic interpretability tool lets you debug LLMs | MIT Technology Review

[R] Joint Embedding Variational Bayes (TMLR ’26)

All Content

[2601.05656] HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation

[2512.13168] Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

[2511.14130] PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval

[2510.09901] Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics

[2508.02900] Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game

[2502.13388] Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

[2411.06498] Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible

[2604.04924] Your Pre-trained Diffusion Model Secretly Knows Restoration

[2604.04906] How AI Aggregation Affects Knowledge

[2604.04917] Vero: An Open RL Recipe for General Visual Reasoning

[2604.04895] Agentic Federated Learning: The Future of Distributed Training Orchestration

[2604.04901] FileGram: Grounding Agent Personalization in File-System Behavioral Traces

[2604.04891] Muon Dynamics as a Spectral Wasserstein Flow

[2604.04852] Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework

[2604.04825] Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

[2604.04815] LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

[2604.04743] Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

[2604.04741] Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emerging Evidence

[2604.04733] Discovering Failure Modes in Vision-Language Models using RL

[2604.04732] Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs

Related Topics

Stay updated with AI News