Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge

During the Musk v. Altman trial, while on the stand, Elon Musk said it was “partly” true that xAI had used model distillation of OpenAI’s...

The Verge - AI · 5 min · 27 minutes ago

Llms

Applying Karpathy's autoresearch to a 33M-token public transit dataset (14% improvement, replication notes) [P]

Hello r/MachineLearning! I work in the US transit industry and I went all-in on learning AI & ML a few months ago. When I heard about...

Reddit - Machine Learning · 1 min · 27 minutes ago

Machine Learning

Seems ICML is rejecting MANY unanimous positively rated papers [D]

My 4444 (4443 pre-rebuttal) got rejected (as expected). Just copying a reply I wrote a couple of days ago before decisions were out: Ther...

Reddit - Machine Learning · 1 min · 27 minutes ago

All Content

Llms

[2604.04825] Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

Abstract page for arXiv paper 2604.04825: Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04815] LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

Abstract page for arXiv paper 2604.04815: LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04743] Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

Abstract page for arXiv paper 2604.04743: Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04741] Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emerging Evidence

Abstract page for arXiv paper 2604.04741: Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emer...

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04733] Discovering Failure Modes in Vision-Language Models using RL

Abstract page for arXiv paper 2604.04733: Discovering Failure Modes in Vision-Language Models using RL

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04732] Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs

Abstract page for arXiv paper 2604.04732: Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04723] Individual and Combined Effects of English as a Second Language and Typos on LLM Performance

Abstract page for arXiv paper 2604.04723: Individual and Combined Effects of English as a Second Language and Typos on LLM Performance

arXiv - AI · 4 min · 23 days ago

Machine Learning

[2604.04720] What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features

Abstract page for arXiv paper 2604.04720: What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04664] ROSClaw: A Hierarchical Semantic-Physical Framework for Heterogeneous Multi-Agent Collaboration

Abstract page for arXiv paper 2604.04664: ROSClaw: A Hierarchical Semantic-Physical Framework for Heterogeneous Multi-Agent Collaboration

arXiv - AI · 4 min · 23 days ago

Machine Learning

[2604.04646] Training-Free Refinement of Flow Matching with Divergence-based Sampling

Abstract page for arXiv paper 2604.04646: Training-Free Refinement of Flow Matching with Divergence-based Sampling

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2604.04634] Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale

Abstract page for arXiv paper 2604.04634: Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04593] Ruling Out to Rule In: Contrastive Hypothesis Retrieval for Medical Question Answering

Abstract page for arXiv paper 2604.04593: Ruling Out to Rule In: Contrastive Hypothesis Retrieval for Medical Question Answering

arXiv - AI · 4 min · 23 days ago

Llms

[2604.04565] PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning

Abstract page for arXiv paper 2604.04565: PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervi...

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04563] Temporal Inversion for Learning Interval Change in Chest X-Rays

Abstract page for arXiv paper 2604.04563: Temporal Inversion for Learning Interval Change in Chest X-Rays

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04562] Paper Espresso: From Paper Overload to Research Insight

Abstract page for arXiv paper 2604.04562: Paper Espresso: From Paper Overload to Research Insight

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04561] Mapping the Exploitation Surface: A 10,000-Trial Taxonomy of What Makes LLM Agents Exploit Vulnerabilities

Abstract page for arXiv paper 2604.04561: Mapping the Exploitation Surface: A 10,000-Trial Taxonomy of What Makes LLM Agents Exploit Vuln...

arXiv - AI · 4 min · 23 days ago

Machine Learning

[2604.04552] StableTTA: Training-Free Test-Time Adaptation that Improves Model Accuracy on ImageNet1K to 96%

Abstract page for arXiv paper 2604.04552: StableTTA: Training-Free Test-Time Adaptation that Improves Model Accuracy on ImageNet1K to 96%

arXiv - AI · 3 min · 23 days ago

Machine Learning

[2604.04490] RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation

Abstract page for arXiv paper 2604.04490: RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04450] Conversational Control with Ontologies for Large Language Models: A Lightweight Framework for Constrained Generation

Abstract page for arXiv paper 2604.04450: Conversational Control with Ontologies for Large Language Models: A Lightweight Framework for C...

arXiv - AI · 3 min · 23 days ago

Llms

[2604.04440] Training Transformers in Cosine Coefficient Space

Abstract page for arXiv paper 2604.04440: Training Transformers in Cosine Coefficient Space

arXiv - AI · 3 min · 23 days ago

Previous Page 295 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge

Applying Karpathy's autoresearch to a 33M-token public transit dataset (14% improvement, replication notes) [P]

Seems ICML is rejecting MANY unanimous positively rated papers [D]

All Content

[2604.04825] Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

[2604.04815] LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

[2604.04743] Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations

[2604.04741] Artificial Intelligence and Cost Reduction in Public Higher Education: A Scoping Review of Emerging Evidence

[2604.04733] Discovering Failure Modes in Vision-Language Models using RL

[2604.04732] Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs

[2604.04723] Individual and Combined Effects of English as a Second Language and Typos on LLM Performance

[2604.04720] What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features

[2604.04664] ROSClaw: A Hierarchical Semantic-Physical Framework for Heterogeneous Multi-Agent Collaboration

[2604.04646] Training-Free Refinement of Flow Matching with Divergence-based Sampling

[2604.04634] Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale

[2604.04593] Ruling Out to Rule In: Contrastive Hypothesis Retrieval for Medical Question Answering

[2604.04565] PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning

[2604.04563] Temporal Inversion for Learning Interval Change in Chest X-Rays

[2604.04562] Paper Espresso: From Paper Overload to Research Insight

[2604.04561] Mapping the Exploitation Surface: A 10,000-Trial Taxonomy of What Makes LLM Agents Exploit Vulnerabilities

[2604.04552] StableTTA: Training-Free Test-Time Adaptation that Improves Model Accuracy on ImageNet1K to 96%

[2604.04490] RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation

[2604.04450] Conversational Control with Ontologies for Large Language Models: A Lightweight Framework for Constrained Generation

[2604.04440] Training Transformers in Cosine Coefficient Space

Related Topics

Stay updated with AI News