Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

A Hackable ML Compiler Stack in 5,000 Lines of Python [P]

Hey r/MachineLearning, The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Trito...

Reddit - Machine Learning · 1 min · 44 minutes ago

Machine Learning

Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge

During the Musk v. Altman trial, while on the stand, Elon Musk said it was “partly” true that xAI had used model distillation of OpenAI’s...

The Verge - AI · 5 min · about 2 hours ago

Llms

Applying Karpathy's autoresearch to a 33M-token public transit dataset (14% improvement, replication notes) [P]

Hello r/MachineLearning! I work in the US transit industry and I went all-in on learning AI & ML a few months ago. When I heard about...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Llms

[2604.04440] Training Transformers in Cosine Coefficient Space

Abstract page for arXiv paper 2604.04440: Training Transformers in Cosine Coefficient Space

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04418] Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Abstract page for arXiv paper 2604.04418: Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04411] Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding

Abstract page for arXiv paper 2604.04411: Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and R...

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04384] Compressible Softmax-Attended Language under Incompressible Attention

Abstract page for arXiv paper 2604.04384: Compressible Softmax-Attended Language under Incompressible Attention

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04359] GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering

Abstract page for arXiv paper 2604.04359: GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.04353] ReFinE: Streamlining UI Mockup Iteration with Research Findings

Abstract page for arXiv paper 2604.04353: ReFinE: Streamlining UI Mockup Iteration with Research Findings

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04306] HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Observation Data

Abstract page for arXiv paper 2604.04306: HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Obser...

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.04299] A Persistent Homology Design Space for 3D Point Cloud Deep Learning

Abstract page for arXiv paper 2604.04299: A Persistent Homology Design Space for 3D Point Cloud Deep Learning

arXiv - AI · 4 min · 24 days ago

Llms

[2604.04289] Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6

Abstract page for arXiv paper 2604.04289: Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6

arXiv - AI · 4 min · 24 days ago

Llms

[2604.04263] Commercial Persuasion in AI-Mediated Conversations

Abstract page for arXiv paper 2604.04263: Commercial Persuasion in AI-Mediated Conversations

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2604.04211] LOCARD: An Agentic Framework for Blockchain Forensics

Abstract page for arXiv paper 2604.04211: LOCARD: An Agentic Framework for Blockchain Forensics

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04172] GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

Abstract page for arXiv paper 2604.04172: GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

arXiv - AI · 4 min · 24 days ago

Llms

[2604.04144] Many Preferences, Few Policies: Towards Scalable Language Model Personalization

Abstract page for arXiv paper 2604.04144: Many Preferences, Few Policies: Towards Scalable Language Model Personalization

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04133] Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks

Abstract page for arXiv paper 2604.04133: Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for ...

arXiv - AI · 4 min · 24 days ago

Llms

[2604.04089] From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body Algorithm Development

Abstract page for arXiv paper 2604.04089: From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body A...

arXiv - AI · 3 min · 24 days ago

Llms

[2604.04064] Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

Abstract page for arXiv paper 2604.04064: Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comp...

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.04078] BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

Abstract page for arXiv paper 2604.04078: BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of ca...

arXiv - AI · 4 min · 24 days ago

Llms

[2604.04060] CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks

Abstract page for arXiv paper 2604.04060: CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks

arXiv - AI · 3 min · 24 days ago

Llms

[2604.03980] Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

Abstract page for arXiv paper 2604.03980: Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2604.03968] TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

Abstract page for arXiv paper 2604.03968: TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

arXiv - AI · 4 min · 24 days ago

Previous Page 296 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

A Hackable ML Compiler Stack in 5,000 Lines of Python [P]

Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge

Applying Karpathy's autoresearch to a 33M-token public transit dataset (14% improvement, replication notes) [P]

All Content

[2604.04440] Training Transformers in Cosine Coefficient Space

[2604.04418] Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

[2604.04411] Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding

[2604.04384] Compressible Softmax-Attended Language under Incompressible Attention

[2604.04359] GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering

[2604.04353] ReFinE: Streamlining UI Mockup Iteration with Research Findings

[2604.04306] HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Observation Data

[2604.04299] A Persistent Homology Design Space for 3D Point Cloud Deep Learning

[2604.04289] Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6

[2604.04263] Commercial Persuasion in AI-Mediated Conversations

[2604.04211] LOCARD: An Agentic Framework for Blockchain Forensics

[2604.04172] GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

[2604.04144] Many Preferences, Few Policies: Towards Scalable Language Model Personalization

[2604.04133] Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks

[2604.04089] From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body Algorithm Development

[2604.04064] Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

[2604.04078] BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

[2604.04060] CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks

[2604.03980] Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

[2604.03968] TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

Related Topics

Stay updated with AI News