Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

A Hackable ML Compiler Stack in 5,000 Lines of Python [P]

Hey r/MachineLearning, The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Trito...

Reddit - Machine Learning · 1 min ·
Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge
Machine Learning

Elon Musk confirms xAI used OpenAI’s models to train Grok | The Verge

During the Musk v. Altman trial, while on the stand, Elon Musk said it was “partly” true that xAI had used model distillation of OpenAI’s...

The Verge - AI · 5 min ·
Llms

Applying Karpathy's autoresearch to a 33M-token public transit dataset (14% improvement, replication notes) [P]

Hello r/MachineLearning! I work in the US transit industry and I went all-in on learning AI & ML a few months ago. When I heard about...

Reddit - Machine Learning · 1 min ·

All Content

[2604.04440] Training Transformers in Cosine Coefficient Space
Llms

[2604.04440] Training Transformers in Cosine Coefficient Space

Abstract page for arXiv paper 2604.04440: Training Transformers in Cosine Coefficient Space

arXiv - AI · 3 min ·
[2604.04418] Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality
Llms

[2604.04418] Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Abstract page for arXiv paper 2604.04418: Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

arXiv - AI · 3 min ·
[2604.04411] Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding
Llms

[2604.04411] Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding

Abstract page for arXiv paper 2604.04411: Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and R...

arXiv - AI · 3 min ·
[2604.04384] Compressible Softmax-Attended Language under Incompressible Attention
Llms

[2604.04384] Compressible Softmax-Attended Language under Incompressible Attention

Abstract page for arXiv paper 2604.04384: Compressible Softmax-Attended Language under Incompressible Attention

arXiv - AI · 3 min ·
[2604.04359] GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering
Llms

[2604.04359] GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering

Abstract page for arXiv paper 2604.04359: GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering

arXiv - AI · 4 min ·
[2604.04353] ReFinE: Streamlining UI Mockup Iteration with Research Findings
Machine Learning

[2604.04353] ReFinE: Streamlining UI Mockup Iteration with Research Findings

Abstract page for arXiv paper 2604.04353: ReFinE: Streamlining UI Mockup Iteration with Research Findings

arXiv - AI · 3 min ·
[2604.04306] HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Observation Data
Llms

[2604.04306] HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Observation Data

Abstract page for arXiv paper 2604.04306: HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Obser...

arXiv - AI · 4 min ·
[2604.04299] A Persistent Homology Design Space for 3D Point Cloud Deep Learning
Machine Learning

[2604.04299] A Persistent Homology Design Space for 3D Point Cloud Deep Learning

Abstract page for arXiv paper 2604.04299: A Persistent Homology Design Space for 3D Point Cloud Deep Learning

arXiv - AI · 4 min ·
[2604.04289] Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
Llms

[2604.04289] Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6

Abstract page for arXiv paper 2604.04289: Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6

arXiv - AI · 4 min ·
[2604.04263] Commercial Persuasion in AI-Mediated Conversations
Llms

[2604.04263] Commercial Persuasion in AI-Mediated Conversations

Abstract page for arXiv paper 2604.04263: Commercial Persuasion in AI-Mediated Conversations

arXiv - AI · 3 min ·
[2604.04211] LOCARD: An Agentic Framework for Blockchain Forensics
Machine Learning

[2604.04211] LOCARD: An Agentic Framework for Blockchain Forensics

Abstract page for arXiv paper 2604.04211: LOCARD: An Agentic Framework for Blockchain Forensics

arXiv - AI · 3 min ·
[2604.04172] GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models
Llms

[2604.04172] GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

Abstract page for arXiv paper 2604.04172: GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

arXiv - AI · 4 min ·
[2604.04144] Many Preferences, Few Policies: Towards Scalable Language Model Personalization
Llms

[2604.04144] Many Preferences, Few Policies: Towards Scalable Language Model Personalization

Abstract page for arXiv paper 2604.04144: Many Preferences, Few Policies: Towards Scalable Language Model Personalization

arXiv - AI · 3 min ·
[2604.04133] Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks
Llms

[2604.04133] Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks

Abstract page for arXiv paper 2604.04133: Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for ...

arXiv - AI · 4 min ·
[2604.04089] From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body Algorithm Development
Llms

[2604.04089] From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body Algorithm Development

Abstract page for arXiv paper 2604.04089: From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body A...

arXiv - AI · 3 min ·
[2604.04064] Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison
Llms

[2604.04064] Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

Abstract page for arXiv paper 2604.04064: Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comp...

arXiv - AI · 4 min ·
[2604.04078] BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging
Machine Learning

[2604.04078] BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

Abstract page for arXiv paper 2604.04078: BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of ca...

arXiv - AI · 4 min ·
[2604.04060] CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks
Llms

[2604.04060] CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks

Abstract page for arXiv paper 2604.04060: CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks

arXiv - AI · 3 min ·
[2604.03980] Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics
Llms

[2604.03980] Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

Abstract page for arXiv paper 2604.03980: Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

arXiv - AI · 3 min ·
[2604.03968] TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol
Machine Learning

[2604.03968] TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

Abstract page for arXiv paper 2604.03968: TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

arXiv - AI · 4 min ·
Previous Page 296 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime