A Hackable ML Compiler Stack in 5,000 Lines of Python [P]
Hey r/MachineLearning, The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Trito...
ML algorithms, training, and inference
Hey r/MachineLearning, The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Trito...
During the Musk v. Altman trial, while on the stand, Elon Musk said it was “partly” true that xAI had used model distillation of OpenAI’s...
Hello r/MachineLearning! I work in the US transit industry and I went all-in on learning AI & ML a few months ago. When I heard about...
Abstract page for arXiv paper 2604.04440: Training Transformers in Cosine Coefficient Space
Abstract page for arXiv paper 2604.04418: Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality
Abstract page for arXiv paper 2604.04411: Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and R...
Abstract page for arXiv paper 2604.04384: Compressible Softmax-Attended Language under Incompressible Attention
Abstract page for arXiv paper 2604.04359: GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering
Abstract page for arXiv paper 2604.04353: ReFinE: Streamlining UI Mockup Iteration with Research Findings
Abstract page for arXiv paper 2604.04306: HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Obser...
Abstract page for arXiv paper 2604.04299: A Persistent Homology Design Space for 3D Point Cloud Deep Learning
Abstract page for arXiv paper 2604.04289: Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
Abstract page for arXiv paper 2604.04263: Commercial Persuasion in AI-Mediated Conversations
Abstract page for arXiv paper 2604.04211: LOCARD: An Agentic Framework for Blockchain Forensics
Abstract page for arXiv paper 2604.04172: GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models
Abstract page for arXiv paper 2604.04144: Many Preferences, Few Policies: Towards Scalable Language Model Personalization
Abstract page for arXiv paper 2604.04133: Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for ...
Abstract page for arXiv paper 2604.04089: From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body A...
Abstract page for arXiv paper 2604.04064: Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comp...
Abstract page for arXiv paper 2604.04078: BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of ca...
Abstract page for arXiv paper 2604.04060: CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks
Abstract page for arXiv paper 2604.03980: Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics
Abstract page for arXiv paper 2604.03968: TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime