Started a video series on building an orchestration layer for LLM post-training [P]
Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...
GPT, Claude, Gemini, and other LLMs
Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...
OpenAI announced on Thursday something that power users have been asking for: a $100/month plan. Previously, subscriptions jumped from $2...
Dubbed Claude Mythos, the software is part of the Claude AI family, an artificial intelligence model that can act like a chatbot and AI a...
Abstract page for arXiv paper 2511.22935: EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model
Abstract page for arXiv paper 2412.13091: LMUnit: Fine-grained Evaluation with Natural Language Unit Tests
Abstract page for arXiv paper 2510.15982: AMiD: Knowledge Distillation for LLMs with $α$-mixture Assistant Distribution
Abstract page for arXiv paper 2406.06512: Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset
Abstract page for arXiv paper 2405.15374: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph
Abstract page for arXiv paper 2509.23405: Planner Aware Path Learning in Diffusion Language Models Training
Abstract page for arXiv paper 2509.22263: Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning
Abstract page for arXiv paper 2509.21465: Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data
Abstract page for arXiv paper 2509.17874: Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models
Abstract page for arXiv paper 2602.09937: Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?
Abstract page for arXiv paper 2506.15963: On the Limits of Sparse Autoencoders: A Theoretical Framework and Reweighted Remedy
Abstract page for arXiv paper 2601.16529: SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters fo...
Abstract page for arXiv paper 2601.15160: Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning
Abstract page for arXiv paper 2511.22235: Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon ...
Abstract page for arXiv paper 2511.21471: SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
Abstract page for arXiv paper 2511.05854: Can a Small Model Learn to Look Before It Leaps? Dynamic Learning and Proactive Correction for ...
Abstract page for arXiv paper 2510.26905: Cognition Envelopes for Bounded Decision Making in Autonomous UAS Operations
Abstract page for arXiv paper 2505.20065: SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety
Abstract page for arXiv paper 2510.09782: The Geometry of Reasoning: Flowing Logics in Representation Space
Abstract page for arXiv paper 2510.07972: SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime