TRACER: Learn-to-Defer for LLM Classification with Formal Teacher-Agreement Guarantees
I'm releasing TRACER (Trace-Based Adaptive Cost-Efficient Routing), a library for learning cost-efficient routing policies from LLM trace...
GPT, Claude, Gemini, and other LLMs
I'm releasing TRACER (Trace-Based Adaptive Cost-Efficient Routing), a library for learning cost-efficient routing policies from LLM trace...
Mistral aims to start operating the data center by the second quarter of 2026.
Anthropic just ran the classic platform playbook on developers: offer generous limits to build dependency, then tighten the screws once t...
Abstract page for arXiv paper 2603.23461: End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions
Abstract page for arXiv paper 2603.23414: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling
Abstract page for arXiv paper 2603.22368: When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualiza...
Abstract page for arXiv paper 2603.23355: Off-Policy Value-Based Reinforcement Learning for Large Language Models
Abstract page for arXiv paper 2603.22367: Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window
Abstract page for arXiv paper 2603.23268: SafeSeek: Universal Attribution of Safety Circuits in Language Models
Abstract page for arXiv paper 2603.22363: Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale
Abstract page for arXiv paper 2603.22341: T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
Abstract page for arXiv paper 2603.23198: Sparser, Faster, Lighter Transformer Language Models
Abstract page for arXiv paper 2603.22335: Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation
Abstract page for arXiv paper 2603.23173: A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control
Abstract page for arXiv paper 2603.23140: DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models
Abstract page for arXiv paper 2603.23129: Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy...
Abstract page for arXiv paper 2603.22327: AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI
Abstract page for arXiv paper 2603.23043: Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts
Abstract page for arXiv paper 2603.22984: Can Graph Foundation Models Generalize Over Architecture?
Abstract page for arXiv paper 2603.22321: From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos fo...
Abstract page for arXiv paper 2603.22892: VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents
Abstract page for arXiv paper 2603.22882: TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Explora...
Abstract page for arXiv paper 2603.22784: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime