Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

TRACER: Learn-to-Defer for LLM Classification with Formal Teacher-Agreement Guarantees

I'm releasing TRACER (Trace-Based Adaptive Cost-Efficient Routing), a library for learning cost-efficient routing policies from LLM trace...

Reddit - Machine Learning · 1 min ·
Mistral AI raises $830M in debt to set up a data center near Paris | TechCrunch
Llms

Mistral AI raises $830M in debt to set up a data center near Paris | TechCrunch

Mistral aims to start operating the data center by the second quarter of 2026.

TechCrunch - AI · 4 min ·
Llms

The Rationing: AI companies are using the "subsidize, addict, extract" playbook — and developers are the product

Anthropic just ran the classic platform playbook on developers: offer generous limits to build dependency, then tighten the screws once t...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions
Llms

[2603.23461] End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

Abstract page for arXiv paper 2603.23461: End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

arXiv - Machine Learning · 3 min ·
[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling
Llms

[2603.23414] SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Abstract page for arXiv paper 2603.23414: SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

arXiv - AI · 4 min ·
[2603.22368] When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations
Llms

[2603.22368] When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations

Abstract page for arXiv paper 2603.22368: When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualiza...

arXiv - AI · 4 min ·
[2603.23355] Off-Policy Value-Based Reinforcement Learning for Large Language Models
Llms

[2603.23355] Off-Policy Value-Based Reinforcement Learning for Large Language Models

Abstract page for arXiv paper 2603.23355: Off-Policy Value-Based Reinforcement Learning for Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.22367] Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window
Llms

[2603.22367] Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

Abstract page for arXiv paper 2603.22367: Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window

arXiv - AI · 3 min ·
[2603.23268] SafeSeek: Universal Attribution of Safety Circuits in Language Models
Llms

[2603.23268] SafeSeek: Universal Attribution of Safety Circuits in Language Models

Abstract page for arXiv paper 2603.23268: SafeSeek: Universal Attribution of Safety Circuits in Language Models

arXiv - AI · 4 min ·
[2603.22363] Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale
Llms

[2603.22363] Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale

Abstract page for arXiv paper 2603.22363: Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale

arXiv - AI · 4 min ·
[2603.22341] T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
Llms

[2603.22341] T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Abstract page for arXiv paper 2603.22341: T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

arXiv - AI · 3 min ·
[2603.23198] Sparser, Faster, Lighter Transformer Language Models
Llms

[2603.23198] Sparser, Faster, Lighter Transformer Language Models

Abstract page for arXiv paper 2603.23198: Sparser, Faster, Lighter Transformer Language Models

arXiv - Machine Learning · 3 min ·
[2603.22335] Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation
Llms

[2603.22335] Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation

Abstract page for arXiv paper 2603.22335: Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation

arXiv - AI · 3 min ·
[2603.23173] A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control
Llms

[2603.23173] A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control

Abstract page for arXiv paper 2603.23173: A Schrödinger Eigenfunction Method for Long-Horizon Stochastic Optimal Control

arXiv - Machine Learning · 4 min ·
[2603.23140] DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models
Llms

[2603.23140] DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Abstract page for arXiv paper 2603.23140: DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

arXiv - Machine Learning · 4 min ·
[2603.23129] Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair
Llms

[2603.23129] Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair

Abstract page for arXiv paper 2603.23129: Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy...

arXiv - Machine Learning · 4 min ·
[2603.22327] AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI
Llms

[2603.22327] AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

Abstract page for arXiv paper 2603.22327: AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

arXiv - AI · 3 min ·
[2603.23043] Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts
Llms

[2603.23043] Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

Abstract page for arXiv paper 2603.23043: Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

arXiv - AI · 4 min ·
[2603.22984] Can Graph Foundation Models Generalize Over Architecture?
Llms

[2603.22984] Can Graph Foundation Models Generalize Over Architecture?

Abstract page for arXiv paper 2603.22984: Can Graph Foundation Models Generalize Over Architecture?

arXiv - AI · 4 min ·
[2603.22321] From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs
Llms

[2603.22321] From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs

Abstract page for arXiv paper 2603.22321: From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos fo...

arXiv - AI · 4 min ·
[2603.22892] VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents
Llms

[2603.22892] VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

Abstract page for arXiv paper 2603.22892: VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents

arXiv - Machine Learning · 4 min ·
[2603.22882] TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration
Llms

[2603.22882] TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration

Abstract page for arXiv paper 2603.22882: TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Explora...

arXiv - Machine Learning · 4 min ·
[2603.22784] Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models
Llms

[2603.22784] Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

Abstract page for arXiv paper 2603.22784: Caterpillar of Thoughts: The Optimal Test-Time Algorithm for Large Language Models

arXiv - Machine Learning · 4 min ·
Previous Page 24 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime