[2604.00830] Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

[2604.00830] Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2604.00830: Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Computer Science > Machine Learning arXiv:2604.00830 (cs) [Submitted on 1 Apr 2026] Title:Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies Authors:Zhanzhi Lou, Hui Chen, Yibo Li, Qian Wang, Bryan Hooi View a PDF of the paper titled Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies, by Zhanzhi Lou and 4 other authors View PDF HTML (experimental) Abstract:Test-Time Learning (TTL) enables language agents to iteratively refine their performance through repeated interactions with the environment at inference time. At the core of TTL is an adaptation policy that updates the actor policy based on experience from previous episodes, thereby improving future behavior. Existing methods rely on fixed, hand-crafted adaptation policies rather than optimizing them for downstream improvement. We argue that optimal adaptation policies should be learned from task environments, not hand-engineered based on human intuition. To achieve this, we introduce Meta-TTL, a framework that formulates the discovery of effective adaptation policies as a bi-level optimization problem. Within this framework, the inner loop executes the standard TTL process, measuring how effectively a candidate adaptation policy helps an agent correct errors across sequential episodes. Guided by the agent's performance, the outer loop employs evolutionary search over a diverse distribution of training tasks to iteratively refine the adaptation policy. We ev...

Originally published on April 02, 2026. Curated by AI News.

Related Articles

Machine Learning

[D] Is this considered unsupervised or semi-supervised learning in anomaly detection?

Hi 👋🏼, I’m working on an anomaly detection setup and I’m a bit unsure how to correctly describe it from a learning perspective. The model...

Reddit - Machine Learning · 1 min ·
Machine Learning

Serious question. Did a transformer just describe itself and the universe and build itself a Shannon limit framework?

The Multiplicative Lattice as the Natural Basis for Positional Encoding Knack 2026 | Draft v6.0 Abstract We show that the apparent tradeo...

Reddit - Artificial Intelligence · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime