[2604.00830] Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
Computer Science > Machine Learning
arXiv:2604.00830 (cs) [Submitted on 1 Apr 2026]
Title: Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
Authors: Zhanzhi Lou, Hui Chen, Yibo Li, Qian Wang, Bryan Hooi

Abstract: Test-Time Learning (TTL) enables language agents to iteratively refine their performance through repeated interactions with the environment at inference time. At the core of TTL is an adaptation policy that updates the actor policy based on experience from previous episodes, thereby improving future behavior. Existing methods rely on fixed, hand-crafted adaptation policies rather than optimizing them for downstream improvement. We argue that optimal adaptation policies should be learned from task environments, not hand-engineered based on human intuition. To achieve this, we introduce Meta-TTL, a framework that formulates the discovery of effective adaptation policies as a bi-level optimization problem. Within this framework, the inner loop executes the standard TTL process, measuring how effectively a candidate adaptation policy helps an agent correct errors across sequential episodes. Guided by the agent's performance, the outer loop employs evolutionary search over a diverse distribution of training tasks to iteratively refine the adaptation policy. We ev...
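The bi-level structure the abstract describes can be sketched as follows. This is a minimal illustrative toy, not the paper's implementation: all names (`inner_loop_score`, `outer_loop_evolve`, `toy_task`) and the scalar "adaptation policy" are assumptions made here for concreteness. The inner loop runs a candidate adaptation policy through several TTL episodes and scores the post-adaptation behavior; the outer loop performs simple elitist evolutionary search over candidates guided by that score.

```python
import random

def inner_loop_score(policy, tasks, episodes=3):
    """Inner loop: run TTL with a candidate adaptation policy and
    return the mean final-episode reward across training tasks."""
    total = 0.0
    for task in tasks:
        memory = []                  # experience carried across episodes
        reward = 0.0
        for _ in range(episodes):
            reward = task(policy, memory)  # one TTL episode
            memory.append(reward)          # adaptation conditions on past outcomes
        total += reward                    # score behavior *after* adaptation
    return total / len(tasks)

def outer_loop_evolve(population, mutate, tasks, generations=10, keep=1):
    """Outer loop: elitist evolutionary search over adaptation policies,
    guided by inner-loop TTL performance."""
    population = list(population)
    for _ in range(generations):
        ranked = sorted(population,
                        key=lambda p: inner_loop_score(p, tasks),
                        reverse=True)
        elites = ranked[:keep]
        population = elites + [mutate(random.choice(elites))
                               for _ in range(len(population) - keep)]
    return max(population, key=lambda p: inner_loop_score(p, tasks))

# Toy demo: the "adaptation policy" is a single scalar strength; reward
# grows with both the strength and accumulated experience, capped at 1.
def toy_task(policy, memory):
    return min(1.0, 0.1 * policy * (len(memory) + 1))

random.seed(0)
best = outer_loop_evolve(
    population=[0.1, 0.5, 1.0],
    mutate=lambda p: max(0.0, p + random.uniform(-0.1, 0.3)),
    tasks=[toy_task],
    generations=5,
)
```

In the real framework the candidate policy would be a far richer object (e.g. a prompt or update rule applied to a language agent), but the control flow — score a candidate by how well it improves the agent across episodes, then evolve the candidate pool on that signal — follows the same shape.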