[2604.02051] Ouroboros: Dynamic Weight Generation for Recursive Transformers via Input-Conditioned LoRA Modulation
Computer Science > Machine Learning
arXiv:2604.02051 (cs)
[Submitted on 2 Apr 2026]

Title: Ouroboros: Dynamic Weight Generation for Recursive Transformers via Input-Conditioned LoRA Modulation
Authors: Jaber Jaber, Osama Jaber

Abstract: Recursive transformers reuse a shared weight block across multiple depth steps, trading parameters for compute. A core limitation: every step applies the same transformation, preventing the model from composing distinct operations across depth. We present Ouroboros, a system that attaches a compact Controller hypernetwork to a recursive transformer block. The Controller observes the current hidden state, produces a per-step diagonal modulation vector, and applies it to frozen SVD-initialized LoRA bases, making each recurrence step input-dependent. We combine this with gated recurrence (bias-initialized to 88% retention) and per-step LayerNorm for stable deep iteration. On Qwen2.5-3B split into a Prelude/Recurrent/Coda architecture (17 of 36 layers retained), Ouroboros reduces training loss by 43.4% over the unmodified 17-layer baseline, recovering 51.3% of the performance gap caused by layer removal. The full system adds only 9.2M trainable parameters (Controller, gate, and per-step norms) yet outperforms equivalently-sized static per-step LoRA by 1...
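The mechanism the abstract describes can be sketched in PyTorch. This is a hypothetical illustration, not the authors' code: `ModulatedRecurrentBlock`, the choice of `nn.Linear` as the shared block, and the gate-bias value are all assumptions. It shows the three ingredients named above: a Controller that maps the hidden state to a diagonal modulation over frozen SVD-initialized LoRA bases, a recurrence gate bias-initialized toward roughly 88% retention (sigmoid(2.0) ≈ 0.88), and a LayerNorm applied at each step (simplified here to a single shared norm rather than one per step).

```python
import torch
import torch.nn as nn


class ModulatedRecurrentBlock(nn.Module):
    """Hypothetical sketch of input-conditioned LoRA modulation
    in a recursive block, as described in the Ouroboros abstract."""

    def __init__(self, d_model: int, rank: int):
        super().__init__()
        # Stand-in for the shared transformer block (the paper uses
        # full transformer layers; a linear layer keeps the sketch small).
        self.core = nn.Linear(d_model, d_model)
        # Frozen LoRA bases taken from the SVD of the core weight.
        U, S, Vh = torch.linalg.svd(self.core.weight.detach())
        self.register_buffer("A", Vh[:rank, :])             # (rank, d_model)
        self.register_buffer("B", U[:, :rank] * S[:rank])   # (d_model, rank)
        # Controller hypernetwork: hidden state -> diagonal modulation.
        self.controller = nn.Linear(d_model, rank)
        # Recurrence gate, bias-initialized so sigmoid(bias) ~= 0.88,
        # i.e. ~88% retention of the previous hidden state.
        self.gate = nn.Linear(d_model, d_model)
        nn.init.constant_(self.gate.bias, 2.0)
        self.norm = nn.LayerNorm(d_model)

    def step(self, h: torch.Tensor) -> torch.Tensor:
        m = self.controller(h)                      # (..., rank) modulation
        # Modulated low-rank update: (h A^T) * m, then project back with B.
        delta = ((h @ self.A.T) * m) @ self.B.T
        update = self.norm(self.core(h) + delta)
        g = torch.sigmoid(self.gate(h))             # retention gate
        return g * h + (1 - g) * update

    def forward(self, h: torch.Tensor, steps: int = 4) -> torch.Tensor:
        for _ in range(steps):
            h = self.step(h)
        return h
```

Because the Controller output `m` depends on `h`, the effective low-rank delta differs at every recurrence step even though the bases `A` and `B` stay frozen, which is the input-dependence the abstract emphasizes.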