[2604.06014] Gated-SwinRMT: Unifying Swin Windowed Attention with Retentive Manhattan Decay via Input-Dependent Gating
Computer Science > Machine Learning
arXiv:2604.06014 (cs)
[Submitted on 7 Apr 2026]

Title: Gated-SwinRMT: Unifying Swin Windowed Attention with Retentive Manhattan Decay via Input-Dependent Gating
Authors: Dipan Maity, Suman Mondal, Arindam Roy

Abstract: We introduce Gated-SwinRMT, a family of hybrid vision transformers that combine the shifted-window attention of the Swin Transformer with the Manhattan-distance spatial decay of Retentive Networks (RMT), augmented by input-dependent gating. Self-attention is decomposed into consecutive width-wise and height-wise retention passes within each shifted window, where per-head exponential decay masks provide a two-dimensional locality prior without learned positional biases. Two variants are proposed. \textbf{Gated-SwinRMT-SWAT} replaces softmax with a sigmoid activation, implements balanced ALiBi slopes with multiplicative post-activation spatial decay, and gates the value projection via SwiGLU; the normalized output implicitly suppresses uninformative attention scores. \textbf{Gated-SwinRMT-Retention} retains softmax-normalized retention with an additive log-space decay bias and incorporates an explicit G1 sigmoid gate, projected from the block input and applied after local context enhancement (LCE) but prior to the output projection...
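
To make the locality prior concrete, below is a minimal PyTorch sketch of the per-head Manhattan-distance decay mask within one window. The decay schedule (gamma_h = 1 - 2^(-5-h)) follows the RetNet convention and is an assumption here, as the abstract does not specify it; the paper also decomposes attention into width-wise and height-wise passes, whereas this sketch forms the dense 2D mask directly for clarity.

import torch

def manhattan_decay_mask(window_size: int, num_heads: int) -> torch.Tensor:
    # Token coordinates of a window_size x window_size grid, flattened to (N, 2).
    coords = torch.stack(torch.meshgrid(
        torch.arange(window_size), torch.arange(window_size), indexing="ij"
    ), dim=-1).reshape(-1, 2)
    # Pairwise Manhattan distance |x_i - x_j| + |y_i - y_j|, shape (N, N).
    dist = (coords[:, None, :] - coords[None, :, :]).abs().sum(-1).float()
    # Per-head decay rates; RetNet-style schedule (an assumption, not stated in the abstract).
    gammas = 1.0 - 2.0 ** (-5.0 - torch.arange(num_heads, dtype=torch.float))
    # Multiplicative mask D_h[i, j] = gamma_h ** dist(i, j). Taking log() of this
    # mask gives the additive log-space decay bias used by the Retention variant.
    return gammas[:, None, None] ** dist[None]  # (num_heads, N, N)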
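Similarly, a hedged sketch of the SWAT attention path: sigmoid scores in place of softmax, multiplied post-activation by the spatial decay mask, with a SwiGLU-style gate on the values. The row normalization and the projection names gate_w and gate_v are illustrative assumptions; the abstract states only that the normalized output suppresses uninformative attention scores.

import torch
import torch.nn.functional as F

def swat_window_attention(q, k, v, decay, gate_w, gate_v):
    # q, k, v: (num_heads, N, head_dim); decay: (num_heads, N, N) multiplicative mask.
    scores = torch.sigmoid(q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5)
    scores = scores * decay  # post-activation Manhattan decay
    # Row-normalize so the output scale stays bounded (assumed normalization scheme).
    scores = scores / scores.sum(dim=-1, keepdim=True).clamp_min(1e-6)
    # SwiGLU-style value gating; gate_w and gate_v are hypothetical (head_dim, head_dim) weights.
    gated_v = F.silu(v @ gate_w) * (v @ gate_v)
    return scores @ gated_v

# Example: a 7x7 window (N = 49), 4 heads, head_dim 32,
# reusing manhattan_decay_mask from the sketch above.
mask = manhattan_decay_mask(window_size=7, num_heads=4)
q = k = v = torch.randn(4, 49, 32)
out = swat_window_attention(q, k, v, mask, torch.randn(32, 32), torch.randn(32, 32))
print(out.shape)  # torch.Size([4, 49, 32])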