[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

arXiv - Machine Learning · March 31, 2026

arXiv:2508.13773 (cs)
Submitted on 19 Aug 2025 (v1), last revised 29 Mar 2026 (this version, v3)

Title: PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

Authors: Tian Sun, Yuqi Chen, Weiwei Sun

Abstract: Despite advances in the Transformer architecture, their effectiveness for long-term time series forecasting (LTSF) remains controversial. In this paper, we investigate the potential of integrating explicit periodicity modeling into the self-attention mechanism to enhance the performance of Transformer-based architectures for LTSF. Specifically, we propose PENGUIN, a simple yet effective periodic-nested group attention mechanism. Our approach introduces a periodic-aware relative attention bias to directly capture periodic structures and a grouped multi-query attention mechanism to handle multiple coexisting periodicities (e.g., daily and weekly cycles) within time series data. Extensive experiments across diverse benchmarks demonstrate that PENGUIN consistently outperforms both MLP-based and Transformer-based models. Code is available at this https URL.

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2508.13773 [cs.LG] (or arXiv:2508.13773v3 [cs.LG] fo...
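The abstract names two ingredients, a periodic-aware relative attention bias and grouped multi-query attention, without giving formulas. The NumPy sketch below shows one plausible reading and is not the authors' implementation. It assumes that the bias depends only on the query-key offset modulo a period, that each group of heads gets its own period, and that heads within a group share a single key/value head. The names periodic_bias, grouped_periodic_attention, periods, and tables are hypothetical, and the bias tables, which would be learned in a real model, are plain arrays here.

```python
import numpy as np


def periodic_bias(seq_len, period, table):
    """Return B with B[i, j] = table[(i - j) % period] (assumed bias form)."""
    idx = np.arange(seq_len)
    rel = (idx[:, None] - idx[None, :]) % period  # offsets folded into [0, period)
    return table[rel]


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def grouped_periodic_attention(q, k, v, periods, tables):
    """Attention where each head group has its own period (illustrative only).

    q:       (groups, heads_per_group, seq_len, d) query heads
    k, v:    (groups, seq_len, d), one shared key/value head per group
    periods: one integer period per group, e.g. [24, 168] for hourly data
    tables:  tables[g] has shape (periods[g],); learnable in a real model
    """
    groups, _, seq_len, d = q.shape
    out = np.empty_like(q)
    for g in range(groups):
        bias = periodic_bias(seq_len, periods[g], tables[g])
        # (heads, seq_len, d) @ (d, seq_len) -> (heads, seq_len, seq_len)
        scores = q[g] @ k[g].T / np.sqrt(d) + bias
        out[g] = softmax(scores) @ v[g]
    return out
```

With hourly data, for example, periods = [24, 168] would give one group a daily bias and the other a weekly one, which matches the abstract's "daily and weekly cycles" example only in spirit.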

Originally published on March 31, 2026. Curated by AI News.
