[2505.16308] Beyond All-to-All: Causal-Aligned Transformer with Dynamic Structure Learning for Multivariate Time Series Forecasting

[2505.16308] Beyond All-to-All: Causal-Aligned Transformer with Dynamic Structure Learning for Multivariate Time Series Forecasting

arXiv - Machine Learning 4 min read Article

Summary

This article presents a novel approach to multivariate time series forecasting using a Causal Decomposition Transformer (CDT) that learns dynamic causal structures, improving prediction accuracy by addressing limitations of traditional all-to-all models.

Why It Matters

As multivariate time series data becomes increasingly prevalent in various fields, understanding causal relationships is essential for accurate forecasting. This research introduces a method that enhances predictive performance by distinguishing between different types of causal influences, which can lead to better decision-making in fields like finance, healthcare, and climate science.

Key Takeaways

  • The proposed all-to-one forecasting paradigm improves accuracy by predicting each target variable separately.
  • The Causal Decomposition Transformer (CDT) incorporates dynamic causal adapters to refine causal structure during training.
  • The research addresses issues of spurious correlations and collider bias, enhancing model robustness.
  • Extensive experiments validate the CDT's effectiveness across multiple benchmark datasets.
  • Understanding causal influences can significantly improve forecasting in various applications.

Computer Science > Machine Learning arXiv:2505.16308 (cs) [Submitted on 22 May 2025 (v1), last revised 13 Feb 2026 (this version, v2)] Title:Beyond All-to-All: Causal-Aligned Transformer with Dynamic Structure Learning for Multivariate Time Series Forecasting Authors:Xingyu Zhang, Hanyun Du, Zeen Song, Siyu Zhao, Changwen Zheng, Wenwen Qiang View a PDF of the paper titled Beyond All-to-All: Causal-Aligned Transformer with Dynamic Structure Learning for Multivariate Time Series Forecasting, by Xingyu Zhang and 5 other authors View PDF HTML (experimental) Abstract:Most existing multivariate time series forecasting methods adopt an all-to-all paradigm that feeds all variable histories into a unified model to predict their future values without distinguishing their individual roles. However, this undifferentiated paradigm makes it difficult to identify variable-specific causal influences and often entangles causally relevant information with spurious correlations. To address this limitation, we propose an all-to-one forecasting paradigm that predicts each target variable separately. Specifically, we first construct a Structural Causal Model from observational data and then, for each target variable, we partition the historical sequence into four subsegments according to the inferred causal structure: endogenous, direct causal, collider causal, and spurious correlation. Furthermore, we propose the Causal Decomposition Transformer (CDT), which integrates a dynamic causal adapter...

Related Articles

Machine Learning

[D] ICML Rebuttal Question

I am currently working on my response on the rebuttal acknowledgments for ICML and I doubting how to handle the strawman argument of that...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] ML researcher looking to switch to a product company.

Hey, I am an AI researcher currently working in a deep tech company as a data scientist. Prior to this, I was doing my PhD. My current ro...

Reddit - Machine Learning · 1 min ·
Machine Learning

Building behavioural response models of public figures using Brain scan data (Predict their next move using psychological modelling) [P]

Hey guys, I’m the same creator of Netryx V2, the geolocation tool. I’ve been working on something new called COGNEX. It learns how a pers...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] bitnet-edge: Ternary-weight CNNs ({-1,0,+1}) on MNIST and CIFAR-10, deployed to ESP32-S3 with zero multiplications

I built a pipeline that takes ternary-quantized CNNs from PyTorch training all the way to bare-metal inference on an ESP32-S3 microcontro...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime