[2603.00576] Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation
Computer Science > Sound arXiv:2603.00576 (cs) [Submitted on 28 Feb 2026]

Title: Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation

Authors: Jinhan Xu, Xing Tang, Houpeng Yang, Haoran Zhang, Shenghua Yuan, Jiatao Chen, Tianming Xi, Jing Wang, Jiaojiao Yu, Guangli Xiang

Abstract: Symbolic music generation is a challenging multimedia generation task: it involves long sequences with hierarchical temporal structure, long-range dependencies, and fine-grained local detail. Although recent diffusion-based models produce high-quality output, they incur high training and inference costs on long symbolic sequences because of iterative denoising and sequence-length-dependent computation. To address this problem, we propose a diffusion strategy, SMDIM, that combines efficient global structure construction with lightweight local refinement. SMDIM uses structured state space models to capture long-range musical context at near-linear cost and selectively refines local musical details via a hybrid refinement scheme. Experiments on a wide range of symbolic music datasets spanning Western classical, popular, and traditional folk music show that SMDIM outperforms state-of-the-art approaches in both generation quality and computational efficiency.
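The abstract page does not show SMDIM's architecture, but the near-linear-cost property it attributes to structured state space models comes from their linear recurrent form: the state update can be computed with a single pass over the sequence. The sketch below is a generic illustration of that recurrence, not the paper's model; all names (`ssm_scan`, the matrices `A`, `B`, `C`) are illustrative assumptions.

```python
import numpy as np

def ssm_scan(A, B, C, u):
    """Run a discretized linear state-space recurrence over a sequence.

    x[k] = A @ x[k-1] + B * u[k]   (state update)
    y[k] = C @ x[k]                (readout)

    Cost is O(L * N^2) for sequence length L and state size N --
    linear in L, which is the scaling the abstract attributes to
    structured state space models (vs. quadratic attention).
    This is an illustrative sketch, not SMDIM itself.
    """
    N = A.shape[0]
    x = np.zeros(N)
    ys = np.empty(len(u))
    for k, u_k in enumerate(u):
        x = A @ x + B * u_k  # carry long-range context in a fixed-size state
        ys[k] = C @ x
    return ys

# Toy 1-dimensional example: a decaying memory of past inputs.
A = np.array([[0.5]])
B = np.array([1.0])
C = np.array([1.0])
y = ssm_scan(A, B, C, [1.0, 0.0, 0.0])
# An impulse input decays geometrically through the state: [1.0, 0.5, 0.25]
```

Because the state `x` has fixed size regardless of how far back the relevant context lies, sequence length only enters the cost through the single scan, which is what makes this family attractive for long symbolic music sequences.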