[2603.27314] TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba
Computer Science > Artificial Intelligence
arXiv:2603.27314 (cs) [Submitted on 28 Mar 2026]

Title: TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba
Authors: Ziyue Yang, Kaixing Yang, Xulong Tang

Abstract: Music-to-dance generation has broad applications in virtual reality, dance education, and digital character animation. However, the limited coverage of existing 3D dance datasets confines current models to a narrow subset of music styles and choreographic patterns, resulting in poor generalization to real-world music. Consequently, generated dances often become overly simplistic and repetitive, substantially degrading expressiveness and realism. To tackle this problem, we present TokenDance, a two-stage music-to-dance generation framework that explicitly addresses this limitation through dual-modality tokenization and efficient token-level generation. In the first stage, we discretize both dance and music using Finite Scalar Quantization, where dance motions are factorized into upper- and lower-body components with kinematic-dynamic constraints, and music is decomposed into semantic and acoustic features with dedicated codebooks to capture choreography-specific structures. In the second stage, we introduce a Local-Global-Local token-to-token generator built on a Bidirectional Mamba ...
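The first stage relies on Finite Scalar Quantization (FSQ), which discretizes a continuous latent by bounding each dimension and rounding it to a small fixed grid, so the implied codebook is the product of per-dimension levels. The sketch below illustrates the general FSQ idea only; the level counts, the `tanh` bounding, and the mixed-radix token id are illustrative assumptions, not the paper's configuration.

```python
import numpy as np

def fsq_quantize(z, levels):
    """Finite Scalar Quantization sketch: bound each latent dim with tanh,
    then round to one of `levels[i]` uniform grid points (odd levels assumed)."""
    half = np.asarray(levels) // 2          # e.g. 7 levels -> grid {-3, ..., 3}
    digits = np.round(np.tanh(z) * half)    # integer grid coordinate per dim
    return digits / half, (digits + half).astype(int)

def code_index(digits, levels):
    """Combine per-dim digits into a single token id via mixed-radix packing."""
    idx = 0
    for d, num_levels in zip(digits, levels):
        idx = idx * num_levels + int(d)
    return idx

levels = [7, 5, 5]                 # implied codebook size: 7 * 5 * 5 = 175
z = np.array([0.3, -1.2, 0.05])    # a toy continuous latent vector
q, digits = fsq_quantize(z, levels)
token = code_index(digits, levels)
```

Because the grid is fixed, FSQ needs no learned codebook or commitment loss; the token id directly enumerates the grid cell, which is what makes the second-stage token-to-token generation straightforward.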