[2509.21739] Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription

[2509.21739] Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription

arXiv - Machine Learning 3 min read

About this article

Abstract page for arXiv paper 2509.21739: Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription

Computer Science > Sound arXiv:2509.21739 (cs) [Submitted on 26 Sep 2025 (v1), last revised 5 Mar 2026 (this version, v2)] Title:Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription Authors:Michael Yeung, Keisuke Toyama, Toya Teramoto, Shusuke Takahashi, Tamaki Kojima View a PDF of the paper titled Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription, by Michael Yeung and 4 other authors View PDF HTML (experimental) Abstract:Automatic drum transcription (ADT) is traditionally formulated as a discriminative task to predict drum events from audio spectrograms. In this work, we redefine ADT as a conditional generative task and introduce Noise-to-Notes (N2N), a framework leveraging diffusion modeling to transform audio-conditioned Gaussian noise into drum events with associated velocities. This generative diffusion approach offers distinct advantages, including a flexible speed-accuracy trade-off and strong inpainting capabilities. However, the generation of binary onset and continuous velocity values presents a challenge for diffusion models, and to overcome this, we introduce an Annealed Pseudo-Huber loss to facilitate effective joint optimization. Finally, to augment low-level spectrogram features, we propose incorporating features extracted from music foundation models (MFMs), which capture high-level semantic information and enhance robustness to out-of-domain drum audio. Experimental results demo...

Originally published on March 06, 2026. Curated by AI News.

Related Articles

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

ChatGPT Critiques My Approach to AI

I uploaded VulcanAMI into ChatGPT and had it to a deep analysis. I then asked one simple question: What would be the result of wider adop...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

I have created a biologically based AI model

I've spent the last year building NIMCP — a biologically-inspired artificial brain in C that trains six different neural network types si...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] Thinking about augmentation as invariance assumptions

Data augmentation is still used much more heuristically than it should be. A training pipeline can easily turn into a stack of intuition,...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime