[2603.04122] FastWave: Optimized Diffusion Model for Audio Super-Resolution
About this article
Abstract page for arXiv paper 2603.04122: FastWave: Optimized Diffusion Model for Audio Super-Resolution
Computer Science > Sound arXiv:2603.04122 (cs) [Submitted on 4 Mar 2026] Title:FastWave: Optimized Diffusion Model for Audio Super-Resolution Authors:Nikita Kuznetsov, Maksim Kaledin View a PDF of the paper titled FastWave: Optimized Diffusion Model for Audio Super-Resolution, by Nikita Kuznetsov and 1 other authors View PDF HTML (experimental) Abstract:Audio Super-Resolution is a set of techniques aimed at high-quality estimation of the given signal as if it would be sampled with higher sample rate. Among suggested methods there are diffusion and flow models (which are considered slower), generative adversarial networks (which are considered faster), however both approaches are currently presented by high-parametric networks, requiring high computational costs both for training and inference. We propose a solution to both these problems by re-considering the recent advances in the training of diffusion models and applying them to super-resolution from any to 48 kHz sample rate. Our approach shows better results than NU-Wave 2 and is comparable to state-of-the-art models. Our model called FastWave has around 50 GFLOPs of computational complexity and 1.3 M parameters and can be trained with less resources and significantly faster than the majority of recently proposed diffusion- and flow-based solutions. The code has been made publicly available. Subjects: Sound (cs.SD); Machine Learning (cs.LG) Cite as: arXiv:2603.04122 [cs.SD] (or arXiv:2603.04122v1 [cs.SD] for this ver...