[2506.18703] Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition

arXiv - Machine Learning · March 05, 2026 · 3 min read

About this article

Computer Science > Computation and Language

arXiv:2506.18703 (cs)

[Submitted on 23 Jun 2025 (v1), last revised 4 Mar 2026 (this version, v3)]

Title: Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition

Authors: Christian Huber, Alexander Waibel

Abstract: Neural sequence-to-sequence systems deliver state-of-the-art performance for automatic speech recognition. When using appropriate modeling units, e.g., byte-pair encoding, these systems are in principle open vocabulary systems. In practice, however, they often fail to recognize words not seen during training, e.g., named entities, acronyms, or domain-specific special words. To address this problem, many context biasing methods have been proposed; however, these methods may still struggle when they are unable to relate audio and corresponding text, e.g., in case of a pronunciation-orthography mismatch. We propose a method where corrections of substitution errors can be used to improve the recognition accuracy of such challenging words. Users can add corrections on the fly during inference. We show that with this method we get a relative improvement in biased word error rate between 22% and 34% compared to a text-based replacement method, while maintaining the overall performance.

Subjects: Computation and Lan...
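The abstract reports its result as a relative improvement in biased word error rate. As a minimal sketch of how such a figure is conventionally computed, the Python snippet below divides the reduction in error rate by the baseline error rate. The function name and the two example error rates are hypothetical and chosen only for illustration; they are not values from the paper, and the abstract does not state which absolute error rates lie behind the 22% to 34% range.

```python
def relative_improvement(baseline_error: float, new_error: float) -> float:
    """Return the relative reduction of an error rate as a fraction of the baseline.

    Both arguments are error rates on the same scale (e.g. 0.50 for 50%).
    """
    if baseline_error <= 0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error


# Hypothetical error rates for illustration only (NOT taken from the paper).
baseline_bwer = 0.50  # assumed biased WER of a baseline system
improved_bwer = 0.36  # assumed biased WER of an improved system

print(f"Relative improvement: {relative_improvement(baseline_bwer, improved_bwer):.0%}")
```

A relative figure of this kind says how much of the baseline's error was removed, so it should not be read as an absolute drop in percentage points.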

Originally published on March 05, 2026. Curated by AI News.
