[2603.28086] MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions

arXiv - AI March 31, 2026 3 min read

Computer Science > Sound

arXiv:2603.28086 (cs) [Submitted on 30 Mar 2026]

Title: MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions

Authors: Kexin Huang, Liwei Fan, Botian Jiang, Yaozhou Jiang, Qian Tu, Jie Zhu, Yuqian Zhang, Yiwei Zhao, Chenchen Yang, Zhaoye Fei, Shimin Li, Xiaogui Yang, Qinyuan Cheng, Xipeng Qiu

Abstract: Voice design from natural language aims to generate speaker timbres directly from free-form textual descriptions, allowing users to create voices tailored to specific roles, personalities, and emotions. Such controllable voice creation benefits a wide range of downstream applications, including storytelling, game dubbing, role-play agents, and conversational assistants, making it a significant task for modern Text-to-Speech models. However, existing models are largely trained on carefully recorded studio data, which produces speech that is clean and well-articulated, yet lacks the lived-in qualities of real human voices. To address these limitations, we present MOSS-VoiceGenerator, an open-source instruction-driven voice generation model that creates new timbres directly from natural language prompts. Motivated by the hypothesis that exposure to real-world acoustic variation produces more perceptually natural voices, we train on large-scale expressive spe...

Originally published on March 31, 2026. Curated by AI News.
