[2512.23994] PhyAVBench: A Challenging Audio Physics-Sensitivity

[2512.23994] PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

arXiv - AI April 08, 2026 4 min read

About this article

Abstract page for arXiv paper 2512.23994: PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Computer Science > Sound arXiv:2512.23994 (cs) [Submitted on 30 Dec 2025 (v1), last revised 7 Apr 2026 (this version, v2)] Title:PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation Authors:Tianxin Xie, Wentao Lei, Kai Jiang, Guanjie Huang, Pengfei Zhang, Chunhui Zhang, Fengji Ma, Haoyu He, Han Zhang, Jiangshan He, Jinting Wang, Linghan Fang, Lufei Gao, Orkesh Ablet, Peihua Zhang, Ruolin Hu, Shengyu Li, Weilin Lin, Xiaoyang Feng, Xinyue Yang, Yan Rong, Yanyun Wang, Zihang Shao, Zelin Zhao, Chenxing Li, Shan Yang, Wenfu Wang, Meng Yu, Dong Yu, Li Liu View a PDF of the paper titled PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation, by Tianxin Xie and 29 other authors View PDF HTML (experimental) Abstract:Text-to-audio-video (T2AV) generation is central to applications such as filmmaking and world modeling. However, current models often fail to produce physically plausible sounds. Previous benchmarks primarily focus on audio-video temporal synchronization, while largely overlooking explicit evaluation of audio-physics grounding, thereby limiting the study of physically plausible audio-visual generation. To address this issue, we present PhyAVBench, the first benchmark that systematically evaluates the audio-physics grounding capabilities of T2AV, image-to-audio-video (I2AV), and video-to-audio (V2A) models. PhyAVBench offers PhyAV-Sound-11K, a new...

Originally published on April 08, 2026. Curated by AI News.

Machine Learning

Google quietly launched an AI dictation app that works offline | TechCrunch

Google's new offline-first dictation app uses Gemma AI models to take on the apps like Wispr Flow.

TechCrunch - AI · 4 min · about 1 hour ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Machine Learning

CONESTOGA COLLEGE Robots deepen AI and data analytics training for Conestoga students

AI News - General · 5 min · about 1 hour ago

Machine Learning

Alabama A&M University chosen for Amazon Web Services AI training program

AI News - General · 2 min · about 1 hour ago

[2512.23994] PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

About this article

Related Articles

Google quietly launched an AI dictation app that works offline | TechCrunch

UMKC Announces New Master of Science in Artificial Intelligence

CONESTOGA COLLEGE Robots deepen AI and data analytics training for Conestoga students

Alabama A&M University chosen for Amazon Web Services AI training program

No comments

Stay updated with AI News