[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images
Computer Science > Computer Vision and Pattern Recognition
arXiv:2511.18746 (cs)

This paper has been withdrawn by Qiao Sun.
[Submitted on 24 Nov 2025 (v1), last revised 27 Mar 2026 (this version, v2)]

Title: Any4D: Open-Prompt 4D Generation from Natural Language and Images
Authors: Hao Li, Qiao Sun

Abstract: While video-generation-based embodied world models have gained increasing attention, their reliance on large-scale embodied interaction data remains a key bottleneck. The scarcity, difficulty of collection, and high dimensionality of embodied data fundamentally limit the alignment granularity between language and actions, and they exacerbate the challenge of long-horizon video generation, hindering generative models from achieving a \textit{"GPT moment"} in the embodied domain. We start from a simple observation: \textit{the diversity of embodied data far exceeds the relatively small space of possible primitive motions}. Based on this insight, we propose \textbf{Primitive Embodied World Models} (PEWM), which restrict video generation to fixed shorter horizons. Our approach \textit{1) enables} fine-grained alignment between linguistic concepts and visual representations of robotic actions, \textit{2) reduces} learning complexity, \textit{3) improves} data efficiency in embodied data collection, and \textit{4) decr...
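
The abstract is truncated above, but its central mechanism, restricting generation to a fixed short horizon per motion primitive and composing those clips into a long-horizon rollout, lends itself to a brief sketch. The Python below is a minimal illustration under assumed details, not the authors' implementation: PrimitiveWorldModel, HORIZON, PRIMITIVES, generate_clip, and rollout are all hypothetical names, and random frames stand in for a real video generator.

```python
# Minimal sketch of the fixed-horizon idea from the abstract.
# All names here (PrimitiveWorldModel, HORIZON, PRIMITIVES, generate_clip,
# rollout) are hypothetical illustrations, not the authors' API.
import numpy as np

HORIZON = 16                 # fixed, short clip length per primitive (assumed)
FRAME_SHAPE = (64, 64, 3)    # toy frame resolution (assumed)

# A small primitive-motion vocabulary, per the observation that the space of
# primitive motions is far smaller than the diversity of embodied data.
PRIMITIVES = ["reach", "grasp", "lift", "place"]

class PrimitiveWorldModel:
    """Stand-in for a video generator conditioned on one motion primitive.

    A real model would map (current frame, primitive text) to a short clip;
    here we emit lightly perturbed frames just to make the control flow concrete.
    """

    def generate_clip(self, start_frame: np.ndarray, primitive: str) -> np.ndarray:
        assert primitive in PRIMITIVES, f"unknown primitive: {primitive}"
        rng = np.random.default_rng(abs(hash(primitive)) % (2**32))
        noise = rng.normal(0.0, 0.05, size=(HORIZON, *FRAME_SHAPE))
        return np.clip(start_frame[None] + noise, 0.0, 1.0)

def rollout(model: PrimitiveWorldModel,
            start_frame: np.ndarray,
            plan: list[str]) -> np.ndarray:
    """Compose a long-horizon video by chaining fixed-horizon primitive clips.

    Each clip is conditioned on the last frame of the previous one, so the
    generator never has to produce more than HORIZON frames at once.
    """
    clips = []
    frame = start_frame
    for primitive in plan:
        clip = model.generate_clip(frame, primitive)
        clips.append(clip)
        frame = clip[-1]  # hand the final frame off to the next primitive
    return np.concatenate(clips, axis=0)

if __name__ == "__main__":
    model = PrimitiveWorldModel()
    first = np.full(FRAME_SHAPE, 0.5)
    video = rollout(model, first, ["reach", "grasp", "lift", "place"])
    print(video.shape)  # (64, 64, 64, 3): 4 primitives x 16 frames each
```

Chaining clips this way keeps each generation call at the fixed short horizon, which is the source of the reduced learning complexity and improved data efficiency the abstract claims.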