[2603.01804] Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments

[2603.01804] Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2603.01804: Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments

Computer Science > Computer Vision and Pattern Recognition arXiv:2603.01804 (cs) [Submitted on 2 Mar 2026] Title:Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments Authors:Dragos Costea, Alina Marcu, Cristina Lazar, Marius Leordeanu View a PDF of the paper titled Non-verbal Real-time Human-AI Interaction in Constrained Robotic Environments, by Dragos Costea and 3 other authors View PDF HTML (experimental) Abstract:We study the ongoing debate regarding the statistical fidelity of AI-generated data compared to human-generated data in the context of non-verbal communication using full body motion. Concretely, we ask if contemporary generative models move beyond surface mimicry to participate in the silent, but expressive dialogue of body language. We tackle this question by introducing the first framework that generates a natural non-verbal interaction between Human and AI in real-time from 2D body keypoints. Our experiments utilize four lightweight architectures which run at up to 100 FPS on an NVIDIA Orin Nano, effectively closing the perception-action loop needed for natural Human-AI interaction. We trained on 437 human video clips and demonstrated that pretraining on synthetically-generated sequences reduces motion errors significantly, without sacrificing speed. Yet, a measurable reality gap persists. When the best model is evaluated on keypoints extracted from cutting-edge text-to-video systems, such as SORA and VEO, we observe that performanc...

Originally published on March 03, 2026. Curated by AI News.

Related Articles

[2603.23899] SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries
Machine Learning

[2603.23899] SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries

Abstract page for arXiv paper 2603.23899: SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries

arXiv - AI · 4 min ·
[2603.16629] MLLM-based Textual Explanations for Face Comparison
Llms

[2603.16629] MLLM-based Textual Explanations for Face Comparison

Abstract page for arXiv paper 2603.16629: MLLM-based Textual Explanations for Face Comparison

arXiv - AI · 4 min ·
[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation
Llms

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

arXiv - AI · 4 min ·
[2603.14375] The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
Machine Learning

[2603.14375] The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

Abstract page for arXiv paper 2603.14375: The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

arXiv - AI · 4 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime