[2410.06355] UNCOM: Zero-shot Context-Aware Command Understanding for Tabletop Scenarios

Computer Science > Robotics

arXiv:2410.06355 (cs)

[Submitted on 8 Oct 2024 (v1), last revised 8 May 2026 (this version, v3)]

Title: UNCOM: Zero-shot Context-Aware Command Understanding for Tabletop Scenarios

Authors: Antonio Galiza Cerdeira Gonzalez, Paweł Gajewski, Bipin Indurkhya

Abstract: This paper presents UNCOM, a novel hybrid framework for interpreting natural human commands in tabletop scenarios. The system integrates multiple sources of information -- speech, gestures, and scene context -- to extract structured, actionable instructions for robots. Addressing the need for general-purpose human-robot interaction in domestic environments, UNCOM is designed for zero-shot operation, without reliance on predefined object models or training data specific to a given task. Using foundational and task-specific deep learning models, it provides out-of-the-box speech recognition, natural language understanding, gesture detection, and object segmentation. The modular architecture enhances transparency and explainability by explicitly parsing commands into object-action-target representations, enabling integration with symbolic robotic frameworks. We demonstrate the system on a TIAGo++ robot and provide an evaluation on a real-world data set of human-robot interaction scenarios, achieving an 8...
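To make the idea of an object-action-target representation concrete, here is a minimal Python sketch. It is not taken from the paper: the Command class, its field names, the DEICTIC word list, and the resolve function are all hypothetical, chosen only to illustrate how a spoken command with a vague referent ("that") might be combined with a pointing gesture to yield a structured instruction.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Command:
    """Hypothetical object-action-target triple; field names are illustrative."""
    action: str
    obj: str
    target: Optional[str] = None


# Words treated as deictic placeholders in this sketch (an assumption,
# not a list taken from the paper).
DEICTIC = {"this", "that", "it", "there", "here"}


def resolve(command: Command, pointed_at: Optional[str]) -> Command:
    """Replace deictic placeholders with the label of the pointed-at item."""
    def fill(slot: Optional[str]) -> Optional[str]:
        if slot is not None and slot.lower() in DEICTIC and pointed_at is not None:
            return pointed_at
        return slot

    return Command(command.action, fill(command.obj), fill(command.target))


# "Move that to the plate", spoken while pointing at a red cup.
spoken = Command(action="move", obj="that", target="plate")
resolved = resolve(spoken, pointed_at="red cup")
print(resolved)
```

This sketch fills every deictic slot with the same pointed-at label, which is a simplification; a real system would need to align each gesture with the matching part of the utterance.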

Originally published on May 11, 2026. Curated by AI News.
