[2602.13444] FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation

[2602.13444] FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation

arXiv - AI 4 min read Article

Summary

FlowHOI presents a novel framework for generating hand-object interactions in robotic manipulation, enhancing the realism and efficiency of robot task execution.

Why It Matters

This research addresses a critical gap in robotics by providing a structured approach to hand-object interactions, which are essential for effective robot manipulation. The proposed FlowHOI framework improves the accuracy and speed of robotic actions, making it a significant advancement in the field of robotics and AI.

Key Takeaways

  • FlowHOI introduces a two-stage flow-matching framework for generating hand-object interactions.
  • The framework enhances action recognition accuracy and physics simulation success rates.
  • It achieves a significant speedup in inference times compared to existing methods.
  • Real-robot execution demonstrates the practical applicability of the generated interactions.
  • The research addresses the scarcity of high-fidelity supervision for hand-object interactions.

Computer Science > Robotics arXiv:2602.13444 (cs) [Submitted on 13 Feb 2026] Title:FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation Authors:Huajian Zeng, Lingyun Chen, Jiaqi Yang, Yuantai Zhang, Fan Shi, Peidong Liu, Xingxing Zuo View a PDF of the paper titled FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation, by Huajian Zeng and Lingyun Chen and Jiaqi Yang and Yuantai Zhang and Fan Shi and Peidong Liu and Xingxing Zuo View PDF HTML (experimental) Abstract:Recent vision-language-action (VLA) models can generate plausible end-effector motions, yet they often fail in long-horizon, contact-rich tasks because the underlying hand-object interaction (HOI) structure is not explicitly represented. An embodiment-agnostic interaction representation that captures this structure would make manipulation behaviors easier to validate and transfer across robots. We propose FlowHOI, a two-stage flow-matching framework that generates semantically grounded, temporally coherent HOI sequences, comprising hand poses, object poses, and hand-object contact states, conditioned on an egocentric observation, a language instruction, and a 3D Gaussian splatting (3DGS) scene reconstruction. We decouple geometry-centric grasping from semantics-centric manipulation, conditioning the latter on compact 3D scene tokens and employing a motion-text alignment loss to semantically ground th...

Related Articles

Machine Learning

[D] Does ML have a "bible"/reference textbook at the Intermediate/Advanced level?

Hello, everyone! This is my first time posting here and I apologise if the question is, perhaps, a bit too basic for this sub-reddit. A b...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] ICML 2026 review policy debate: 100 responses suggest Policy B may score higher, while Policy A shows higher confidence

A week ago I made a thread asking whether ICML 2026’s review policy might have affected review outcomes, especially whether Policy A pape...

Reddit - Machine Learning · 1 min ·
Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles | TechCrunch
Machine Learning

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles | TechCrunch

The company turns footage from robots into structured, searchable datasets with a deep learning model.

TechCrunch - AI · 6 min ·
Machine Learning

[D] Applied AI/Machine learning course by Srikanth Varma

I have all 10 modules of this course, along with all the notes, assignments, and solutions. If anyone need this course DM me. submitted b...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime