Researchers have developed GenHOI, a novel framework enabling humanoid robots to perform diverse object-interaction tasks in a zero-shot manner by directly imitating generated videos, without task-specific training or physical demonstration data. The framework reconstructs the robot-object scene in simulation, generates task-oriented interaction videos, and extracts contact-relevant constraints that convert visual cues into physically grounded optimization priors. Validated across tasks including box grasping, bimanual chair carrying, table lifting, and cylindrical-object enveloping.
ResearchJune 12, 2026•Zhihai Bi et al.
GenHOI: Zero-Shot Contact-Aware Humanoid-Object Interaction via Video Imitation
A new arXiv paper presents GenHOI, enabling humanoid robots to learn object manipulation skills by imitating generated videos without task-specific training.
#humanoid#robotics#arxiv#zero-shot#manipulation#video-imitation
Reading in English
Language: English- Showing content in English
Trending Now
Industry
LG CNS and LX Pantos Partner to Build Next-Generation Unmanned Warehouse with Humanoid Robots
Jun 11, 2026 · 0 views

Research
X Square Robot Open-Sources XRZero-G0 Framework for Scalable Robot Learning
Jun 10, 2026 · 0 views
Research
Human Archive Raises $8.2M to Train Robots Using India's Gig Economy Workers
Jun 7, 2026 · 0 views
Industry
The Economist Highlights Ningbo as the Unlikely Heart of Global Humanoid Robot Component Supply Chain
Jun 7, 2026 · 0 views
More in Research
Research
GenHOI: Zero-Shot Humanoid-Object Interaction by Imitating Generated Videos
Jun 13, 2026
Research
MIT Ultrasound Wristband Tracks Every Finger Movement, Controls Robot Hand in Real Time
Jun 13, 2026
Research
Generalist AI Unveils GEN-0: Embodied Foundation Model That Scales with Physical Interaction
Jun 13, 2026