DHAGrasp: Synthesizing Affordance-Aware Dual-Hand Grasps with Text Instructions


Quanzhou Li1    Zhonghua Wu2    Jingbo Wang3    Chen Change Loy1†    Bo Dai4

1S-Lab, Nanyang Technological University    2SenseTime Research    3Shanghai AI Laboratory    4The University of Hong Kong

† Corresponding author




Abstract


Learning to generate dual-hand grasps that respect object semantics is essential for robust hand–object interaction but remains largely underexplored due to dataset scarcity. Existing grasp datasets predominantly focus on single-hand interactions and contain only limited semantic part annotations. To address these challenges, we introduce a pipeline, SymOpt, that constructs a large-scale dual-hand grasp dataset by leveraging existing single-hand datasets and exploiting object and hand symmetries. Building on this, we propose a text-guided dual-hand grasp generator, DHAGrasp, that synthesizes Dual-Hand Affordance-aware Grasps for unseen objects. Our approach incorporates a novel dual-hand affordance representation and follows a two-stage design, which enables effective learning from a small set of segmented training objects while scaling to a much larger pool of unsegmented data. Extensive experiments demonstrate that our method produces diverse and semantically consistent grasps, outperforming strong baselines in both grasp quality and generalization to unseen objects.


Method


Our work consists of two main phases: data generation and grasp synthesis. In the data generation phase, we first mirror right-hand grasps across the pseudo-symmetry plane of the object to obtain left-hand grasp proposals. We then combine the right- and left-hand grasps and apply an energy-based optimization scheme to construct dual-hand grasps. In the grasp synthesis phase, we propose a dual-hand contact representation, as illustrated in the second box, and a two-stage grasp generation pipeline, outlined in the third box. In the first stage of the generator, we design a diffusion model, Text2Dir, conditioned on the object and text, that predicts the affordance directions of a grasp. The object and the predicted affordance directions are subsequently passed to our Dir2Grasp model to generate the final grasp.
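The mirroring step amounts to a standard reflection about the object's pseudo-symmetry plane. Below is a minimal NumPy sketch, assuming grasps are represented as 3D hand keypoints plus an optional global wrist rotation; the function names and this representation are illustrative, not the released pipeline.

import numpy as np

def mirror_points(points, plane_point, plane_normal):
    """Reflect Nx3 points across the plane through `plane_point` with
    normal `plane_normal` (Householder reflection)."""
    n = plane_normal / np.linalg.norm(plane_normal)
    d = (points - plane_point) @ n          # signed distance to the plane
    return points - 2.0 * d[:, None] * n    # step across the plane twice

def mirror_rotation(R, plane_normal):
    """Conjugate a 3x3 rotation by the reflection M = I - 2 n n^T.
    The result is again a proper rotation, but it orients the mirrored,
    opposite-handed hand (e.g. a left hand obtained from a right hand)."""
    n = plane_normal / np.linalg.norm(plane_normal)
    M = np.eye(3) - 2.0 * np.outer(n, n)
    return M @ R @ M

# Example: reflect a right-hand grasp about the pseudo-symmetry plane to
# obtain a left-hand proposal (all values below are placeholders).
right_keypoints = np.random.rand(21, 3)      # hypothetical 21 hand joints
plane_point = np.zeros(3)                    # a point on the symmetry plane
plane_normal = np.array([1.0, 0.0, 0.0])     # plane normal (assumed known)
left_proposal = mirror_points(right_keypoints, plane_point, plane_normal)

If grasps are instead stored as parameters of a handed model such as MANO, the reflected pose belongs to the opposite hand and should be re-targeted to the left-hand model rather than kept on the right-hand one.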



Video




Qualitative results


Grasps generated by DHAGrasp on unseen objects.



Citation

@article{li2025dhagrasp,
  title     = {DHAGrasp: Synthesizing Affordance-Aware Dual-Hand Grasps with Text Instructions},
  author    = {Li, Quanzhou and Wu, Zhonghua and Wang, Jingbo and Loy, Chen Change and Dai, Bo},
  journal   = {arXiv preprint arXiv:2509.22175},
  year      = {2025},
  doi       = {10.48550/arXiv.2509.22175},
  url       = {https://doi.org/10.48550/arXiv.2509.22175}
}