GigaHands: A Massive Annotated Dataset of Bimanual Hand Activities
Rao Fu, Dingxi Zhang, Alex Jiang, Wanjia Fu, Austin Funk, Daniel Ritchie, Srinath Sridhar
TL;DR
GigaHands tackles the scarcity of large-scale, richly annotated 3D bimanual hand activity data by introducing a markerless, multi-view dataset that captures 34 hours of activity from 56 subjects across 417 objects. It provides 183 million frames, 14k motion clips, and 84k text annotations, paired with fully automatic 3D hand and object estimation and dense camera views enabling dynamic radiance field reconstruction. The paper demonstrates the dataset's value through improvements in text-driven hand motion synthesis, motion captioning, and 3D scene reconstruction, powered by an instruct-to-annotate data collection pipeline. Overall, GigaHands advances the scalability and diversity of hand-action data, with significant implications for AI, robotics, and interactive 3D understanding.
Abstract
Understanding bimanual human hand activities is a critical problem in AI and robotics. We cannot build large models of bimanual activities because existing datasets lack the scale, coverage of diverse hand activities, and detailed annotations. We introduce GigaHands, a massive annotated dataset capturing 34 hours of bimanual hand activities from 56 subjects and 417 objects, totaling 14k motion clips derived from 183 million frames paired with 84k text annotations. Our markerless capture setup and data acquisition protocol enable fully automatic 3D hand and object estimation while minimizing the effort required for text annotation. The scale and diversity of GigaHands enable broad applications, including text-driven action synthesis, hand motion captioning, and dynamic radiance field reconstruction. Our website are avaliable at https://ivl.cs.brown.edu/research/gigahands.html .
