DORA: Object Affordance-Guided Reinforcement Learning for Dexterous Robotic Manipulation
Lei Zhang, Soumya Mondal, Zhenshan Bing, Kaixin Bai, Diwen Zheng, Zhaopeng Chen, Alois Christian Knoll, Jianwei Zhang
TL;DR
This work tackles efficient dexterous manipulation by incorporating object affordances into both grasp generation and reinforcement learning. It introduces an object affordance-guided pipeline that produces functionally grounded grasp candidates and uses an affordance-aware reward to shape policy learning, evaluated on cube, jug, and hammer tasks. The results show notable improvements in task success and sample efficiency, with average gains around 15.4% and faster convergence, while also highlighting algorithm-dependent effects (PPO vs SAC). This approach advances semantically grounded, generalizable manipulation policies and points to future refinements in realistic contact modeling and mesh-aware training.
Abstract
Dexterous robotic manipulation remains a longstanding challenge in robotics due to the high dimensionality of control spaces and the semantic complexity of object interaction. In this paper, we propose an object affordance-guided reinforcement learning framework that enables a multi-fingered robotic hand to learn human-like manipulation strategies more efficiently. By leveraging object affordance maps, our approach generates semantically meaningful grasp pose candidates that serve as both policy constraints and priors during training. We introduce a voting-based grasp classification mechanism to ensure functional alignment between grasp configurations and object affordance regions. Furthermore, we incorporate these constraints into a generalizable RL pipeline and design a reward function that unifies affordance-awareness with task-specific objectives. Experimental results across three manipulation tasks - cube grasping, jug grasping and lifting, and hammer use - demonstrate that our affordance-guided approach improves task success rates by an average of 15.4% compared to baselines. These findings highlight the critical role of object affordance priors in enhancing sample efficiency and learning generalizable, semantically grounded manipulation policies. For more details, please visit our project website https://sites.google.com/view/dora-manip.
