DROP: Dexterous Reorientation via Online Planning
Albert H. Li, Preston Culbertson, Vince Kurtz, Aaron D. Ames
TL;DR
DROP investigates online planning for dexterous in-hand cube reorientation by replacing offline RL with a lightweight sampling-based predictive controller (SPC) and a vision-based pose estimator. The system uses a simple cross-entropy (CEM) or predictive sampling (PS) planner, parallel rollouts via real-time simulation, and a collision-aware corrector to maintain feasible state estimates, achieving hardware performance close to RL baselines while offering flexibility to task changes without retraining. Key findings show that CEM-based SPC with depth perception, a pose-smoothing pipeline, and an online corrector yields robust, contact-rich rotation sequences; ablations reveal the importance of the corrector, sufficient rollout count, and depth for stability, while robustness tests demonstrate CEM’s resilience to model and estimation error. Overall, the work demonstrates that online planning can be a viable path for dexterous manipulation, with potential for extending to more objects and tools given improvements in perception and search efficiency.
Abstract
Achieving human-like dexterity is a longstanding challenge in robotics, in part due to the complexity of planning and control for contact-rich systems. In reinforcement learning (RL), one popular approach has been to use massively-parallelized, domain-randomized simulations to learn a policy offline over a vast array of contact conditions, allowing robust sim-to-real transfer. Inspired by recent advances in real-time parallel simulation, this work considers instead the viability of online planning methods for contact-rich manipulation by studying the well-known in-hand cube reorientation task. We propose a simple architecture that employs a sampling-based predictive controller and vision-based pose estimator to search for contact-rich control actions online. We conduct thorough experiments to assess the real-world performance of our method, architectural design choices, and key factors for robustness, demonstrating that our simple sampling-based approach achieves performance comparable to prior RL-based works. Supplemental material: https://caltech-amber.github.io/drop.
