Dex-NeRF: Using a Neural Radiance Field to Grasp Transparent Objects
Jeffrey Ichnowski, Yahav Avigal, Justin Kerr, Ken Goldberg
TL;DR
Dex-NeRF demonstrates that Neural Radiance Fields can recover the geometry of transparent objects well enough to support robust robotic grasping. By rendering a transparency-aware depth map from NeRF and feeding it into Dex-Net, and by strategically placing lights to induce informative specular reflections, the method substantially improves grasp success on transparent objects. The work contributes (i) a NeRF-based pipeline integrated with robot grasp planning, (ii) a depth-rendering approach tailored for transparency, and (iii) synthetic and real datasets capturing transparent scenes; physical experiments yield 90–100% grasp success on ABB YuMi, outperforming baselines. This approach broadens automated manipulation capabilities in cluttered, transparent-rich environments such as kitchens and warehouses, and highlights practical considerations for camera arrays in robot workcells.
Abstract
The ability to grasp and manipulate transparent objects is a major challenge for robots. Existing depth cameras have difficulty detecting, localizing, and inferring the geometry of such objects. We propose using neural radiance fields (NeRF) to detect, localize, and infer the geometry of transparent objects with sufficient accuracy to find and grasp them securely. We leverage NeRF's view-independent learned density, place lights to increase specular reflections, and perform a transparency-aware depth-rendering that we feed into the Dex-Net grasp planner. We show how additional lights create specular reflections that improve the quality of the depth map, and test a setup for a robot workcell equipped with an array of cameras to perform transparent object manipulation. We also create synthetic and real datasets of transparent objects in real-world settings, including singulated objects, cluttered tables, and the top rack of a dishwasher. In each setting we show that NeRF and Dex-Net are able to reliably compute robust grasps on transparent objects, achieving 90% and 100% grasp success rates in physical experiments on an ABB YuMi, on objects where baseline methods fail.
