Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation
Nitesh Subedi, Hsin-Jung Yang, Devesh K. Jha, Soumik Sarkar
TL;DR
This work tackles occlusion-heavy fruit harvesting with deformable plants by learning a policy in a high-fidelity FEM-based simulator and transferring it zero-shot to real hardware. The approach decouples high-level kinematic planning from a low-level compliant controller and uses domain randomization to bridge the sim-to-real gap, aided by training-time privileged fruit masks. Results show up to 86.7% real-world success across stemmed plants and demonstrate generalization to sequential multi-fruit exposure, with ablations illustrating the importance of simulation fidelity and training aids. The study suggests that structurally faithful abstract plant models, coupled with robust perception and compliant actuation, can enable scalable autonomous harvesting in cluttered, deformable environments.
Abstract
Autonomous harvesting in the open presents a complex manipulation problem. In most scenarios, an autonomous system has to deal with significant occlusion and require interaction in the presence of large structural uncertainties (every plant is different). Perceptual and modeling uncertainty make design of reliable manipulation controllers for harvesting challenging, resulting in poor performance during deployment. We present a sim2real reinforcement learning (RL) framework for occlusion-aware plant manipulation, where a policy is learned entirely in simulation to reposition stems and leaves to reveal target fruit(s). In our proposed approach, we decouple high-level kinematic planning from low-level compliant control which simplifies the sim2real transfer. This decomposition allows the learned policy to generalize across multiple plants with different stiffness and morphology. In experiments with multiple real-world plant setups, our system achieves up to 86.7% success in exposing target fruits, demonstrating robustness to occlusion variation and structural uncertainty.
