Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps
Yifan Zhu, Pranay Thangeda, Erica L Tevere, Ashish Goel, Erik Kramer, Hari D Nayar, Melkior Ornik, Kris Hauser
TL;DR
This work tackles autonomous extraterrestrial terrain sampling under large domain shifts by formulating a few-shot scooping problem and introducing a vision-based adaptive strategy built on a deep kernel Gaussian process (GP). The core contribution, Deep Kernel Calibration with Maximal Deployment Gaps (kCMD), trains kernels to handle maximal simulated deployment gaps by OT-based task splitting, and integrates this with Bayesian optimization to rapidly adapt to novel terrains with limited experience. Extensive validation includes 5,100 offline scoops across UIUC terrains and zero-shot transfer to NASA OWLAT, where kCMD outperforms non-adaptive baselines and shows strong generalization to out-of-distribution materials. These results demonstrate the potential of training high-capacity models with simulated deployment gaps for robust meta-learning in robotic sampling and underline its applicability to autonomous lander missions facing Earth-to-space deployment gaps.
Abstract
Autonomous lander missions on extraterrestrial bodies need to sample granular materials while coping with domain shifts, even when sampling strategies are extensively tuned on Earth. To tackle this challenge, this paper studies the few-shot scooping problem and proposes a vision-based adaptive scooping strategy that uses the deep kernel Gaussian process method trained with a novel meta-training strategy to learn online from very limited experience on out-of-distribution target terrains. Our Deep Kernel Calibration with Maximal Deployment Gaps (kCMD) strategy explicitly trains a deep kernel model to adapt to large domain shifts by creating simulated maximal deployment gaps from an offline training dataset and training models to overcome these deployment gaps during training. Employed in a Bayesian Optimization sequential decision-making framework, the proposed method allows the robot to perform high-quality scooping actions on out-of-distribution terrains after a few attempts, significantly outperforming non-adaptive methods proposed in the excavation literature as well as other state-of-the-art meta-learning methods. The proposed method also demonstrates zero-shot transfer capability, successfully adapting to the NASA OWLAT platform, which serves as a state-of-the-art simulator for potential future planetary missions. These results demonstrate the potential of training deep models with simulated deployment gaps for more generalizable meta-learning in high-capacity models. Furthermore, they highlight the promise of our method in autonomous lander sampling missions by enabling landers to overcome the deployment gap between Earth and extraterrestrial bodies.
