Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments
Mayank Mittal, Calvin Yu, Qinxi Yu, Jingzhou Liu, Nikita Rudin, David Hoeller, Jia Lin Yuan, Ritvik Singh, Yunrong Guo, Hammad Mazhar, Ajay Mandlekar, Buck Babich, Gavriel State, Marco Hutter, Animesh Garg
TL;DR
Orbit provides a fast, realistic, and extensible robot-learning simulation platform by unifying rigid and deformable physics within NVIDIA Isaac Sim and PhysX 5. It offers a rich collection of robots, sensors, motion generators, and tasks, plus wrappers for multiple RL and IL frameworks, enabling RL, imitation learning, and motion planning at scale. The framework emphasizes modular abstractions (World, Agent, and learning tasks) and demonstrates strong throughput and sim-to-real potential through exemplar workflows in RL, IL, and motion planning. By combining high-fidelity simulation, versatile scene design, and broad task coverage, Orbit aims to lower barriers to entry and catalyze interdisciplinary robotics research.
Abstract
We present Orbit, a unified and modular framework for robot learning powered by NVIDIA Isaac Sim. It offers a modular design to easily and efficiently create robotic environments with photo-realistic scenes and high-fidelity rigid and deformable body simulation. With Orbit, we provide a suite of benchmark tasks of varying difficulty -- from single-stage cabinet opening and cloth folding to multi-stage tasks such as room reorganization. To support working with diverse observations and action spaces, we include fixed-arm and mobile manipulators with different physically-based sensors and motion generators. Orbit allows training reinforcement learning policies and collecting large demonstration datasets from hand-crafted or expert solutions in a matter of minutes by leveraging GPU-based parallelization. In summary, we offer an open-sourced framework that readily comes with 16 robotic platforms, 4 sensor modalities, 10 motion generators, more than 20 benchmark tasks, and wrappers to 4 learning libraries. With this framework, we aim to support various research areas, including representation learning, reinforcement learning, imitation learning, and task and motion planning. We hope it helps establish interdisciplinary collaborations in these communities, and its modularity makes it easily extensible for more tasks and applications in the future.
