ChatSim: Underwater Simulation with Natural Language Prompting
Aadi Palnitkar, Rashmi Kapu, Xiaomin Lin, Cheng Liu, Nare Karapetyan, Yiannis Aloimonos
TL;DR
This work addresses the barrier of coding underwater simulations by integrating a Large Language Model with the OysterSim/Blender pipeline to control a BlueROV via natural language. The authors implement a constrained function library and prompting framework to translate NL instructions into executable Python calls that manipulate the simulation and capture imagery. Four experiments demonstrate movement, object placement, region editing, and trajectory generation driven by NL prompts, highlighting improvements in accessibility and rapid scenario testing. The approach holds promise for accelerating marine robotics research and environmental monitoring by reducing setup time and enabling diverse, realistic test environments.
Abstract
Robots are becoming an essential part of many operations including marine exploration or environmental monitoring. However, the underwater environment presents many challenges, including high pressure, limited visibility, and harsh conditions that can damage equipment. Real-world experimentation can be expensive and difficult to execute. Therefore, it is essential to simulate the performance of underwater robots in comparable environments to ensure their optimal functionality within practical real-world contexts.OysterSim generates photo-realistic images and segmentation masks of objects in marine environments, providing valuable training data for underwater computer vision applications. By integrating ChatGPT into underwater simulations, users can convey their thoughts effortlessly and intuitively create desired underwater environments without intricate coding. \invis{Moreover, researchers can realize substantial time and cost savings by evaluating their algorithms across diverse underwater conditions in the simulation.} The objective of ChatSim is to integrate Large Language Models (LLM) with a simulation environment~(OysterSim), enabling direct control of the simulated environment via natural language input. This advancement can greatly enhance the capabilities of underwater simulation, with far-reaching benefits for marine exploration and broader scientific research endeavors.
