Motion Planning on Visual Manifolds
M Seetha Ramaiah
TL;DR
The thesis proposes Visual Configuration Space (VCS) as a vision-based alternative to explicit configuration spaces for robot motion planning, enabling learning of body structure from images via manifold learning and planning on a Visual Roadmap (VRM). It demonstrates that planning paths over a graph embedded in the image space can handle static and dynamic obstacles without detailed geometry, using Isomap and other NLDR methods to reveal the low-dimensional structure underlying high-dimensional visual data. The work extends the VCS framework to model body schema, infant-like visuomotor learning, and avatar head-motion animation, highlighting the versatility of visually grounded representations for planning, perception, and synthetic motion generation. Key contributions include the Visual Roadmap methodology, collision detection via image overlap and RLE, and demonstrations on planar and spatial robots, as well as a vision-based approach to head animation, all built upon manifold learning foundations. The approach offers a robust, less geometry-dependent alternative with practical implications for real-time planning in changing environments and immersive virtual environments, while acknowledging conservatism in obstacle approximation and the need for multi-modal fusion for stronger robustness.
Abstract
In this thesis, we propose an alternative characterization of the notion of Configuration Space, which we call Visual Configuration Space (VCS). This new characterization allows an embodied agent (e.g., a robot) to discover its own body structure and plan obstacle-free motions in its peripersonal space using a set of its own images in random poses. Here, we do not assume any knowledge of geometry of the agent, obstacles or the environment. We demonstrate the usefulness of VCS in (a) building and working with geometry-free models for robot motion planning, (b) explaining how a human baby might learn to reach objects in its peripersonal space through motor babbling, and (c) automatically generating natural looking head motion animations for digital avatars in virtual environments. This work is based on the formalism of manifolds and manifold learning using the agent's images and hence we call it Motion Planning on Visual Manifolds.
