Learning to Represent Surroundings, Anticipate Motion and Take Informed Actions in Unstructured Environments
Weiming Zhi
TL;DR
The thesis tackles enabling robots to operate in unstructured, dynamic environments by learning representations of surroundings, predicting motion, and selecting informed actions. It develops continuous occupancy representations (Fast-BHM) and continuous spatiotemporal maps to model environmental structure and motion, enabling scalable multi-agent mapping and long-horizon behavior understanding. It then introduces anticipatory navigation through SPAN and OTNet, along with a probabilistic-constraint framework to fuse learning with environment structure, and extends to robot manipulators using Diffeomorphic Templates and Geometric Fabric Command Sequences for globally feasible yet reactive motion. Collectively, the work demonstrates end-to-end learning-and-planning pipelines that exploit environment representations, probabilistic motion predictions, and stable, generalisable control for safe, efficient robot operation in unstructured spaces. These contributions offer scalable, data-driven tools for real-world robotics, with potential impact across autonomous driving, service robots, and collaborative manipulation in crowded or complex environments.
Abstract
Contemporary robots have become exceptionally skilled at achieving specific tasks in structured environments. However, they often fail when faced with the limitless permutations of real-world unstructured environments. This motivates robotics methods which learn from experience, rather than follow a pre-defined set of rules. In this thesis, we present a range of learning-based methods aimed at enabling robots, operating in dynamic and unstructured environments, to better understand their surroundings, anticipate the actions of others, and take informed actions accordingly.
