ArtReg: Visuo-Tactile based Pose Tracking and Manipulation of Unseen Articulated Objects
Prajval Kumar Murali, Mohsen Kaboli
TL;DR
This paper tackles the challenge of tracking and manipulating unseen articulated objects without prior geometric or kinematic knowledge. It introduces ArtReg, a SE(3) Lie-group Unscented Kalman Filter that fuses visuo-tactile data for robust 6-DoF pose tracking, coupled with interactive perception to detect articulation and a closed-loop visuo-tactile controller for goal-driven manipulation. The framework is validated through extensive real-robot experiments with two robots, demonstrating reliable articulation detection, accurate pose tracking (often sub-centimeter to a few centimeters), and successful manipulation under challenging lighting and backgrounds. Benchmarking on the PartNet-Mobility dataset indicates significant performance gains over state-of-the-art baselines, underscoring the practical impact of combining tactile sensing with visual perception for autonomous interaction with unseen articulated objects.
Abstract
Robots operating in real-world environments frequently encounter unknown objects with complex structures and articulated components, such as doors, drawers, cabinets, and tools. The ability to perceive, track, and manipulate these objects without prior knowledge of their geometry or kinematic properties remains a fundamental challenge in robotics. In this work, we present a novel method for visuo-tactile-based tracking of unseen objects (single, multiple, or articulated) during robotic interaction without assuming any prior knowledge regarding object shape or dynamics. Our novel pose tracking approach termed ArtReg (stands for Articulated Registration) integrates visuo-tactile point clouds in an unscented Kalman Filter formulation in the SE(3) Lie Group for point cloud registration. ArtReg is used to detect possible articulated joints in objects using purposeful manipulation maneuvers such as pushing or hold-pulling with a two-robot team. Furthermore, we leverage ArtReg to develop a closed-loop controller for goal-driven manipulation of articulated objects to move the object into the desired pose configuration. We have extensively evaluated our approach on various types of unknown objects through real robot experiments. We also demonstrate the robustness of our method by evaluating objects with varying center of mass, low-light conditions, and with challenging visual backgrounds. Furthermore, we benchmarked our approach on a standard dataset of articulated objects and demonstrated improved performance in terms of pose accuracy compared to state-of-the-art methods. Our experiments indicate that robust and accurate pose tracking leveraging visuo-tactile information enables robots to perceive and interact with unseen complex articulated objects (with revolute or prismatic joints).
