A Probabilistic Model for Skill Acquisition with Switching Latent Feedback Controllers

Juyan Zhang; Dana Kulic; Michael Burke

A Probabilistic Model for Skill Acquisition with Switching Latent Feedback Controllers

Juyan Zhang, Dana Kulic, Michael Burke

TL;DR

This paper addresses robust skill acquisition for robotic manipulation by modeling skills as latent switching feedback controllers in a latent space, where latent state $z_t$ and skill index $\delta_t$ govern control. It reinterprets a one-layer network as a latent-space feedback controller and extends MDNs with a probabilistic switching mechanism trained via a novel ELBO that includes a switching-consistency term. The approach yields improved task success, robustness to observation noise, and clearer skill transitions across tasks like Franka Kitchen, FetchPush, and robot handwriting, with demonstrated gains in sample efficiency and interpretability. The work has practical impact for deploying robust, multi-skill policies on real robots in multimodal environments, and opens avenues for nonparametric extensions and latent-dynamics-informed control.

Abstract

Manipulation tasks often consist of subtasks, each representing a distinct skill. Mastering these skills is essential for robots, as it enhances their autonomy, efficiency, adaptability, and ability to work in their environment. Learning from demonstrations allows robots to rapidly acquire new skills without starting from scratch, with demonstrations typically sequencing skills to achieve tasks. Behaviour cloning approaches to learning from demonstration commonly rely on mixture density network output heads to predict robot actions. In this work, we first reinterpret the mixture density network as a library of feedback controllers (or skills) conditioned on latent states. This arises from the observation that a one-layer linear network is functionally equivalent to a classical feedback controller, with network weights corresponding to controller gains. We use this insight to derive a probabilistic graphical model that combines these elements, describing the skill acquisition process as segmentation in a latent space, where each skill policy functions as a feedback control law in this latent space. Our approach significantly improves not only task success rate, but also robustness to observation noise when trained with human demonstrations. Our physical robot experiments further show that the induced robustness improves model deployment on robots.

A Probabilistic Model for Skill Acquisition with Switching Latent Feedback Controllers

TL;DR

Abstract

A Probabilistic Model for Skill Acquisition with Switching Latent Feedback Controllers

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)