From Pixels to Torques with Linear Feedback

Jeong Hun Lee, Sam Schoedel, Aditya Bhardwaj, Zachary Manchester

TL;DR

This work tackles the problem of controlling a dynamic robotic system using only camera images by learning a linear observer-based policy that maps pixels to torques. It introduces a data-efficient teacher-student framework to learn a Luenberger observer from state-feedback demonstrations and combines it with a linear output-feedback controller, with stability guaranteed via a convex LMI constraint. A nonlinear extension based on Koopman embeddings broadens the approach to handle nonlinear dynamics, and the method is validated on a cartpole in both simulation and hardware, showing robustness to noise, disturbances, and occlusions. The results highlight the practicality of image-to-state estimation and linear control theory for vision-based robotics, offering data efficiency, interpretability, and stability guarantees that are advantageous for real-world deployment.
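
As background on the stability constraint mentioned above (this is the standard discrete-time Lyapunov condition that such constraints build on, not the paper's exact formulation): a closed-loop system $x_{k+1} = A_{\mathrm{cl}} x_{k}$ is asymptotically stable if and only if there exists $P = P^{T} \succ 0$ such that

$$A_{\mathrm{cl}}^{T} P A_{\mathrm{cl}} - P \prec 0.$$

This condition is a linear matrix inequality in $P$; the paper enforces closed-loop stability through a convex constraint of this kind during learning.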

Abstract

We demonstrate the effectiveness of simple observer-based linear feedback policies for "pixels-to-torques" control of robotic systems using only a robot-facing camera. Specifically, we show that the matrices of an image-based Luenberger observer (linear state estimator) for a "student" output-feedback policy can be learned from demonstration data provided by a "teacher" state-feedback policy via simple linear-least-squares regression. The resulting linear output-feedback controller maps directly from high-dimensional raw images to torques while being amenable to the rich set of analytical tools from linear systems theory, allowing us to enforce closed-loop stability constraints in the learning problem. We also investigate a nonlinear extension of the method via the Koopman embedding. Finally, we demonstrate the surprising effectiveness of linear pixels-to-torques policies on a cartpole system, both in simulation and on real hardware. The policy successfully executes both stabilizing and swing-up trajectory-tracking tasks using only camera feedback while subject to model mismatch, process and sensor noise, perturbations, and occlusions. Open-source code for all experiments can be found here: https://roboticexplorationlab.org/projects/linear_pixels_to_torques.html
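
To make the least-squares step concrete, the following is a minimal NumPy sketch rather than the paper's implementation: the stacked observer parameterization $\theta = [A \mid B \mid L]$, the dimensions, and the random placeholder data are all assumptions made for illustration.

    import numpy as np

    # Illustrative sketch only: the shapes, placeholder data, and the stacked
    # observer parameterization theta = [A | B | L] are assumptions.
    n, m, p, N = 4, 1, 64, 500           # state dim, input dim, pixels, samples
    rng = np.random.default_rng(0)
    X_hat = rng.normal(size=(N, n))      # teacher state estimates \hat{X}_{1:N}
    U     = rng.normal(size=(N - 1, m))  # control inputs          U_{1:N-1}
    Y_pix = rng.normal(size=(N - 1, p))  # flattened camera images Y^p_{2:N}

    # Linear-least-squares regression: predict the teacher's next state estimate
    # from the current estimate, the applied input, and the next camera image.
    Z = np.hstack([X_hat[:-1], U, Y_pix])                 # (N-1, n+m+p)
    theta, *_ = np.linalg.lstsq(Z, X_hat[1:], rcond=None)
    A_o, B_o, L = theta[:n].T, theta[n:n + m].T, theta[n + m:].T

    # One step of the learned pixel-based observer (Luenberger-style update).
    x_next_est = A_o @ X_hat[0] + B_o @ U[0] + L @ Y_pix[0]

Per the Figure 2 caption, the student's controller is cloned from the teacher's in this work, so the learned observer's state estimates feed directly into that existing feedback gain.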

Paper Structure

This paper contains 22 sections, 31 equations, and 7 figures.

Figures (7)

  • Figure 1: An overview of the problem we address, in which a simple linear output-feedback policy must control a robotic system using only feedback from images. In this work, the robotic system is both a simulated and real-world cartpole.
  • Figure 2: Overview of learning a linear output-feedback policy for pixels-to-torques control. In this work, a linear-quadratic-Gaussian (LQG) "teacher" policy is designed to control a cartpole system. Teacher demonstration data is collected as trajectories of the robot's estimated states, $\hat{X}_{1:N}$; control inputs, $U_{1:N-1}$; and corresponding images from a robot-facing camera, $Y^{p}_{2:N}$. Subsequent linear-least-squares (LLS) regression is performed on the data to determine the parameters, $\theta$, of a "student" policy's pixel-based Luenberger observer (linear state estimator). The student policy's controller can also be learned separately or cloned from the teacher's, as is the case in this work. The solid and dotted lines indicate processes that were performed online and offline, respectively.
  • Figure 3: Stabilization performance vs. number of training trajectories for a pixel-based linear output-feedback policy tasked with stabilizing a cartpole using a robot-facing camera. Test stabilizations from 100 different initial conditions were evaluated, with stabilization error defined as the $L^2$ error of the final state w.r.t. the upright goal state. The median error is shown as a thick line, while the shaded regions represent the 5% to 95% bounds. The corresponding success rate of stabilization from the 100 initial conditions is also shown.
  • Figure 4: Heat-map visualizations of each normalized row of the pixel-based Luenberger observer's gain matrix, $L$. A cartpole visualization is also overlaid for reference. Each row of $L$ corresponds to the correction that an image observation contributes to the respective state variable. Interestingly, distinct visual features emerge for each state variable: the cart for the cart position, with the pole tip added for the pole angle; the velocity rows show similar features.
  • Figure 5: A successful cartpole swing-up performed by a Koopman-based extension of the pixel-based linear output-feedback policy. The policy is able to overcome process noise and model mismatch to track a reference trajectory on a nonlinear system; the Koopman lifting idea is sketched after this figure list.
  • ...and 2 more figures
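
As background on the Koopman-based extension referenced in the abstract and in Figure 5 (a generic sketch; the paper's particular choice of lifting functions is not reproduced here): the state is passed through a feature map $\phi$ to obtain lifted observables $z_{k} = \phi(x_{k})$, chosen so that the dynamics are approximately linear in the lifted space,

$$z_{k+1} \approx A z_{k} + B u_{k},$$

which allows the same linear observer learning and output-feedback control machinery to be applied to a nonlinear task such as the cartpole swing-up.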