A Robust Task-Level Control Architecture for Learned Dynamical Systems
Eshika Pathak, Ahmed Aboudonia, Sandeep Banik, Naira Hovakimyan
TL;DR
This work tackles task-execution mismatch in dynamical-systems–based learning from demonstration by introducing L1-DS, a task-level control architecture that augments nominal learned dynamics with a Control Lyapunov Function (CLF)–based stabilizer and an $\$ adaptive controller. The approach also incorporates a windowed DTW-based target selector to maintain phase-consistent tracking despite temporal misalignments. The key contributions are (i) a robust nominal stabilization layer, (ii) a principled L1 adaptive augmentation to handle matched and unmatched disturbances at the task level, and (iii) a forward-looking DTW-based target selection mechanism that improves phase alignment. Empirical validation on LASA and IROS handwriting datasets demonstrates improved trajectory tracking under various disturbances, highlighting the practical potential of robust task-level control for learned dynamical systems.
Abstract
Dynamical system (DS)-based learning from demonstration (LfD) is a powerful tool for generating motion plans in the operation (`task') space of robotic systems. However, the realization of the generated motion plans is often compromised by a ''task-execution mismatch'', where unmodeled dynamics, persistent disturbances, and system latency cause the robot's actual task-space state to diverge from the desired motion trajectory. We propose a novel task-level robust control architecture, L1-augmented Dynamical Systems (L1-DS), that explicitly handles the task-execution mismatch in tracking a nominal motion plan generated by any DS-based LfD scheme. Our framework augments any DS-based LfD model with a nominal stabilizing controller and an L1 adaptive controller. Furthermore, we introduce a windowed Dynamic Time Warping (DTW)-based target selector, which enables the nominal stabilizing controller to handle temporal misalignment for improved phase-consistent tracking. We demonstrate the efficacy of our architecture on the LASA and IROS handwriting datasets.
