Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions
Isaac Remy, David Fridovich-Keil, Karen Leung
TL;DR
This work introduces a data-driven framework to learn how agents share responsibility for maintaining safety in multi-agent interactions using Control Barrier Functions (CBFs) and differentiable optimization. By defining a responsibility vector $\boldsymbol{\gamma}$ and a CBF-based safety filter, the authors formulate a bi-level, differentiable optimization problem to infer how much each agent should adjust its control to satisfy collision-avoidance constraints, from data. They further impose symmetry and relative-coordinate formulations to improve data efficiency and interpretability, and demonstrate results on synthetic multi-agent dynamics as well as real traffic-weaving data, revealing intuitive, interpretable responsibility allocations and highlighting limitations in multimodal settings. The approach enables offline analysis and potential online estimation of responsibility, offering a principled lens to evaluate socially-aware autonomous policies and guide robot control design. Key future directions include learning the desired-control policy from data, extending to probabilistic formulations to handle multimodal interactions, and leveraging inferred responsibilities to inform policy construction in real-world systems.
Abstract
From autonomous driving to package delivery, ensuring safe yet efficient multi-agent interaction is challenging as the interaction dynamics are influenced by hard-to-model factors such as social norms and contextual cues. Understanding these influences can aid in the design and evaluation of socially-aware autonomous agents whose behaviors are aligned with human values. In this work, we seek to codify factors governing safe multi-agent interactions via the lens of responsibility, i.e., an agent's willingness to deviate from their desired control to accommodate safe interaction with others. Specifically, we propose a data-driven modeling approach based on control barrier functions and differentiable optimization that efficiently learns agents' responsibility allocation from data. We demonstrate on synthetic and real-world datasets that we can obtain an interpretable and quantitative understanding of how much agents adjust their behavior to ensure the safety of others given their current environment.
