Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

Isaac Remy; David Fridovich-Keil; Karen Leung

Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

Isaac Remy, David Fridovich-Keil, Karen Leung

TL;DR

This work introduces a data-driven framework to learn how agents share responsibility for maintaining safety in multi-agent interactions using Control Barrier Functions (CBFs) and differentiable optimization. By defining a responsibility vector $\boldsymbol{\gamma}$ and a CBF-based safety filter, the authors formulate a bi-level, differentiable optimization problem to infer how much each agent should adjust its control to satisfy collision-avoidance constraints, from data. They further impose symmetry and relative-coordinate formulations to improve data efficiency and interpretability, and demonstrate results on synthetic multi-agent dynamics as well as real traffic-weaving data, revealing intuitive, interpretable responsibility allocations and highlighting limitations in multimodal settings. The approach enables offline analysis and potential online estimation of responsibility, offering a principled lens to evaluate socially-aware autonomous policies and guide robot control design. Key future directions include learning the desired-control policy from data, extending to probabilistic formulations to handle multimodal interactions, and leveraging inferred responsibilities to inform policy construction in real-world systems.

Abstract

From autonomous driving to package delivery, ensuring safe yet efficient multi-agent interaction is challenging as the interaction dynamics are influenced by hard-to-model factors such as social norms and contextual cues. Understanding these influences can aid in the design and evaluation of socially-aware autonomous agents whose behaviors are aligned with human values. In this work, we seek to codify factors governing safe multi-agent interactions via the lens of responsibility, i.e., an agent's willingness to deviate from their desired control to accommodate safe interaction with others. Specifically, we propose a data-driven modeling approach based on control barrier functions and differentiable optimization that efficiently learns agents' responsibility allocation from data. We demonstrate on synthetic and real-world datasets that we can obtain an interpretable and quantitative understanding of how much agents adjust their behavior to ensure the safety of others given their current environment.

Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

TL;DR

and a CBF-based safety filter, the authors formulate a bi-level, differentiable optimization problem to infer how much each agent should adjust its control to satisfy collision-avoidance constraints, from data. They further impose symmetry and relative-coordinate formulations to improve data efficiency and interpretability, and demonstrate results on synthetic multi-agent dynamics as well as real traffic-weaving data, revealing intuitive, interpretable responsibility allocations and highlighting limitations in multimodal settings. The approach enables offline analysis and potential online estimation of responsibility, offering a principled lens to evaluate socially-aware autonomous policies and guide robot control design. Key future directions include learning the desired-control policy from data, extending to probabilistic formulations to handle multimodal interactions, and leveraging inferred responsibilities to inform policy construction in real-world systems.

Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

TL;DR

Abstract

Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (10)