A Functional Learning Approach for Team-Optimal Traffic Coordination

Weihao Sun, Gehui Xu, Alessio Moreschini, Thomas Parisini, Andreas A. Malikopoulos

Abstract

In this paper, we develop a kernel-based policy iteration functional learning framework for computing team-optimal strategies in traffic coordination problems. We consider a multi-agent discrete-time linear system with a cost function that combines quadratic regulation terms and nonlinear safety penalties. Building on the Hilbert space formulation of offline receding-horizon policy iteration, we seek approximate solutions within a reproducing kernel Hilbert space, where the policy improvement step is implemented via a discrete Fréchet derivative. We further study the model-free receding-horizon scenario, where the system dynamics are estimated using recursive least squares, followed by updating the policy using rolling online data. The proposed method is tested in signal-free intersection scenarios via both model-based and model-free simulations and validated in SUMO.
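The model-free variant estimates the linear dynamics from trajectory data via recursive least squares before each policy update. As a rough illustration of that identification ingredient only (a minimal sketch, not the paper's implementation; the function name `rls_identify` and its parameters are hypothetical):

```python
import numpy as np

def rls_identify(xs, us, lam=1.0):
    """Estimate [A B] in x_{t+1} = A x_t + B u_t via recursive least squares.

    xs: (T+1, n) state trajectory; us: (T, m) input sequence;
    lam: forgetting factor in (0, 1] (1.0 = standard least squares).
    Returns estimates of A (n x n) and B (n x m).
    """
    n, m = xs.shape[1], us.shape[1]
    theta = np.zeros((n, n + m))        # stacked parameter estimate [A B]
    P = 1e3 * np.eye(n + m)             # covariance; large initial value
    for t in range(us.shape[0]):
        phi = np.concatenate([xs[t], us[t]])   # regressor [x_t; u_t]
        y = xs[t + 1]                          # target x_{t+1}
        K = P @ phi / (lam + phi @ P @ phi)    # RLS gain vector
        theta += np.outer(y - theta @ phi, K)  # correct estimate by residual
        P = (P - np.outer(K, phi @ P)) / lam   # covariance downdate
    return theta[:, :n], theta[:, n:]
```

With persistently exciting inputs and noiseless data, the estimate converges to the true `[A B]` up to the small bias introduced by the initial covariance.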

Paper Structure

This paper contains 18 sections, 3 theorems, 37 equations, 7 figures, and 2 algorithms.

Key Result

Theorem 1

Consider the policy iteration defined by the cost functional $\tilde{J}_t$ in eq:piimprove_cost. Let the policy update at each iteration $k$ be governed by the implicit rule eq:implicit_update with learning rate $\delta>0$. Then, for every iteration $k$ and any $\delta>0$, the updated policy does not increase the cost. Therefore, the sequence $\{\widetilde{J}_0(\pi_{0:T-1}^{k})\}_{k\ge0}$ is monotonically non-increasing and converges as $k\to\infty$.
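The key feature of the theorem is that the implicit update is cost-non-increasing for any step size $\delta>0$. A toy sanity check of this property (a sketch under strong simplifying assumptions: a quadratic cost with illustrative matrix `Q`, not the paper's functional $\tilde{J}_t$ or update rule) uses the proximal-point form of an implicit gradient step:

```python
import numpy as np

# Toy quadratic stand-in for a cost functional (Q is illustrative).
Q = np.array([[2.0, 0.5], [0.5, 1.0]])
J = lambda p: 0.5 * p @ Q @ p

# Implicit update pi_{k+1} = pi_k - delta * grad J(pi_{k+1});
# for a quadratic J it solves to pi_{k+1} = (I + delta*Q)^{-1} pi_k.
delta = 10.0  # deliberately large: an explicit gradient step would diverge here
M = np.linalg.inv(np.eye(2) + delta * Q)

pi = np.array([3.0, -2.0])
costs = [J(pi)]
for _ in range(20):
    pi = M @ pi
    costs.append(J(pi))
# costs is monotonically non-increasing, regardless of how large delta is
```

The contrast with an explicit step (which diverges for this `delta`, since the eigenvalues of $I-\delta Q$ exceed one in magnitude) illustrates why the implicit rule admits the "any $\delta>0$" guarantee.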

Figures (7)

  • Figure 3: Offline cost (left) and vehicle positions (right).
  • Figure 4: Vehicle speeds (left) and controlled acceleration (right).
  • Figure 5: RLS identification error (left) and rolling window cost (right).
  • Figure 6: CAV control inputs (left) and vehicle speeds (right).
  • Figure 7: Pairwise vehicle distances (left) and trajectories (right).
  • ...and 2 more figures

Theorems & Definitions (9)

  • Definition 1: Team-optimal Solution
  • Definition 2: Fréchet Differentiable
  • Definition 3: Fréchet Derivative
  • Theorem 1
  • Proof
  • Lemma 1
  • Proof
  • Theorem 2
  • Proof