Federated Online Adaptation for Deep Stereo
Matteo Poggi, Fabio Tosi
TL;DR
The paper tackles domain shift in real-time stereo by proposing a federated online adaptation framework (FedFULL and FedMAD) that distributes the expensive optimization across a fleet of devices. It introduces MADNet 2, a lightweight backbone that uses all-pairs correlation volumes to expand the effective search range while maintaining efficiency. Experiments across KITTI, DrivingStereo, and DSEC demonstrate that federated adaptation can match or surpass on-device adaptation in accuracy while preserving real-time speeds on constrained hardware, with FedMAD offering substantial reductions in communication. This approach enables scalable, cloud-assisted adaptation for fleets of autonomous systems while keeping per-device latency low, with implications for robust depth sensing in diverse deployment conditions.
Abstract
We introduce a novel approach for adapting deep stereo networks in a collaborative manner. By building over principles of federated learning, we develop a distributed framework allowing for demanding the optimization process to a number of clients deployed in different environments. This makes it possible, for a deep stereo network running on resourced-constrained devices, to capitalize on the adaptation process carried out by other instances of the same architecture, and thus improve its accuracy in challenging environments even when it cannot carry out adaptation on its own. Experimental results show how federated adaptation performs equivalently to on-device adaptation, and even better when dealing with challenging environments.
