Subspace-Configurable Networks
Dong Wang, Olga Saukh, Xiaoxi He, Lothar Thiele
TL;DR
Subspace-Configurable Networks (SCNs) tackle robustness to sensor drift and deployment-domain shifts by learning a low-dimensional configuration subspace that covers optimal models for a family of input transformations. A configuration network maps transformation parameters $\alpha$ to a coefficient vector $\beta$ that combines base models $\theta_i$ as $\theta = \sum_{i=1}^{D} \beta_i \theta_i$, enabling efficient post-deployment adaptation on edge devices. The authors provide theoretical continuity results linking continuous transformation curves to continuous weight curves, show that small $D$ suffices across diverse transformations, and demonstrate strong empirical performance across 10 transformations, multiple datasets, and hardware-constrained settings, including IoT deployments. They also introduce an $\\
Abstract
While the deployment of deep learning models on edge devices is increasing, these models often lack robustness when faced with dynamic changes in sensed data. This can be attributed to sensor drift, or variations in the data compared to what was used during offline training due to factors such as specific sensor placement or naturally changing sensing conditions. Hence, achieving the desired robustness necessitates the utilization of either an invariant architecture or specialized training approaches, like data augmentation techniques. Alternatively, input transformations can be treated as a domain shift problem, and solved by post-deployment model adaptation. In this paper, we train a parameterized subspace of configurable networks, where an optimal network for a particular parameter setting is part of this subspace. The obtained subspace is low-dimensional and has a surprisingly simple structure even for complex, non-invertible transformations of the input, leading to an exceptionally high efficiency of subspace-configurable networks (SCNs) when limited storage and computing resources are at stake.
