Adapting to Covariate Shift in Real-time by Encoding Trees with Motion Equations
Tham Yik Foong, Heng Zhang, Mao Po Yuan, Danilo Vasconcellos Vargas
TL;DR
The study tackles covariate shift in online settings by proposing Xenovert, a motion-equation–driven, perfect binary tree that adaptively partitions the input space into quasi-quantile intervals $q_i$ to maintain alignment between source and shifted target distributions without retraining. Xenovert updates the quasi-quantiles via $q_{i,t+1} = q_{i,t} + \alpha v_{i,t+1} s$ with $v_{i,t+1} = \theta v_{i,t} + |q_{i,t}-x_t|$, propagating updates from the root to leaves and converting inputs to interval indices, yielding an $O(NL)$ complexity. When integrated with a neural network, Xenovert improved robustness to covariate shift, delivering best performance in 4 of 5 shifted datasets and preserving accuracy on severely shifted Iris data, while also reducing regression error in Abalone tasks compared to a plain MLP. The approach offers a simple, online alternative to reweighting methods, with potential extensions to high-dimensional inputs and broader applications in systems facing unforeseen distribution changes. Overall, Xenovert provides a practical, low-overhead mechanism for maintaining model relevance under non-stationary environments by continuously adapting input representations without full retraining.
Abstract
Input distribution shift presents a significant problem in many real-world systems. Here we present Xenovert, an adaptive algorithm that can dynamically adapt to changes in input distribution. It is a perfect binary tree that adaptively divides a continuous input space into several intervals of uniform density while receiving a continuous stream of input. This process indirectly maps the source distribution to the shifted target distribution, preserving the data's relationship with the downstream decoder/operation, even after the shift occurs. In this paper, we demonstrated how a neural network integrated with Xenovert achieved better results in 4 out of 5 shifted datasets, saving the hurdle of retraining a machine learning model. We anticipate that Xenovert can be applied to many more applications that require adaptation to unforeseen input distribution shifts, even when the distribution shift is drastic.
