RCCDA: Adaptive Model Updates in the Presence of Concept Drift under a Constrained Resource Budget
Adam Piaseczny, Md Kamran Chowdhury Shisher, Shiqiang Wang, Christopher G. Brinton
TL;DR
RCCDA addresses real-time model adaptation under concept drift with strict resource budgets by deriving a Lyapunov drift-plus-penalty-based threshold policy that decides updates using only past loss information. The method provides provable bounds on both convergence (time-averaged gradient norms) and budget compliance (update-frequency constraint) while remaining lightweight and distribution-ignorant. The authors validate RCCDA on four domain-generalization datasets under multiple drift schedules, showing improved accuracy and faster drift recovery compared with baseline policies, all within the specified resource budget. The approach is practical for edge and real-time deployments, offering a principled, low-overhead alternative to drift-detection-based schemes and highlighting trade-offs between update frequency, drift, and performance.
Abstract
Machine learning (ML) algorithms deployed in real-world environments are often faced with the challenge of adapting models to concept drift, where the task data distributions are shifting over time. The problem becomes even more difficult when model performance must be maintained under adherence to strict resource constraints. Existing solutions often depend on drift-detection methods that produce high computational overhead for resource-constrained environments, and fail to provide strict guarantees on resource usage or theoretical performance assurances. To address these shortcomings, we propose RCCDA: a dynamic model update policy that optimizes ML training dynamics while ensuring compliance to predefined resource constraints, utilizing only past loss information and a tunable drift threshold. In developing our policy, we analytically characterize the evolution of model loss under concept drift with arbitrary training update decisions. Integrating these results into a Lyapunov drift-plus-penalty framework produces a lightweight greedy-optimal policy that provably limits update frequency and cost. Experimental results on four domain generalization datasets demonstrate that our policy outperforms baseline methods in inference accuracy while adhering to strict resource constraints under several schedules of concept drift, making our solution uniquely suited for real-time ML deployments.
