On Sample Selection for Continual Learning: a Video Streaming Case Study
Alexander Dietmüller, Romain Jacob, Laurent Vanbever
TL;DR
The paper tackles the challenge of continual learning for ML-based adaptive bitrate in evolving networks by focusing on tail performance rather than just average metrics. It introduces Memento, a density-based sample-space coverage method that selects training samples from low-density regions to maximize coverage and trigger retraining only when new information arises. Through real-world deployment on the Puffer ABR system and synthetic-shift simulations, Memento achieves a 14% reduction in stalls with modest SSIM impact and demonstrates robust, architecture-agnostic benefits. The work situates density-based sampling as complementary to existing strategies like JTT and MatchMaker, offering practical improvements for tail reliability and providing artifacts for reproducibility and broader application in networking ML tasks.
Abstract
Machine learning (ML) is a powerful tool to model the complexity of communication networks. As networks evolve, we cannot only train once and deploy. Retraining models, known as continual learning, is necessary. Yet, to date, there is no established methodology to answer the key questions: With which samples to retrain? When should we retrain? We address these questions with the sample selection system Memento, which maintains a training set with the "most useful" samples to maximize sample space coverage. Memento particularly benefits rare patterns -- the notoriously long "tail" in networking -- and allows assessing rationally when retraining may help, i.e., when the coverage changes. We deployed Memento on Puffer, the live-TV streaming project, and achieved a 14% reduction of stall time, 3.5x the improvement of random sample selection. Finally, Memento does not depend on a specific model architecture; it is likely to yield benefits in other ML-based networking applications.
