Interference-Aware Edge Runtime Prediction with Conformal Matrix Completion
Tianshu Huang, Arjun Ramesh, Emily Ruppel, Nuno Pereira, Anthony Rowe, Carlee Joe-Wong
TL;DR
The paper addresses edge-runtime prediction under interference in heterogeneous devices by formulating runtime estimation as an interference-aware matrix completion problem. It proposes Pitot, a two-tower matrix factorization model that incorporates workload/platform features, a low-rank interference term, and log-residual training, together with Conformalized Quantile Regression for calibrated, tight uncertainty bounds. On a WebAssembly-based dataset spanning 24 devices and 249 workloads, Pitot achieves about 5.2% mean absolute percent error and provides tight, calibrated prediction intervals, outperforming baselines by up to 2x in accuracy and significantly improving bound tightness. The work demonstrates strong data efficiency, interpretable embeddings, and practical potential for edge orchestration, with open avenues for online learning and broader benchmarking datasets.
Abstract
Accurately estimating workload runtime is a longstanding goal in computer systems, and plays a key role in efficient resource provisioning, latency minimization, and various other system management tasks. Runtime prediction is particularly important for managing increasingly complex distributed systems in which more sophisticated processing is pushed to the edge in search of better latency. Previous approaches for runtime prediction in edge systems suffer from poor data efficiency or require intensive instrumentation; these challenges are compounded in heterogeneous edge computing environments, where historical runtime data may be sparsely available and instrumentation is often challenging. Moreover, edge computing environments often feature multi-tenancy due to limited resources at the network edge, potentially leading to interference between workloads and further complicating the runtime prediction problem. Drawing from insights across machine learning and computer systems, we design a matrix factorization-inspired method that generates accurate interference-aware predictions with tight provably-guaranteed uncertainty bounds. We validate our method on a novel WebAssembly runtime dataset collected from 24 unique devices, achieving a prediction error of 5.2% -- 2x better than a naive application of existing methods.
