Performance Prediction of On-NIC Network Functions with Multi-Resource Contention and Traffic Awareness
Shaofeng Wu, Qiang Su, Zhixiong Niu, Hong Xu
TL;DR
Yala tackles accurate performance prediction for on-NIC NFs under multi-resource contention and dynamic traffic. It introduces a divide-and-compose approach with per-resource contention models (hardware accelerators via a queueing-based white-box model and memory via gradient-boosted regression) and an execution-pattern-based composition, augmented by traffic-aware features and adaptive profiling. Empirical results on BlueField-2 show substantial improvements in prediction accuracy (average MAPE around $3.7\%$) and SLA adherence, enabling effective contention-aware NF placement and fast bottleneck diagnosis. The framework demonstrates strong potential for practical deployment and generalizes to other SoC SmartNICs, with open-source tooling and neural-guidance-free inference benefiting operators and researchers alike.
Abstract
Network function (NF) offloading on SmartNICs has been widely used in modern data centers, offering benefits in host resource saving and programmability. Co-running NFs on the same SmartNICs can cause performance interference due to contention of onboard resources. To meet performance SLAs while ensuring efficient resource management, operators need mechanisms to predict NF performance under such contention. However, existing solutions lack SmartNIC-specific knowledge and exhibit limited traffic awareness, leading to poor accuracy for on-NIC NFs. This paper proposes Yala, a novel performance predictive system for on-NIC NFs. Yala builds upon the key observation that co-located NFs contend for multiple resources, including onboard accelerators and the memory subsystem. It also facilitates traffic awareness according to the behaviors of individual resources to maintain accuracy as the external traffic attributes vary. Evaluation using BlueField-2 SmartNICs shows that Yala improves the prediction accuracy by 78.8% and reduces SLA violations by 92.2% compared to state-of-the-art approaches, and enables new practical usecases.
