FIGRET: Fine-Grained Robustness-Enhanced Traffic Engineering
Ximeng Liu, Shizhen Zhao, Yong Cui, Xinbing Wang
TL;DR
Traffic Engineering faces a fundamental tension between normal-case performance and robustness to traffic bursts. FIGRET addresses this by applying fine-grained robustness that is customized per source-destination pair through a burst-aware loss and path-sensitivity constraints, learned in an end-to-end fashion. The method demonstrates substantial improvements in $MLU$ and solution speed over Jupiter and DOTE across WAN and DC topologies, and provides resilience to failures and traffic drift with limited retraining. This work offers a practical, scalable approach to robust TE that can adapt to diverse traffic characteristics while maintaining high utilization and responsiveness.
Abstract
Traffic Engineering (TE) is critical for improving network performance and reliability. A key challenge in TE is the management of sudden traffic bursts. Existing TE schemes either do not handle traffic bursts or uniformly guard against traffic bursts, thereby facing difficulties in achieving a balance between normal-case performance and burst-case performance. To address this issue, we introduce FIGRET, a Fine-Grained Robustness-Enhanced TE scheme. FIGRET offers a novel approach to TE by providing varying levels of robustness enhancements, customized according to the distinct traffic characteristics of various source-destination pairs. By leveraging a burst-aware loss function and deep learning techniques, FIGRET is capable of generating high-quality TE solutions efficiently. Our evaluations of real-world production networks, including Wide Area Networks and data centers, demonstrate that FIGRET significantly outperforms existing TE schemes. Compared to the TE scheme currently deployed in Jupiter data center networks of Google, FIGRET achieves a 9\%-34\% reduction in average Maximum Link Utilization and improves solution speed by $35\times$-$1800 \times$. Against DOTE, a state-of-the-art deep learning-based TE method, FIGRET substantially lowers the occurrence of significant congestion events triggered by traffic bursts by 41\%-53.9\% in topologies with high traffic dynamics.
