Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

Carlos Güemes-Palau; Miquel Ferriol-Galmés; Jordi Paillisse-Vilanova; Albert López-Brescó; Pere Barlet-Ros; Albert Cabellos-Aparicio

Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

Carlos Güemes-Palau, Miquel Ferriol-Galmés, Jordi Paillisse-Vilanova, Albert López-Brescó, Pere Barlet-Ros, Albert Cabellos-Aparicio

TL;DR

The paper addresses the data scarcity and domain shift problem confronted by ML-based network performance models trained on simulated data but deployed in real networks. It introduces a transfer-learning pipeline that pre-trains RouteNet-Fermi on abundant simulated data and then fine-tunes the model with a small real-world dataset, exploring both manual and automated weight-transfer strategies. Experiments with OMNeT++ simulations and a real testbed show substantial improvements in predictive accuracy, with up to 88% MAPE reduction for Poisson traffic, 80% for On/Off, and 48% for MAWI when using 50 real scenarios (37% with only 10 scenarios). The study demonstrates data-efficient adaptation of network predictors to real environments and provides practical guidance on selecting fine-tuning configurations, thereby reducing reliance on large real-world datasets while preserving performance.

Abstract

Machine Learning (ML)-based network models provide fast and accurate predictions for complex network behaviors but require substantial training data. Collecting such data from real networks is often costly and limited, especially for critical scenarios like failures. As a result, researchers commonly rely on simulated data, which reduces accuracy when models are deployed in real environments. We propose a hybrid approach leveraging transfer learning to combine simulated and real-world data. Using RouteNet-Fermi, we show that fine-tuning a pre-trained model with a small real dataset significantly improves performance. Our experiments with OMNeT++ and a custom testbed reduce the Mean Absolute Percentage Error (MAPE) in packet delay prediction by up to 88%. With just 10 real scenarios, MAPE drops by 37%, and with 50 scenarios, by 48%.

Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

TL;DR

Abstract

Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)