Towards a Foundation Model for Communication Systems
Davide Buffelli, Sowmen Das, Yu-Wei Lin, Sattar Vakili, Chien-Yi Wang, Masoud Attarifar, Pritthijit Nath, Da-shan Shiu
TL;DR
This work proposes a transformer-based foundation model for communication systems that processes raw, heterogeneous communication data. It addresses domain-specific challenges such as multi-feature inputs, mixed data types, and variable-size representations through tailored tokenization, per-slot feature embeddings, and robust preprocessing. A simulation-based, unsupervised data generation pipeline (via Sionna) supports self-supervised pre-training with masked feature prediction, targeting five key estimable features (transmission rank, selected precoder, Doppler width, and delay-profile center/length). Experiments demonstrate reliable forecasting and interpolation across these features and reveal scaling behavior: larger models trained on more data achieve better estimation accuracy, suggesting practical paths toward scalable foundation models in 6G-type systems. The work lays groundwork for broader datasets, additional features, and community-accessible pipelines to accelerate development of foundation models for communication networks.
Abstract
Artificial Intelligence (AI) has demonstrated unprecedented performance across various domains, and its application to communication systems is an active area of research. While current methods focus on task-specific solutions, the broader trend in AI is shifting toward large general models capable of supporting multiple applications. In this work, we take a step toward a foundation model for communication data--a transformer-based, multi-modal model designed to operate directly on communication data. We propose methodologies to address key challenges, including tokenization, positional embedding, multimodality, variable feature sizes, and normalization. Furthermore, we empirically demonstrate that such a model can successfully estimate multiple features, including transmission rank, selected precoder, Doppler spread, and delay profile.
