Interpreting Manifolds and Graph Neural Embeddings from Internet of Things Traffic Flows
Enrique Feito-Casares, Francisco M. Melgarejo-Meseguer, Elena Casiraghi, Giorgio Valentini, José-Luis Rojo-Álvarez
TL;DR
This work addresses the challenge of interpreting high-dimensional GNN embeddings for IoT traffic by embedding them onto a low-dimensional latent manifold using a jointly trained, MnL-informed framework. It combines a GIN-based graph representation with a parametric P-UMAP projection and SHAP-based feature attribution to produce directly visualizable embeddings that preserve topology and offer explanations. The approach achieves a Binary F1 score of 0.830 for intrusion detection and reveals concept drift phenomena where evolving botnet behavior mimics DoS patterns, highlighting both strengths and challenges of multiclass separation. Practically, the method enables interpretable network monitoring and interoperability in dynamic IoT environments, guiding security analysts and administrators through topological and semantic shifts in traffic behavior.
Abstract
The rapid expansion of Internet of Things (IoT) ecosystems has led to increasingly complex and heterogeneous network topologies. Traditional network monitoring and visualization tools rely on aggregated metrics or static representations, which fail to capture the evolving relationships and structural dependencies between devices. Although Graph Neural Networks (GNNs) offer a powerful way to learn from relational data, their internal representations often remain opaque and difficult to interpret for security-critical operations. Consequently, this work introduces an interpretable pipeline that generates directly visualizable low-dimensional representations by mapping high-dimensional embeddings onto a latent manifold. This projection enables the interpretable monitoring and interoperability of evolving network states, while integrated feature attribution techniques decode the specific characteristics shaping the manifold structure. The framework achieves a classification F1-score of 0.830 for intrusion detection while also highlighting phenomena such as concept drift. Ultimately, the presented approach bridges the gap between high-dimensional GNN embeddings and human-understandable network behavior, offering new insights for network administrators and security analysts.
