TelApart: Differentiating Network Faults from Customer-Premise Faults in Cable Broadband Networks
Jiyao Hu, Zhenyu Zhou, Xiaowei Yang
TL;DR
TelApart addresses the practical problem of differentiating maintenance- from service-issues in cable broadband networks using unlabeled PNM telemetry. It decouples fault detection from fault diagnosis, applying unsupervised clustering to identify device groups with similar anomalous patterns and then uses a cluster-size threshold to label maintenance versus service faults. Hyper-parameters are automatically tuned with guidance from customer-ticket statistics, enabling deployment without manual tuning. The approach robustly handles missing, duplicated, and misaligned PNM data and demonstrates strong performance (Rand Index ≈ 0.91) and field-validated effectiveness, offering significant potential to reduce misdispatches and operation costs in real-world ISP settings. The work contributes a turn-key, data-preprocessing–aware fault-diagnosis framework that relies on time-series similarity and ticket-guided optimization rather than labeled training data.
Abstract
Two types of radio frequency (RF) impairments frequently occur in a cable broadband network: impairments that occur inside a cable network and impairments occur at the edge of the broadband network, i.e., in a subscriber's premise. Differentiating these two types of faults is important, as different faults require different types of technical personnel to repair them. Presently, the cable industry lacks publicly available tools to automatically diagnose the type of fault. In this work, we present TelApart, a fault diagnosis system for cable broadband networks. TelApart uses telemetry data collected by the Proactive Network Maintenance (PNM) infrastructure in cable networks to effectively differentiate the type of fault. Integral to TelApart's design is an unsupervised machine learning model that groups cable devices sharing similar anomalous patterns together. We use metrics derived from an ISP's customer trouble tickets to programmatically tune the model's hyper-parameters so that an ISP can deploy TelApart in various conditions without hand-tuning its hyper-parameters. We also address the data challenge that the telemetry data collected by the PNM system contain numerous missing, duplicated, and unaligned data points. Using real-world data contributed by a cable ISP, we show that TelApart can effectively identify different types of faults.
