Deep-TEMPEST: Using Deep Learning to Eavesdrop on HDMI from its Unintended Electromagnetic Emanations
Santiago Fernández, Emilio Martínez, Gabriel Varela, Pablo Musé, Federico Larroca
TL;DR
This work tackles HDMI TEMPEST eavesdropping by formulating it as an inverse problem and deploying a deep CNN (DRUNet) to recover displayed images directly from complex baseband samples captured by an SDR. It derives and exploits an analytical forward model of HDMI emissions, avoids traditional AM demodulation, and demonstrates substantial CER improvements (over $60$ percentage points) over prior methods, using an open-source dataset of ~3500 samples (simulated and real). The approach jointly optimizes data-driven restoration with a physics-informed forward model, achieving superior text recovery and providing practical countermeasures to mitigate leakage. The contribution includes an accessible dataset and code, enabling broader evaluation and extension of TEMPEST defenses and attacks in digital-display contexts.
Abstract
In this work, we address the problem of eavesdropping on digital video displays by analyzing the electromagnetic waves that unintentionally emanate from the cables and connectors, particularly HDMI. This problem is known as TEMPEST. Compared to the analog case (VGA), the digital case is harder due to a 10-bit encoding that results in a much larger bandwidth and non-linear mapping between the observed signal and the pixel's intensity. As a result, eavesdropping systems designed for the analog case obtain unclear and difficult-to-read images when applied to digital video. The proposed solution is to recast the problem as an inverse problem and train a deep learning module to map the observed electromagnetic signal back to the displayed image. However, this approach still requires a detailed mathematical analysis of the signal, firstly to determine the frequency at which to tune but also to produce training samples without actually needing a real TEMPEST setup. This saves time and avoids the need to obtain these samples, especially if several configurations are being considered. Our focus is on improving the average Character Error Rate in text, and our system improves this rate by over 60 percentage points compared to previous available implementations. The proposed system is based on widely available Software Defined Radio and is fully open-source, seamlessly integrated into the popular GNU Radio framework. We also share the dataset we generated for training, which comprises both simulated and over 1000 real captures. Finally, we discuss some countermeasures to minimize the potential risk of being eavesdropped by systems designed based on similar principles.
