Perceptual Hash Inversion Attacks on Image-Based Sexual Abuse Removal Tools
Sophie Hawkes, Christian Weinert, Teresa Almeida, Maryam Mehrnezhad
TL;DR
Perceptual hashes underpin IBSA removal tools but are vulnerable to low-cost GAN-based hash inversion attacks across several popular PHFs, threatening user privacy. The authors demonstrate a Pix2Pix-style inversion pipeline trained on a CelebA subset that reconstructs recognizable features from hashes for aHash, PDQ, NeuralHash, and PhotoDNA on consumer hardware. They quantify perceptual similarity and show best-case reconstructions reaching near-complete similarity, challenging the reversibility and safety claims of tools such as Take It Down. To mitigate risk, they propose secure hash matching via private set intersection, including outsourced and unbalanced PSI variants, and call for privacy-preserving designs in IBSA content removal. The work highlights urgent need for cryptographic protections and user-centric usability considerations.
Abstract
We show that perceptual hashing, crucial for detecting and removing image-based sexual abuse (IBSA) online, faces vulnerabilities from low-budget inversion attacks based on generative AI. This jeopardizes the privacy of users, especially vulnerable groups. We advocate to implement secure hash matching in IBSA removal tools to mitigate potentially fatal consequences.
