Unveiling Hidden Visual Information: A Reconstruction Attack Against Adversarial Visual Information Hiding
Jonggyu Jang, Hyeonsu Lyu, Seongjin Hwang, Hyun Jong Yang
TL;DR
The paper addresses privacy vulnerabilities in adversarial visual information hiding (AVIH) for cloud-based image recognition by introducing a data reconstruction attack that does not require access to the original key model. It trains an attacker key model to reproduce original gallery images from encrypted inputs while preserving the service model’s outputs, using a combination of $L_t$, $L_d$, $L_v$, and $L_r$ losses in AVIH and augmenting with augmented identity loss and a generative adversarial (GAN) framework constrained by the Jensen-Shannon divergence $D_{JS}$. The authors demonstrate an exact reconstruction attack and show that the attack’s effectiveness increases as more images share the same key model, with substantial gains from the proposed ablations, across face recognition and re-identification tasks. These results highlight significant privacy risks in AVIH-based systems and motivate stronger defenses, such as per-image unique keys or additional cryptographic protections, to prevent data reconstruction in cloud-based inference scenarios.
Abstract
This paper investigates the security vulnerabilities of adversarial-example-based image encryption by executing data reconstruction (DR) attacks on encrypted images. A representative image encryption method is the adversarial visual information hiding (AVIH), which uses type-I adversarial example training to protect gallery datasets used in image recognition tasks. In the AVIH method, the type-I adversarial example approach creates images that appear completely different but are still recognized by machines as the original ones. Additionally, the AVIH method can restore encrypted images to their original forms using a predefined private key generative model. For the best security, assigning a unique key to each image is recommended; however, storage limitations may necessitate some images sharing the same key model. This raises a crucial security question for AVIH: How many images can safely share the same key model without being compromised by a DR attack? To address this question, we introduce a dual-strategy DR attack against the AVIH encryption method by incorporating (1) generative-adversarial loss and (2) augmented identity loss, which prevent DR from overfitting -- an issue akin to that in machine learning. Our numerical results validate this approach through image recognition and re-identification benchmarks, demonstrating that our strategy can significantly enhance the quality of reconstructed images, thereby requiring fewer key-sharing encrypted images. Our source code to reproduce our results will be available soon.
