Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs

Yahya Jabary; Andreas Plesner; Turlan Kuzhagaliyev; Roger Wattenhofer

Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs

Yahya Jabary, Andreas Plesner, Turlan Kuzhagaliyev, Roger Wattenhofer

TL;DR

It is demonstrated that by adding masks of various intensities the Accuracy @ 1 (Acc@1) drops by more than 50%-points for all models, and supposedly robust models such as vision transformers see an Acc@1 drop of 80%-points, thus showing that machines have not caught up with humans -- yet.

Abstract

Modern CAPTCHAs rely heavily on vision tasks that are supposedly hard for computers but easy for humans. However, advances in image recognition models pose a significant threat to such CAPTCHAs. These models can easily be fooled by generating some well-hidden "random" noise and adding it to the image, or hiding objects in the image. However, these methods are model-specific and thus can not aid CAPTCHAs in fooling all models. We show in this work that by allowing for more significant changes to the images while preserving the semantic information and keeping it solvable by humans, we can fool many state-of-the-art models. Specifically, we demonstrate that by adding masks of various intensities the Accuracy @ 1 (Acc@1) drops by more than 50%-points for all models, and supposedly robust models such as vision transformers see an Acc@1 drop of 80%-points. These masks can therefore effectively fool modern image classifiers, thus showing that machines have not caught up with humans -- yet.

Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs

TL;DR

Abstract

Paper Structure (21 sections, 2 figures, 12 tables)

This paper contains 21 sections, 2 figures, 12 tables.

Introduction
Approach
Related Work
Methodology
Models
Data
Perceptual Quality and the Accuracy Metric
Results
Experiment 1 -- SubSet500
Experiment 2 -- SubSet200
Experiment 3 -- ResizedAll
Conclusion
Appendix / supplemental material
Acc@1 and Acc@5 accuracy of the tested models.
Hyperparameter Optimization
...and 6 more sections

Figures (2)

Figure 1: Selected examples by hCaptcha and their optimized reconstructions. The "Word" overlay was omitted and replaced with a custom "Knit" mask.
Figure 2: Accuracy vs. Perceptual Quality Trade-off

Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs

TL;DR

Abstract

Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs

Authors

TL;DR

Abstract

Table of Contents

Figures (2)