Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching
Huai-Hsun Cheng, Siang-Ling Zhang, Yu-Lun Liu
TL;DR
Stroke of Surprise introduces Progressive Semantic Illusions to vector sketching, enabling a single stroke sequence to morph from an initial concept to a distinct later concept through delta strokes. The method employs a sequence-aware dual-branch Score Distillation Sampling that jointly optimizes shared stroke parameters for both the prefix and the full sketch, augmented by an Overlay Loss that enforces spatial complementarity and prevents occlusion. By discovering a common structural subspace that supports multi-phase transitions, the approach significantly improves recognizability and illusion strength over state-of-the-art baselines and scales to more than two phases with varied vector representations. The framework offers robust, human-aligned synthesis with potential applications in AI-assisted drawing and visual communication, while acknowledging limitations related to pre-trained diffusion priors and complex shape guidance.
Abstract
Visual illusions traditionally rely on spatial manipulations such as multi-view consistency. In this work, we introduce Progressive Semantic Illusions, a novel vector sketching task where a single sketch undergoes a dramatic semantic transformation through the sequential addition of strokes. We present Stroke of Surprise, a generative framework that optimizes vector strokes to satisfy distinct semantic interpretations at different drawing stages. The core challenge lies in the "dual-constraint": initial prefix strokes must form a coherent object (e.g., a duck) while simultaneously serving as the structural foundation for a second concept (e.g., a sheep) upon adding delta strokes. To address this, we propose a sequence-aware joint optimization framework driven by a dual-branch Score Distillation Sampling (SDS) mechanism. Unlike sequential approaches that freeze the initial state, our method dynamically adjusts prefix strokes to discover a "common structural subspace" valid for both targets. Furthermore, we introduce a novel Overlay Loss that enforces spatial complementarity, ensuring structural integration rather than occlusion. Extensive experiments demonstrate that our method significantly outperforms state-of-the-art baselines in recognizability and illusion strength, successfully expanding visual anagrams from the spatial to the temporal dimension. Project page: https://stroke-of-surprise.github.io/
