Scan-and-Print: Patch-level Data Summarization and Augmentation for Content-aware Layout Generation in Poster Design
HsiaoYuan Hsu, Yuxin Peng
TL;DR
This paper addresses the computational and data-efficiency challenges in content-aware poster layout generation by introducing Scan-and-Print, a compact autoregressive framework guided by a vertex-based layout representation (VLR). The Scan component performs patch-level data summarization to focus perception on informative image regions, while the Print component uses patch and vertex mixup across image-layout pairs to synthesize diverse, plausible samples for robust training. The approach achieves state-of-the-art results on PKU PosterLayout and CGL benchmarks with substantially reduced encoder FLOPs and parameters, and demonstrates effective constrained-generation capabilities for real-world workflows. Overall, Scan-and-Print offers a practical, scalable solution that improves layout quality and generation speed, with strong potential for broader multi-modal design tasks.
Abstract
In AI-empowered poster design, content-aware layout generation is crucial for the on-image arrangement of visual-textual elements, e.g., logo, text, and underlay. To perceive the background images, existing work demanded a high parameter count that far exceeds the size of available training data, which has impeded the model's real-time performance and generalization ability. To address these challenges, we proposed a patch-level data summarization and augmentation approach, vividly named Scan-and-Print. Specifically, the scan procedure selects only the patches suitable for placing element vertices to perform fine-grained perception efficiently. Then, the print procedure mixes up the patches and vertices across two image-layout pairs to synthesize over 100% new samples in each epoch while preserving their plausibility. Besides, to facilitate the vertex-level operations, a vertex-based layout representation is introduced. Extensive experimental results on widely used benchmarks demonstrated that Scan-and-Print can generate visually appealing layouts with state-of-the-art quality while dramatically reducing computational bottleneck by 95.2%.
