Textured Word-As-Image illustration
Mohammad Javadian Farzaneh, Selim Balcisoy
TL;DR
The paper tackles semantic typography by presenting an automatic pipeline that reshapes a selected letter to reflect a semantic concept and textures the letter via diffusion-based generation, ensuring readability. It integrates Word-as-Illustration for letter deformation with Stable Diffusion XL for concept-driven textures, followed by a precise texture-mapping stage using alpha masking and Lanczos resampling. A Gradio-based UI enables real-time texture scale and background color adjustments, and a user study demonstrates that textures convey semantic meaning without significantly compromising legibility. While effective, the approach depends on pretrained diffusion models and acknowledges limitations for abstract concepts, suggesting future model retraining or fine-tuning to broaden coverage.
Abstract
In this paper, we propose a novel fully automatic pipeline to generate text images that are legible and strongly aligned to the desired semantic concept taken from the users' inputs. In our method, users are able to put three inputs into the system, including a semantic concept, a word, and a letter. The semantic concept will be used to change the shape of the input letter and generate the texture based on the pre-defined prompt using stable diffusion models. Our pipeline maps the texture on a text image in a way that preserves the readability of the whole output while preserving legibility. The system also provides real-time adjustments for the user to change the scale of the texture and apply it to the text image. User evaluations demonstrate that our method effectively represents semantic meaning without compromising legibility, making it a robust and innovative tool for graphic design, logo creation, and artistic typography.
