FontCraft: Multimodal Font Design Using Interactive Bayesian Optimization

Yuki Tatsukawa; I-Chao Shen; Mustafa Doga Dogan; Anran Qi; Yuki Koyama; Ariel Shamir; Takeo Igarashi

FontCraft: Multimodal Font Design Using Interactive Bayesian Optimization

Yuki Tatsukawa, I-Chao Shen, Mustafa Doga Dogan, Anran Qi, Yuki Koyama, Ariel Shamir, Takeo Igarashi

TL;DR

FontCraft addresses the barrier non-experts face in font design by enabling interactive exploration of a font style latent space through preferential Bayesian optimization guided by multimodal references. It combines a pretrained font-generative model (DG-Font) and FontCLIP with a history-enabled UI to allow one-dimensional slider exploration, multimodal cue incorporation, style propagation across characters, and reversion to past states, exporting final fonts in OpenType format. Key contributions include multimodal-guided subspaces, retractable preference modeling, and an iterative style propagation/refinement loop, validated by simulations and a user study showing improved efficiency and consistency for Roman and CJK fonts. The approach is architecture- and model-agnostic, enabling future integration with newer font generators while supporting practical design tasks such as logos and posters, highlighting potential real-world impact for designers and non-experts alike.

Abstract

Creating new fonts requires a lot of human effort and professional typographic knowledge. Despite the rapid advancements of automatic font generation models, existing methods require users to prepare pre-designed characters with target styles using font-editing software, which poses a problem for non-expert users. To address this limitation, we propose FontCraft, a system that enables font generation without relying on pre-designed characters. Our approach integrates the exploration of a font-style latent space with human-in-the-loop preferential Bayesian optimization and multimodal references, facilitating efficient exploration and enhancing user control. Moreover, FontCraft allows users to revisit previous designs, retracting their earlier choices in the preferential Bayesian optimization process. Once users finish editing the style of a selected character, they can propagate it to the remaining characters and further refine them as needed. The system then generates a complete outline font in OpenType format. We evaluated the effectiveness of FontCraft through a user study comparing it to a baseline interface. Results from both quantitative and qualitative evaluations demonstrate that FontCraft enables non-expert users to design fonts efficiently.

FontCraft: Multimodal Font Design Using Interactive Bayesian Optimization

TL;DR

Abstract

FontCraft: Multimodal Font Design Using Interactive Bayesian Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)