AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations
Minjun Zhu, Zhen Lin, Yixuan Weng, Panzhong Lu, Qiujie Xie, Yifan Wei, Sifan Liu, Qiyao Sun, Yue Zhang
TL;DR
This work tackles the bottleneck of producing publication-ready scientific illustrations from long-context documents by introducing FigureBench, a large-scale benchmark of 3,300 long-text–illustration pairs, and AutoFigure, a two-stage, agentic framework based on the Reasoned Rendering paradigm. Stage I performs conceptual grounding and layout planning to produce a symbolic, structurally coherent blueprint, while Stage II renders this blueprint into high-fidelity visuals with an erase-and-correct post-processing pipeline to ensure textual accuracy. Through extensive automated and human evaluations, AutoFigure consistently surpasses baselines in visual design, communication, and content fidelity, achieving publication-ready quality across diverse document types. The work demonstrates a practical path toward automated, high-quality scientific visualization, with implications for broader AI-assisted scientific communication and future extensions to dynamic and interactive figures.
Abstract
High-quality scientific illustrations are crucial for effectively communicating complex scientific and technical concepts, yet their manual creation remains a well-recognized bottleneck in both academia and industry. We present FigureBench, the first large-scale benchmark for generating scientific illustrations from long-form scientific texts. It contains 3,300 high-quality scientific text-figure pairs, covering diverse text-to-illustration tasks from scientific papers, surveys, blogs, and textbooks. Moreover, we propose AutoFigure, the first agentic framework that automatically generates high-quality scientific illustrations based on long-form scientific text. Specifically, before rendering the final result, AutoFigure engages in extensive thinking, recombination, and validation to produce a layout that is both structurally sound and aesthetically refined, outputting a scientific illustration that achieves both structural completeness and aesthetic appeal. Leveraging the high-quality data from FigureBench, we conduct extensive experiments to test the performance of AutoFigure against various baseline methods. The results demonstrate that AutoFigure consistently surpasses all baseline methods, producing publication-ready scientific illustrations. The code, dataset and huggingface space are released in https://github.com/ResearAI/AutoFigure.
