VIS30K: A Collection of Figures and Tables from IEEE Visualization Conference Publications
Jian Chen, Meng Ling, Rui Li, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Torsten Möller, Robert S. Laramee, Han-Wei Shen, Katharina Wünsche, Qiru Wang
TL;DR
VIS30K tackles the lack of image-centric access to IEEE VIS literature by assembling a complete 30-year collection of figures and tables from 1990–2019 and providing a searchable browser, VIN. The authors implement a semi-automatic extraction pipeline that uses synthetic pseudo-papers to train CNN detectors (YOLOv3 and Faster R-CNN) and then applies manual curation to produce high-quality bounding-box annotations for 26,776 figures and 2,913 tables across 2,916 papers. They release VIS30K data, grounding metadata, training data, and pretrained models to support reproducible research and tool-building. The work enables visual-bibliometric analyses, teaching with image-based access, and platform-enabled related-work discovery and ML benchmarking on scholarly documents.
Abstract
We present the VIS30K dataset, a collection of 29,689 images that represents 30 years of figures and tables from each track of the IEEE Visualization conference series (Vis, SciVis, InfoVis, VAST). VIS30K's comprehensive coverage of the scientific literature in visualization not only reflects the progress of the field but also enables researchers to study the evolution of the state-of-the-art and to find relevant work based on graphical content. We describe the dataset and our semi-automatic collection process, which couples convolutional neural networks (CNN) with curation. Extracting figures and tables semi-automatically allows us to verify that no images are overlooked or extracted erroneously. To improve quality further, we engaged in a peer-search process for high-quality figures from early IEEE Visualization papers. With the resulting data, we also contribute VISImageNavigator (VIN, visimagenavigator.github.io), a web-based tool that facilitates searching and exploring VIS30K by author names, paper keywords, title and abstract, and years.
