PanTS: The Pancreatic Tumor Segmentation Dataset
Wenxuan Li, Xinze Zhou, Qi Chen, Tianyu Lin, Pedro R. A. S. Bassi, Szymon Plotka, Jaroslaw B. Cwikla, Xiaoxi Chen, Chen Ye, Zheren Zhu, Kai Ding, Heng Li, Kang Wang, Yang Yang, Yucheng Tang, Daguang Xu, Alan L. Yuille, Zongwei Zhou
TL;DR
PanTS delivers the largest multi-institutional CT dataset for pancreatic tumor analysis, featuring 36,390 scans from 145 centers with voxel-wise annotations for tumors, pancreas subregions, and 24 surrounding structures. The work demonstrates that both dataset scale and rich anatomical context substantially improve AI performance, particularly under out-of-distribution conditions, and provides a public baseline model and benchmarking protocol. Through rigorous annotation standards and quality control, PanTS enables robust evaluation of anatomy-aware segmentation methods for detection, localization, and surgical planning. This resource has the potential to accelerate clinically translatable AI tools for early pancreatic cancer detection and radiotherapy planning.
Abstract
PanTS is a large-scale, multi-institutional dataset curated to advance research in pancreatic CT analysis. It contains 36,390 CT scans from 145 medical centers, with expert-validated, voxel-wise annotations of over 993,000 anatomical structures, covering pancreatic tumors, pancreas head, body, and tail, and 24 surrounding anatomical structures such as vascular/skeletal structures and abdominal/thoracic organs. Each scan includes metadata such as patient age, sex, diagnosis, contrast phase, in-plane spacing, slice thickness, etc. AI models trained on PanTS achieve significantly better performance in pancreatic tumor detection, localization, and segmentation compared to those trained on existing public datasets. Our analysis indicates that these gains are directly attributable to the 16x larger-scale tumor annotations and indirectly supported by the 24 additional surrounding anatomical structures. As the largest and most comprehensive resource of its kind, PanTS offers a new benchmark for developing and evaluating AI models in pancreatic CT analysis.
