Revealing Hierarchical Structure of Leaf Venations in Plant Science via Label-Efficient Segmentation: Dataset and Method
Weizhen Liu, Ao Li, Ze Wu, Yue Li, Baobin Ge, Guangyu Lan, Shilin Chen, Minghe Li, Yunfei Liu, Xiaohui Yuan, Nanqing Dong
TL;DR
This work introduces HALVS, the first public dataset for hierarchical leaf vein segmentation with 1°, 2°, and 3° vein annotations across soybean, sweet cherry, and London planetree, captured using transmission-scanning and organized into 256×256 patches. It also proposes a label-efficient learning paradigm, Partially Supervised Semantic Segmentation (PSSS), that leverages fully labeled, partially labeled (1°/2° only), and unlabeled data within a semi-supervised framework to improve segmentation of the challenging 3° veins. Extensive experiments show that PSSS consistently boosts 3°-vein IoU across multiple SSL baselines and backbones, and ablations reveal the importance of the partial-label loss components and hyperparameters; cross-species experiments reveal transfer challenges due to domain shift. Overall, HALVS provides a valuable dataset and a practical methodology that advance plant phenomics by enabling hierarchical vein analysis with reduced labeling effort and by highlighting cross-species learning considerations.
Abstract
Hierarchical leaf vein segmentation is a crucial but under-explored task in agricultural sciences, where analysis of the hierarchical structure of plant leaf venation can contribute to plant breeding. While current segmentation techniques rely on data-driven models, there is no publicly available dataset specifically designed for hierarchical leaf vein segmentation. To address this gap, we introduce the HierArchical Leaf Vein Segmentation (HALVS) dataset, the first public hierarchical leaf vein segmentation dataset. HALVS comprises 5,057 real-scanned high-resolution leaf images collected from three plant species: soybean, sweet cherry, and London planetree. It also includes human-annotated ground truth for three orders of leaf veins, with a total labeling effort of 83.8 person-days. Based on HALVS, we further develop a label-efficient learning paradigm that leverages partial label information, i.e. missing annotations for tertiary veins. Empirical studies are performed on HALVS, revealing new observations, challenges, and research directions on leaf vein segmentation.
