Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding
Weizhen Liu, Jiayu Tan, Guangyu Lan, Ao Li, Dongye Li, Le Zhao, Xiaohui Yuan, Nanqing Dong
TL;DR
This work tackles the challenge of precise morphological phenotyping in aquaculture by releasing FishPhenoKey, a large, multi-species dataset with 22 keypoints enabling 23 phenotypes. It introduces the Percentage of Measured Phenotype (PMP) as a phenotype-aware evaluation metric and the Anatomically-Calibrated Regularization (ACR) loss to inject anatomical priors into keypoint detection, improving both localization and downstream phenotype measurement. Experiments show PMP offers robust per-keypoint assessment over OKS/PCK and that ACR enhances performance across backbones and species, yielding substantial gains in morphology accuracy (e.g., $R^2$ rising to 0.96 for a target phenotype). The dataset and code pave the way for more reliable, scalable phenotypic analysis in aquaculture breeding and genetics, supporting sustainable practices and diverse research avenues.
Abstract
Accurate phenotypic analysis in aquaculture breeding necessitates the quantification of subtle morphological phenotypes. Existing datasets suffer from limitations such as small scale, limited species coverage, and inadequate annotation of keypoints for measuring refined and complex morphological phenotypes of fish body parts. To address this gap, we introduce FishPhenoKey, a comprehensive dataset comprising 23,331 high-resolution images spanning six fish species. Notably, FishPhenoKey includes 22 phenotype-oriented annotations, enabling the capture of intricate morphological phenotypes. Motivated by the nuanced evaluation of these subtle morphologies, we also propose a new evaluation metric, Percentage of Measured Phenotype (PMP). It is designed to assess the accuracy of individual keypoint positions and is highly sensitive to the phenotypes measured using the corresponding keypoints. To enhance keypoint detection accuracy, we further propose a novel loss, Anatomically-Calibrated Regularization (ACR), that can be integrated into keypoint detection models, leveraging biological insights to refine keypoint localization. Our contributions set a new benchmark in fish phenotype analysis, addressing the challenges of precise morphological quantification and opening new avenues for research in sustainable aquaculture and genetic studies. Our dataset and code are available at https://github.com/WeizhenLiuBioinform/Fish-Phenotype-Detect.
