Single Shot AI-assisted quantification of KI-67 proliferation index in breast cancer
Deepti Madurai Muthu, Priyanka S, Lalitha Rani N, P. G. Kubendran Amos
TL;DR
This work tackles the variability of Ki-67 proliferation scoring in breast cancer by deploying an AI-assisted approach based on YOLOv8 for automated two-class detection of Ki-67–positive and Ki-67–negative tumor cells in hotspot regions. A dataset of 180 high-resolution Ki-67 IHC images was annotated and expanded to 1,863 samples, with a transfer-learning pipeline leveraging COCO-pretrained weights. Among YOLOv8 variants, the Medium model achieved the best detection performance, with $mAP_{50}$ exceeding $85$ for Ki-67–positive cells and about $73$ for negatives, reflecting both data imbalance and staining clarity. The method offers a scalable, objective alternative to manual scoring and demonstrates potential for integration into pathology workflows, with future work focusing on user-friendly interfaces and multi-institutional validation to improve generalizability and adoption.
Abstract
Reliable quantification of Ki-67, a key proliferation marker in breast cancer, is essential for molecular subtyping and informed treatment planning. Conventional approaches, including visual estimation and manual counting, suffer from interobserver variability and limited reproducibility. This study introduces an AI-assisted method using the YOLOv8 object detection framework for automated Ki-67 scoring. High-resolution digital images (40x magnification) of immunohistochemically stained tumor sections were captured from Ki-67 hotspot regions and manually annotated by a domain expert to distinguish Ki-67-positive and negative tumor cells. The dataset was augmented and divided into training (80%), validation (10%), and testing (10%) subsets. Among the YOLOv8 variants tested, the Medium model achieved the highest performance, with a mean Average Precision at 50% Intersection over Union (mAP50) exceeding 85% for Ki-67-positive cells. The proposed approach offers an efficient, scalable, and objective alternative to conventional scoring methods, supporting greater consistency in Ki-67 evaluation. Future directions include developing user-friendly clinical interfaces and expanding to multi-institutional datasets to enhance generalizability and facilitate broader adoption in diagnostic practice.
