Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Shiman Zhang, Lakshmikar Reddy Polamreddy, Youshan Zhang
TL;DR
The paper addresses the challenge of detecting canine cardiomegaly from X-ray images with limited labeled data. It introduces Confident Pseudo-labeled Diffusion Augmentation (CDA), a framework that generates synthetic X-ray images via diffusion models and assigns VHS-related labels, while using Monte Carlo Dropout-based pseudo-labeling to select high-confidence unlabeled samples for training. A composite loss and VHS computation scheme promotes accurate localization of key points and VHS-based categorization into three clinically meaningful classes. The results show state-of-the-art VHS prediction accuracy on a canine chest X-ray dataset, with 92.8% test accuracy, demonstrating improved generalization through synthetic augmentation and selective semi-supervised learning. The work has practical implications for scalable, robust canine cardiomegaly diagnosis in veterinary practice.
Abstract
Canine cardiomegaly, marked by an enlarged heart, poses serious health risks if undetected, requiring accurate diagnostic methods. Current detection models often rely on small, poorly annotated datasets and struggle to generalize across diverse imaging conditions, limiting their real-world applicability. To address these issues, we propose a Confident Pseudo-labeled Diffusion Augmentation (CDA) model for identifying canine cardiomegaly. Our approach addresses the challenge of limited high-quality training data by employing diffusion models to generate synthetic X-ray images and annotate Vertebral Heart Score key points, thereby expanding the dataset. We also employ a pseudo-labeling strategy with Monte Carlo Dropout to select high-confidence labels, refine the synthetic dataset, and improve accuracy. Iteratively incorporating these labels enhances the model's performance, overcoming the limitations of existing approaches. Experimental results show that the CDA model outperforms traditional methods, achieving state-of-the-art accuracy in canine cardiomegaly detection. The code implementation is available at https://github.com/Shira7z/CDA.
