Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip
Yang Li, Leo Yan Li-Han, Hua Tian
TL;DR
This work addresses the variability and inefficiency of manual DDH diagnosis by introducing an automated end-to-end system that detects eight pelvic keypoints with a Mask-RCNN-based detector (ResNet-$50$ backbone), computes the $CE$, $T ext{önnis}$, and $Sharp$ angles, and integrates them through a data-driven scoring scheme. The system demonstrates strong angle measurement accuracy (ICC$CE=0.957$, ICC$T ext{önnis}=0.942$, ICC$Sharp=0.966$) and DDH diagnostic performance (F1$=0.863$), outperforming a cohort of moderately experienced orthopedists and single-angle criteria. A key contribution is the explicit, data-driven scoring that weights angle information to produce an explainable diagnosis with high specificity ($0.996$) and balanced sensitivity ($0.824$). This approach has potential clinical impact by reducing measurement variability, supporting less-experienced clinicians, and enabling remote or centralized second opinions, though external validation is needed. The work advances automated radiographic DDH assessment by unifying keypoint localization, angle computation, and interpretable diagnostic reasoning.
Abstract
Objective: The clinical diagnosis of developmental dysplasia of the hip (DDH) typically involves manually measuring key radiological angles -- Center-Edge (CE), Tonnis, and Sharp angles -- from pelvic radiographs, a process that is time-consuming and susceptible to variability. This study aims to develop an automated system that integrates these measurements to enhance the accuracy and consistency of DDH diagnosis. Methods and procedures: We developed an end-to-end deep learning model for keypoint detection that accurately identifies eight anatomical keypoints from pelvic radiographs, enabling the automated calculation of CE, Tonnis, and Sharp angles. To support the diagnostic decision, we introduced a novel data-driven scoring system that combines the information from all three angles into a comprehensive and explainable diagnostic output. Results: The system demonstrated superior consistency in angle measurements compared to a cohort of eight moderately experienced orthopedists. The intraclass correlation coefficients for the CE, Tonnis, and Sharp angles were 0.957 (95% CI: 0.952--0.962), 0.942 (95% CI: 0.937--0.947), and 0.966 (95% CI: 0.964--0.968), respectively. The system achieved a diagnostic F1 score of 0.863 (95% CI: 0.851--0.876), significantly outperforming the orthopedist group (0.777, 95% CI: 0.737--0.817, p = 0.005), as well as using clinical diagnostic criteria for each angle individually (p<0.001). Conclusion: The proposed system provides reliable and consistent automated measurements of radiological angles and an explainable diagnostic output for DDH, outperforming moderately experienced clinicians. Clinical impact: This AI-powered solution reduces the variability and potential errors of manual measurements, offering clinicians a more consistent and interpretable tool for DDH diagnosis.
