Morphology-Aware Interactive Keypoint Estimation
Jinhee Kim, Taesung Kim, Taewoo Kim, Jaegul Choo, Dong-Wook Kim, Byungduk Ahn, In-Seok Song, Yoon-Ji Kim
TL;DR
This work addresses the bottleneck of manual anatomical keypoint annotation in medical X-ray images by introducing an interactive framework that enables clinicians to correct a small subset of predictions and automatically revise the remaining keypoints. It combines a morphology-aware loss and an interaction-guided gating mechanism within an HRNet-based heatmap regression network to propagate user corrections across the image and preserve inter-keypoint geometry. The key contributions are the interactive keypoint estimation network, the interaction-guided gating network, and the morphology-aware loss, validated on Cephalometric X-ray and AASCE datasets, showing improved efficiency and accuracy over baselines. The approach has practical implications for reducing annotation costs and supporting diagnostic workflows with human-in-the-loop guidance, though it assumes correct user edits and points to future work on handling noisy inputs and guidance strategies for revision order.
Abstract
Diagnosis based on medical images, such as X-ray images, often involves manual annotation of anatomical keypoints. However, this process involves significant human efforts and can thus be a bottleneck in the diagnostic process. To fully automate this procedure, deep-learning-based methods have been widely proposed and have achieved high performance in detecting keypoints in medical images. However, these methods still have clinical limitations: accuracy cannot be guaranteed for all cases, and it is necessary for doctors to double-check all predictions of models. In response, we propose a novel deep neural network that, given an X-ray image, automatically detects and refines the anatomical keypoints through a user-interactive system in which doctors can fix mispredicted keypoints with fewer clicks than needed during manual revision. Using our own collected data and the publicly available AASCE dataset, we demonstrate the effectiveness of the proposed method in reducing the annotation costs via extensive quantitative and qualitative results. A demo video of our approach is available on our project webpage.
