Cranio-ID: Graph-Based Craniofacial Identification via Automatic Landmark Annotation in 2D Multi-View X-rays
Ravi Shankar Prasad, Nandani Sharma, Dinesh Singh
TL;DR
Cranio-ID tackles forensic craniofacial identification by automating landmark annotation on 2D skulls and enabling robust cross-modal skull–face and sketch–face matching. It builds patches around landmarks as graph nodes, encodes them with a GCN alongside global ViT features, and aligns skull and face modalities through cross-attention and entropic-regularized optimal transport within a triplet-learning framework. The approach is evaluated on S2F and CUHK, showing significant gains in recall and mAP over baselines and revealing domain gaps that favor sketch–face matching over skull–face. The results demonstrate the method's potential for fast, reliable landmark annotation and cross-domain identification in forensic contexts, with implications for both practical identification and future reconstruction tasks.
Abstract
In forensic craniofacial identification and in many biomedical applications, craniometric landmarks are important. Traditional methods for locating landmarks are time-consuming and require specialized knowledge and expertise. Current methods utilize superimposition and deep learning-based methods that employ automatic annotation of landmarks. However, these methods are not reliable due to insufficient large-scale validation studies. In this paper, we proposed a novel framework Cranio-ID: First, an automatic annotation of landmarks on 2D skulls (which are X-ray scans of faces) with their respective optical images using our trained YOLO-pose models. Second, cross-modal matching by formulating these landmarks into graph representations and then finding semantic correspondence between graphs of these two modalities using cross-attention and optimal transport framework. Our proposed framework is validated on the S2F and CUHK datasets (CUHK dataset resembles with S2F dataset). Extensive experiments have been conducted to evaluate the performance of our proposed framework, which demonstrates significant improvements in both reliability and accuracy, as well as its effectiveness in cross-domain skull-to-face and sketch-to-face matching in forensic science.
