State-of-the-Art Periorbital Distance Prediction and Disease Classification Using Periorbital Features
George R. Nahass, Sasha Hubschman, Jeffrey C. Peterson, Ghasem Yazdanpanah, Nicholas Tomaras, Madison Cheung, Alex Palacios, Kevin Heinze, Chad A. Purnell, Pete Setabutr, Ann Q. Tran, Darvin Yi
TL;DR
This study targets automated, anatomically grounded periorbital distance prediction and its use as robust features for disease classification under real-world imaging variability. It develops a DeepLabV3-based segmentation pipeline and benchmarks it against SAM and PeriorbitAI across diverse healthy and diseased cohorts, achieving state-of-the-art distance accuracy often within intergrader variability. Beyond segmentation, the work demonstrates that periorbital distances enable competitive ID disease classification and, crucially, superior generalization under distribution shift compared to CNNs trained only on images, with XGBoost and Lasso maintaining strong OOD performance and fusion models offering peak ID accuracy. The findings support deployment of anatomy-based AI pipelines for accessible oculoplastic and craniofacial care, and point to future advances in fusion architectures and semi-supervised learning to further improve robustness and clinical utility.
Abstract
Periorbital distances are critical markers for diagnosing and monitoring a range of oculoplastic and craniofacial conditions. Manual measurement, however, is subjective and prone to intergrader variability. Automated methods have been developed but remain limited by standardized imaging requirements, small datasets, and a narrow focus on individual measurements. We developed a segmentation pipeline trained on a domain-specific dataset of healthy eyes and compared its performance against the Segment Anything Model (SAM) and the prior benchmark, PeriorbitAI. Segmentation accuracy was evaluated across multiple disease classes and imaging conditions. We further investigated the use of predicted periorbital distances as features for disease classification under in-distribution (ID) and out-of-distribution (OOD) settings, comparing shallow classifiers, CNNs, and fusion models. Our segmentation model achieved state-of-the-art accuracy across all datasets, with error rates within intergrader variability and superior performance relative to SAM and PeriorbitAI. In classification tasks, models trained on periorbital distances matched CNN performance on ID data (77--78\% accuracy) and substantially outperformed CNNs under OOD conditions (63--68\% accuracy vs. 14\%). Fusion models achieved the highest ID accuracy (80\%) but were sensitive to degraded CNN features under OOD shifts. Segmentation-derived periorbital distances provide robust, explainable features for disease classification and generalize better under domain shift than CNN image classifiers. These results establish a new benchmark for periorbital distance prediction and highlight the potential of anatomy-based AI pipelines for real-world deployment in oculoplastic and craniofacial care.
