Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening
Yan Kong, Sheng Wang, Jiangdong Cai, Zihao Zhao, Zhenrong Shen, Yonghao Li, Manman Fei, Qian Wang
TL;DR
This work tackles false positives in vulvovaginal candidiasis detection from Thin-prep cytologic test images by leveraging expert gaze data. It introduces Gaze-DETR, a training framework with a gaze-guided warm-up and a gaze-guided rectification stage that uses gaze-based signals during training while keeping inference gaze-free. Key contributions include the definition of 'gaze only' boxes, a universal warm-up strategy adaptable to CNN and DETR-like detectors, and a rectification mechanism that concentrates training queries on confounding regions. Empirical results on an in-house dataset show improved precision and generalization over state-of-the-art detectors, with broad compatibility across backbones and detection paradigms, highlighting the potential of human expert gaze to enhance medical image analysis.
Abstract
Accurate detection of vulvovaginal candidiasis is critical for women's health, yet its sparse distribution and visually ambiguous characteristics pose significant challenges for accurate identification by pathologists and neural networks alike. Our eye-tracking data reveals that areas garnering sustained attention - yet not marked by experts after deliberation - are often aligned with false positives of neural networks. Leveraging this finding, we introduce Gaze-DETR, a pioneering method that integrates gaze data to enhance neural network precision by diminishing false positives. Gaze-DETR incorporates a universal gaze-guided warm-up protocol applicable across various detection methods and a gaze-guided rectification strategy specifically designed for DETR-based models. Our comprehensive tests confirm that Gaze-DETR surpasses existing leading methods, showcasing remarkable improvements in detection accuracy and generalizability.
