FOSSIL: Regret-Minimizing Curriculum Learning for Metadata-Free and Low-Data Mpox Diagnosis
Sahng-Min Han, Minjae Kim, Jinho Cha, Se-woon Choe, Eunchan Daniel Cha, Jungwon Choi, Kyudong Jung
TL;DR
Mpox diagnosis from small, imbalanced dermatology datasets suffers from unstable optimization and poor calibration. The authors propose FOSSIL, a regret-minimizing, sample-sensitive weighting scheme where $w_i = \exp(-d_i/T)$ using softmax-based difficulty $d_i$, integrated into a four-stage Easy–Very Hard curriculum and applied across CNNs and transformers. The approach delivers superior discrimination (e.g., AUC up to $0.9573$) and calibration (ECE ~ $0.053$), with robustness to real-world perturbations and strong external validation (MCSI AUC $0.963$) without metadata or synthetic augmentation. The framework is architecture- and modality-agnostic, offering theoretical guarantees and practical reliability for data-scarce medical imaging and telemedicine, with broad applicability to radiology, histopathology, and longitudinal monitoring. Overall, FOSSIL provides a principled, interpretable path to stable, data-efficient clinical AI under strict data constraints.
Abstract
Deep learning in small and imbalanced biomedical datasets remains fundamentally constrained by unstable optimization and poor generalization. We present the first biomedical implementation of FOSSIL (Flexible Optimization via Sample-Sensitive Importance Learning), a regret-minimizing weighting framework that adaptively balances training emphasis according to sample difficulty. Using softmax-based uncertainty as a continuous measure of difficulty, we construct a four-stage curriculum (Easy-Very Hard) and integrate FOSSIL into both convolutional and transformer-based architectures for Mpox skin lesion diagnosis. Across all settings, FOSSIL substantially improves discrimination (AUC = 0.9573), calibration (ECE = 0.053), and robustness under real-world perturbations, outperforming conventional baselines without metadata, manual curation, or synthetic augmentation. The results position FOSSIL as a generalizable, data-efficient, and interpretable framework for difficulty-aware learning in medical imaging under data scarcity.
