Iris-SAM: Iris Segmentation Using a Foundation Model
Parisa Farmanifard, Arun Ross
TL;DR
This work adapts the Segment Anything Model (SAM) to iris segmentation by fine-tuning with a Focal Loss to address extreme class imbalance between iris and non-iris pixels. It innovates by generating training-time bounding-box prompts from ground-truth masks and enabling inference-time automatic bounding boxes with a single iris mask output, optimizing for Intersection over Union (IoU). Iris-SAM achieves near-perfect IoU on ND-Iris-0405 (99.58%) and high IoU on CASIA-Iris-Interval-v3 (96.94%), with strong cross-dataset generalization (e.g., 93.75–95.26% on unseen datasets) and low performance variance across tests. The results demonstrate the viability of foundation-model-based segmentation in specialized biometric domains and lay groundwork for extending to off-axis and multi-spectral iris imaging in future work.
Abstract
Iris segmentation is a critical component of an iris biometric system and it involves extracting the annular iris region from an ocular image. In this work, we develop a pixel-level iris segmentation model from a foundational model, viz., Segment Anything Model (SAM), that has been successfully used for segmenting arbitrary objects. The primary contribution of this work lies in the integration of different loss functions during the fine-tuning of SAM on ocular images. In particular, the importance of Focal Loss is borne out in the fine-tuning process since it strategically addresses the class imbalance problem (i.e., iris versus non-iris pixels). Experiments on ND-IRIS-0405, CASIA-Iris-Interval-v3, and IIT-Delhi-Iris datasets convey the efficacy of the trained model for the task of iris segmentation. For instance, on the ND-IRIS-0405 dataset, an average segmentation accuracy of 99.58% was achieved, compared to the best baseline performance of 89.75%.
