Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning
Yuting He, Boyu Wang, Rongjun Ge, Yang Chen, Guanyu Yang, Shuo Li
TL;DR
Medical image dense contrastive learning faces large-scale FP&N pairs due to intrinsic image properties. GEMINI addresses this by embedding a homeomorphism prior through Deformable Homeomorphism Learning (DHL) and Geometric Semantic Similarities (GSS), enabling reliable pixel-wise correspondence and soft negative-pair learning. The framework is instantiated in two practical variants: GEMINI-Semi for few-shot semi-supervised segmentation and GEMINI-MIP for self-supervised pre-training, achieving state-of-the-art results across seven datasets and demonstrating strong data efficiency and transferability. By combining topology-preserving deformation with semantic-aware similarity, GEMINI significantly mitigates FP&N issues and enhances dense representation learning in MIDP tasks, with potential for broad clinical impact.
Abstract
Dense contrastive representation learning (DCRL) has greatly improved the learning efficiency for image-dense prediction tasks, showing its great potential to reduce the large costs of medical image collection and dense annotation. However, the properties of medical images make unreliable correspondence discovery, bringing an open problem of large-scale false positive and negative (FP&N) pairs in DCRL. In this paper, we propose GEoMetric vIsual deNse sImilarity (GEMINI) learning which embeds the homeomorphism prior to DCRL and enables a reliable correspondence discovery for effective dense contrast. We propose a deformable homeomorphism learning (DHL) which models the homeomorphism of medical images and learns to estimate a deformable mapping to predict the pixels' correspondence under topological preservation. It effectively reduces the searching space of pairing and drives an implicit and soft learning of negative pairs via a gradient. We also propose a geometric semantic similarity (GSS) which extracts semantic information in features to measure the alignment degree for the correspondence learning. It will promote the learning efficiency and performance of deformation, constructing positive pairs reliably. We implement two practical variants on two typical representation learning tasks in our experiments. Our promising results on seven datasets which outperform the existing methods show our great superiority. We will release our code on a companion link: https://github.com/YutingHe-list/GEMINI.
