FRCNet Frequency and Region Consistency for Semi-supervised Medical Image Segmentation
Along He, Tao Li, Yanlin Wu, Ke Zou, Huazhu Fu
TL;DR
Medical image segmentation is hampered by scarce pixel-wise labels. The authors propose FRCNet, a semi-supervised framework that jointly leverages frequency-domain and region-level cues through two consistency regularizers: Frequency Domain Consistency (FDC) and Multi-granularity Region Similarity Consistency (MRSC), built on a Mean Teacher backbone. FDC transforms RGB features with a Discrete Cosine Transform and uses a Frequency Enhancement Module to encode low- and high-frequency lesion cues, while MRSC enforces multi-scale region relationships via region similarity matrices across granularities. The training objective combines supervised loss with $\mathcal{L}_{fdc}$, $\mathcal{L}_{mrsc}$, and $\mathcal{L}_{pix}$ as $L = L_{sup} + \lambda( L_{fdc} + L_{mrsc} + L_{pix} )$, and FDC/MRSC can be removed at inference to keep complexity low; on Kvasir-SEG and ISIC 2016, FRCNet achieves state-of-the-art Dice and IoU with as little as 10% labeled data, while reducing annotation cost by almost 80%.
Abstract
Limited labeled data hinder the application of deep learning in medical domain. In clinical practice, there are sufficient unlabeled data that are not effectively used, and semi-supervised learning (SSL) is a promising way for leveraging these unlabeled data. However, existing SSL methods ignore frequency domain and region-level information and it is important for lesion regions located at low frequencies and with significant scale changes. In this paper, we introduce two consistency regularization strategies for semi-supervised medical image segmentation, including frequency domain consistency (FDC) to assist the feature learning in frequency domain and multi-granularity region similarity consistency (MRSC) to perform multi-scale region-level local context information feature learning. With the help of the proposed FDC and MRSC, we can leverage the powerful feature representation capability of them in an effective and efficient way. We perform comprehensive experiments on two datasets, and the results show that our method achieves large performance gains and exceeds other state-of-the-art methods.
