Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation
Deyi Ji, Feng Zhao, Hongtao Lu, Feng Wu, Jieping Ye
TL;DR
This work introduces Structural and Statistical Texture Knowledge Distillation (SSTKD) to address the lack of low-level texture guidance in semantic segmentation. It pairs a structural pathway (Contourlet Decomposition Module) with a statistical pathway (Texture Intensity Equalization Module) to distill texture knowledge from a teacher to a student, supervised by dedicated losses and integrated into STLNet++ and U-SSNet. Ablations demonstrate that both texture types contribute and interact synergistically, yielding state-of-the-art results on Cityscapes, ADE20K, and Pascal VOC 2012, and strong performance on ultra-high-resolution segmentation. The proposed framework generalizes beyond segmentation to dense prediction tasks like object detection, offering a practical approach to incorporate low-level texture into modern deep learning models.
Abstract
Low-level texture feature/knowledge is also of vital importance for characterizing the local structural pattern and global statistical properties, such as boundary, smoothness, regularity, and color contrast, which may not be well addressed by high-level deep features. In this paper, we aim to re-emphasize the low-level texture information in deep networks for semantic segmentation and related knowledge distillation tasks. To this end, we take full advantage of both structural and statistical texture knowledge and propose a novel Structural and Statistical Texture Knowledge Distillation (SSTKD) framework for semantic segmentation. Specifically, Contourlet Decomposition Module (CDM) is introduced to decompose the low-level features with iterative Laplacian pyramid and directional filter bank to mine the structural texture knowledge, and Texture Intensity Equalization Module (TIEM) is designed to extract and enhance the statistical texture knowledge with the corresponding Quantization Congruence Loss (QDL). Moreover, we propose the Co-occurrence TIEM (C-TIEM) and generic segmentation frameworks, namely STLNet++ and U-SSNet, to enable existing segmentation networks to harvest the structural and statistical texture information more effectively. Extensive experimental results on three segmentation tasks demonstrate the effectiveness of the proposed methods and their state-of-the-art performance on seven popular benchmark datasets, respectively.
