UMSPU: Universal Multi-Size Phase Unwrapping via Mutual Self-Distillation and Adaptive Boosting Ensemble Segmenters
Lintong Du, Huazhen Liu, Yijia Zhang, ShuXin Liu, Yuan Qu, Zenghui Zhang, Jiamiao Yang
TL;DR
UMSPU addresses the challenge of high-resolution phase unwrapping by introducing two core innovations: Mutual Self-Distillation (MSD) for cross-layer, cross-resolution semantic learning, and an adaptive boosting ensemble of segmenters to cover a broad range of spatial frequencies, complemented by a curl loss to enforce irrotational gradient fields. Phase reconstruction leverages a Discrete Cosine Transform (DCT) to recover wrap counts from predicted gradients, enabling accurate unwrapping over a 64-fold resolution range with high throughput (approximately $22.66$ ms per high-resolution image and over 40 FPS). Empirical results show that UMSPU outperforms six baselines across resolutions and fringe densities, with strong generalization to structured light and InSAR data, and robustness under translation and rotation. The proposed approach offers a practical, scalable solution for industrial-scale phase measurements, pushing deep-learning-based phase unwrapping toward real-world deployment.
Abstract
Spatial phase unwrapping is a key technique for extracting phase information to obtain 3D morphology and other features. Modern industrial measurement scenarios demand high precision, large image sizes, and high speed. However, conventional methods struggle with noise resistance and processing speed. Current deep learning methods are limited by the receptive field size and sparse semantic information, making them ineffective for large size images. To address this issue, we propose a mutual self-distillation (MSD) mechanism and adaptive boosting ensemble segmenters to construct a universal multi-size phase unwrapping network (UMSPU). MSD performs hierarchical attention refinement and achieves cross-layer collaborative learning through bidirectional distillation, ensuring fine-grained semantic representation across image sizes. The adaptive boosting ensemble segmenters combine weak segmenters with different receptive fields into a strong one, ensuring stable segmentation across spatial frequencies. Experimental results show that UMSPU overcomes image size limitations, achieving high precision across image sizes ranging from 256*256 to 2048*2048 (an 8 times increase). It also outperforms existing methods in speed, robustness, and generalization. Its practicality is further validated in structured light imaging and InSAR. We believe that UMSPU offers a universal solution for phase unwrapping, with broad potential for industrial applications.
