UCloudNet: A Residual U-Net with Deep Supervision for Cloud Image Segmentation
Yijie Li, Hewei Wang, Shaofan Wang, Yee Hui Lee, Muhammad Salman Pathan, Soumyabrata Dev
TL;DR
The paper tackles cloud segmentation from ground-based sky images and introduces UCloudNet, a residual U-Net enhanced with deep supervision to achieve higher accuracy with reduced training time. By embedding residual connections in the encoder and employing auxiliary loss branches, the model improves feature extraction and accelerates convergence. Experiments on the SWINySEG dataset show that UCloudNet with $k=4$, deep supervision, and learning-rate decay attains state-of-the-art performance across multiple metrics, while requiring significantly fewer iterations. This work advances practical, real-time cloud segmentation by balancing accuracy and training efficiency, with potential extensions to multi-class segmentation and cloud depth estimation.
Abstract
Recent advancements in meteorology involve the use of ground-based sky cameras for cloud observation. Analyzing images from these cameras helps in calculating cloud coverage and understanding atmospheric phenomena. Traditionally, cloud image segmentation relied on conventional computer vision techniques. However, with the advent of deep learning, convolutional neural networks (CNNs) are increasingly applied for this purpose. Despite their effectiveness, CNNs often require many epochs to converge, posing challenges for real-time processing in sky camera systems. In this paper, we introduce a residual U-Net with deep supervision for cloud segmentation which provides better accuracy than previous approaches, and with less training consumption. By utilizing residual connection in encoders of UCloudNet, the feature extraction ability is further improved.
