Semi-Supervised 360 Layout Estimation with Panoramic Collaborative Perturbations
Junsong Zhang, Chunyu Lin, Zhijie Shen, Lang Nie, Kang Liao, Yao Zhao
TL;DR
SemiLayout360 tackles the challenge of limited annotations in 360-degree indoor layout estimation by embedding panoramic priors into perturbations within a Mean-Teacher semi-supervised framework. It introduces panoramic layout priors to sharpen boundaries and distortion priors to model non-uniform distortion, then couples them into panoramic collaborative perturbations that balance effectiveness with convergence stability. Building on DOPNet, the method uses image, feature, and network perturbations, along with a ramped consistency loss that leverages unlabeled data, achieving state-of-the-art results on PanoContext, Stanford2D3D, and MatterportLayout with limited labels. The approach demonstrates that task-specific priors in perturbations can substantially boost semi-supervised 360 layout estimation, reducing annotation costs while improving accuracy and robustness in both cuboid and non-cuboid room layouts.
Abstract
The performance of existing supervised layout estimation methods heavily relies on the quality of data annotations. However, obtaining large-scale and high-quality datasets remains a laborious and time-consuming challenge. To solve this problem, semi-supervised approaches are introduced to relieve the demand for expensive data annotations by encouraging the consistent results of unlabeled data with different perturbations. However, existing solutions merely employ vanilla perturbations, ignoring the characteristics of panoramic layout estimation. In contrast, we propose a novel semi-supervised method named SemiLayout360, which incorporates the priors of the panoramic layout and distortion through collaborative perturbations. Specifically, we leverage the panoramic layout prior to enhance the model's focus on potential layout boundaries. Meanwhile, we introduce the panoramic distortion prior to strengthen distortion awareness. Furthermore, to prevent intense perturbations from hindering model convergence and ensure the effectiveness of prior-based perturbations, we divide and reorganize them as panoramic collaborative perturbations. Our experimental results on three mainstream benchmarks demonstrate that the proposed method offers significant advantages over existing state-of-the-art (SoTA) solutions.
