ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments

Maghsood Salimi; Mohammad Loni; Sara Afshar; Antonio Cicchetti; Marjan Sirjani

ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments

Maghsood Salimi, Mohammad Loni, Sara Afshar, Antonio Cicchetti, Marjan Sirjani

TL;DR

ConstScene addresses the need for robust perception in construction environments by introducing a large-scale, annotated semantic segmentation dataset that covers diverse weather, lighting, and lens-dirt conditions, plus monocular depth maps. The authors validate the dataset with baseline experiments using U-Net and SegFormer, achieving up to $mIoU$ of 82.27% on the original data (SegFormer-B5) and demonstrating that data augmentation improves performance. They show that adverse conditions challenge depth estimation, underscoring the importance of robust perception pipelines. The open-source dataset and code promote research on adversarial robustness and practical deployment for autonomous construction robotics.

Abstract

The increasing demand for autonomous machines in construction environments necessitates the development of robust object detection algorithms that can perform effectively across various weather and environmental conditions. This paper introduces a new semantic segmentation dataset specifically tailored for construction sites, taking into account the diverse challenges posed by adverse weather and environmental conditions. The dataset is designed to enhance the training and evaluation of object detection models, fostering their adaptability and reliability in real-world construction applications. Our dataset comprises annotated images captured under a wide range of different weather conditions, including but not limited to sunny days, rainy periods, foggy atmospheres, and low-light situations. Additionally, environmental factors such as the existence of dirt/mud on the camera lens are integrated into the dataset through actual captures and synthetic generation to simulate the complex conditions prevalent in construction sites. We also generate synthetic images of the annotations including precise semantic segmentation masks for various objects commonly found in construction environments, such as wheel loader machines, personnel, cars, and structural elements. To demonstrate the dataset's utility, we evaluate state-of-the-art object detection algorithms on our proposed benchmark. The results highlight the dataset's success in adversarial training models across diverse conditions, showcasing its efficacy compared to existing datasets that lack such environmental variability.

ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments

TL;DR

of 82.27% on the original data (SegFormer-B5) and demonstrating that data augmentation improves performance. They show that adverse conditions challenge depth estimation, underscoring the importance of robust perception pipelines. The open-source dataset and code promote research on adversarial robustness and practical deployment for autonomous construction robotics.

Abstract

Paper Structure (15 sections, 3 figures, 6 tables)

This paper contains 15 sections, 3 figures, 6 tables.

Introduction
Related Work
Data Collection and Preparation
Data Collection and Sensor Setup
Data Annotation
Data Augmentation
Depth Estimation
Dataset Statistics
Experimental Results
Evaluation Metrics
Experimental Setup
Prediction Results
Advantages of $\texttt{ConstScene}$ in Defending Against Adversarial Attacks
Conclusion and Future Work
Acknowledgements

Figures (3)

Figure 1: (a) Original image. (b) Overlay mask of the original image. (c) Augmented image. (d) Overlay mask of the augmented image.
Figure 2: Visual representation of samples from our dataset under various weather conditions, accompanied by the corresponding disparity map. The red circles indicate the disparity region with low-confidence matches.
Figure 3: Distribution of images in each class, including background (BG), Car, wheel-loader (WL), crusher, human, pile, and road.

ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments

TL;DR

Abstract

ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments

Authors

TL;DR

Abstract

Table of Contents

Figures (3)