SydneyScapes: Image Segmentation for Australian Environments
Hongyu Lyu, Julie Stephany Berrio, Mao Shan, Stewart Worrall
TL;DR
The paper addresses the gap in locally relevant data for autonomous vehicle perception in Australia by introducing SydneyScapes, a 756-image dataset with pixel-level semantic, instance, and panoptic annotations collected in New South Wales. It covers data collection (Day/Night/People subsets), a privacy-preserving anonymisation pipeline, a seven-group labeling scheme with train/validation splits, and a Colab-based visualization tool. The authors benchmark semantic and instance segmentation using Cityscapes-pretrained models and demonstrate significant gains from local fine-tuning, with Transformer-based methods (e.g., Mask2Former) achieving the highest performance and showing robustness to domain shifts in Australian scenes. Overall, SydneyScapes provides a practical resource and benchmarking baseline to advance perception algorithms tailored to Australian urban environments and supports domain adaptation for AV deployment.
Abstract
Autonomous Vehicles (AVs) are being partially deployed and tested across various global locations, including China, the USA, Germany, France, Japan, Korea, and the UK, but with limited demonstrations in Australia. The integration of machine learning (ML) into AV perception systems highlights the need for locally labelled datasets to develop and test algorithms in specific environments. To address this, we introduce SydneyScapes - a dataset tailored for computer vision tasks of image semantic, instance, and panoptic segmentation. This dataset, collected from Sydney and surrounding cities in New South Wales (NSW), Australia, consists of 756 images with high-quality pixel-level annotations. It is designed to assist AV industry and researchers by providing annotated data and tools for algorithm development, testing, and deployment in the Australian context. Additionally, we offer benchmarking results using state-of-the-art algorithms to establish reference points for future research and development. The dataset is publicly available at https://hdl.handle.net/2123/33051.
