Improving Pallet Detection Using Synthetic Data
Henry Gann, Josiah Bull, Trevor Gee, Mahla Nejati
TL;DR
This paper tackles pallet detection in warehouse settings by exploiting synthetic data, including Unity-based scenes and domain randomisation, to train fast detectors like YOLOv8. A grid-search over hyper-parameters substantially boosts performance, yielding 69% and 50% gains in mAP50 for stacked and racked pallets, though individual pallets see a decline. The study also explores lighting robustness, a two-stage YOLOv8+SAM detector (which proves unstable), and domain randomisation via Isaac Sim, finding DR data can match Unity in effectiveness but may require careful data-volume and epoch management. The work highlights the potential of synthetic data to bridge the sim-to-real gap for complex pallet configurations, while outlining practical considerations for lighting, model selection, and data-generation strategies in real warehouses.
Abstract
The use of synthetic data in machine learning saves a significant amount of time when implementing an effective object detector. However, there is limited research in this domain. This study aims to improve upon previously applied implementations in the task of instance segmentation of pallets in a warehouse environment. This study proposes using synthetically generated domain-randomised data as well as data generated through Unity to achieve this. This study achieved performance improvements on the stacked and racked pallet categories by 69% and 50% mAP50, respectively when being evaluated on real data. Additionally, it was found that there was a considerable impact on the performance of a model when it was evaluated against images in a darker environment, dropping as low as 3% mAP50 when being evaluated on images with an 80% brightness reduction. This study also created a two-stage detector that used YOLOv8 and SAM, but this proved to have unstable performance. The use of domain-randomised data proved to have negligible performance improvements when compared to the Unity-generated data.
