Improving Object Detection by Modifying Synthetic Data with Explainable AI

Nitish Mital; Simon Malzard; Richard Walters; Celso M. De Melo; Raghuveer Rao; Victoria Nockles

Improving Object Detection by Modifying Synthetic Data with Explainable AI

Nitish Mital, Simon Malzard, Richard Walters, Celso M. De Melo, Raghuveer Rao, Victoria Nockles

TL;DR

This work introduces a novel, human-in-the-loop framework that uses robust Explainable AI to guide the design of synthetic data for object detection. By applying SHAP saliency maps to identify unique and common features between classes, it iteratively modifies 3D mesh models in Unity to reinforce or disrupt features, improving detector performance even on unseen orientations. Across infrared vehicle detection tasks, initial synthetic data boosted mAP50 by 4.6%, and SHAP-guided refinements yielded an extra 1.5% gain, while also reducing misclassifications. The approach reduces human workload in dataset curation and holds promise for automation and broader application to robust, unbiased detection systems.

Abstract

Limited real-world data severely impacts model performance in many computer vision domains, particularly for samples that are underrepresented in training. Synthetically generated images are a promising solution, but 1) it remains unclear how to design synthetic training data to optimally improve model performance (e.g, whether and where to introduce more realism or more abstraction) and 2) the domain expertise, time and effort required from human operators for this design and optimisation process represents a major practical challenge. Here we propose a novel conceptual approach to improve the efficiency of designing synthetic images, by using robust Explainable AI (XAI) techniques to guide a human-in-the-loop process of modifying 3D mesh models used to generate these images. Importantly, this framework allows both modifications that increase and decrease realism in synthetic data, which can both improve model performance. We illustrate this concept using a real-world example where data are sparse; detection of vehicles in infrared imagery. We fine-tune an initial YOLOv8 model on the ATR DSIAC infrared dataset and synthetic images generated from 3D mesh models in the Unity gaming engine, and then use XAI saliency maps to guide modification of our Unity models. We show that synthetic data can improve detection of vehicles in orientations unseen in training by 4.6% (to mAP50 = 94.6%). We further improve performance by an additional 1.5% (to 96.1%) through our new XAI-guided approach, which reduces misclassifications through both increasing and decreasing the realism of different parts of the synthetic data. Our proof-of-concept results pave the way for fine, XAI-controlled curation of synthetic datasets tailored to improve object detection performance, whilst simultaneously reducing the burden on human operators in designing and optimising these datasets.

Improving Object Detection by Modifying Synthetic Data with Explainable AI

TL;DR

Abstract

Improving Object Detection by Modifying Synthetic Data with Explainable AI

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (25)