XMutant: XAI-based Fuzzing for Deep Learning Systems
Xingcheng Chen, Matteo Biagiola, Vincenzo Riccio, Marcelo d'Amorim, Andrea Stocco
TL;DR
XMutant introduces an XAI-guided semantic fuzzing approach for DL testing, leveraging local explanations to focus mutations on high-attention input regions. The method is evaluated across text, image, and system-level driving tasks, showing increased failure exposure and faster fault discovery while maintaining high input validity and label preservation. Results indicate task-specific best configurations, with XMutant outperforming a state-of-the-art semantic-based tester and several gradient-based baselines, particularly when using attribution-based explanations. The work demonstrates that targeted, explanation-informed perturbations can improve DL generalization testing without sacrificing realism, and highlights avenues for efficiency gains and broader applicability. The findings suggest practical benefits for developers and researchers seeking scalable, valid semantic test generation guided by explainability signals.
Abstract
Semantic-based test generators are widely used to produce failure-inducing inputs for Deep Learning (DL) systems. They typically generate challenging test inputs by applying random perturbations to input semantic concepts until a failure is found or a timeout is reached. However, such randomness may hinder them from efficiently achieving their goal. This paper proposes XMutant, a technique that leverages explainable artificial intelligence (XAI) techniques to generate challenging test inputs. XMutant uses the local explanation of the input to inform the fuzz testing process and effectively guide it toward failures of the DL system under test. We evaluated different configurations of XMutant in triggering failures for different DL systems both for model-level (sentiment analysis, digit recognition) and system-level testing (advanced driving assistance). Our studies showed that XMutant enables more effective and efficient test generation by focusing on the most impactful parts of the input. XMutant generates up to 125% more failure-inducing inputs compared to an existing baseline, up to 7X faster. We also assessed the validity of these inputs, maintaining a validation rate above 89%, according to automated and human validators.
