PCEvE: Part Contribution Evaluation Based Model Explanation for Human Figure Drawing Assessment and Beyond
Jongseo Lee, Geo Ahn, Seong Tae Kim, Jinwoo Choi
TL;DR
PCEvE introduces a part-based explanation framework for human figure drawing assessment by quantifying each predefined body part's contribution to a model decision using Shapley Values. By evaluating all possible part-combinations ($2^K$) and aggregating results at sample-, class-, and task-level, it yields intuitive part contribution histograms that align with human perception and extend beyond pixel-level attributions. The approach is validated across ASD screening, SCAT, and Stanford Cars, with additional sanity checks showing robustness to annotation quality and applicability to fine-grained visual categorization. The method holds promise for more transparent, hierarchical explanations in clinical-aided assessment and beyond, with potential integration with language models for richer descriptive explanations.
Abstract
For automatic human figure drawing (HFD) assessment tasks, such as diagnosing autism spectrum disorder (ASD) using HFD images, the clarity and explainability of a model decision are crucial. Existing pixel-level attribution-based explainable AI (XAI) approaches demand considerable effort from users to interpret the semantic information of a region in an image, which can be often time-consuming and impractical. To overcome this challenge, we propose a part contribution evaluation based model explanation (PCEvE) framework. On top of the part detection, we measure the Shapley Value of each individual part to evaluate the contribution to a model decision. Unlike existing attribution-based XAI approaches, the PCEvE provides a straightforward explanation of a model decision, i.e., a part contribution histogram. Furthermore, the PCEvE expands the scope of explanations beyond the conventional sample-level to include class-level and task-level insights, offering a richer, more comprehensive understanding of model behavior. We rigorously validate the PCEvE via extensive experiments on multiple HFD assessment datasets. Also, we sanity-check the proposed method with a set of controlled experiments. Additionally, we demonstrate the versatility and applicability of our method to other domains by applying it to a photo-realistic dataset, the Stanford Cars.
