Found in Translation: semantic approaches for enhancing AI interpretability in face verification
Miriam Doh, Caroline Mazini Rodrigues, N. Boutry, L. Najman, Matei Mancas, Bernard Gosselin
TL;DR
In face verification, the paper frames explainability as aligning model decisions with human semantic understanding by introducing semantic feature sets derived from Mediapipe landmarks and a hybrid global-local XAI framework. It integrates multiple concept-extraction approaches (LIME, MAGE EaOC, KernelSHAP), a weighted single-removal similarity map (S0), and LLM-generated textual explanations to produce human-friendly narratives. Quantitative occlusion analyses and a user study (61 participants) show that semantic explanations, especially with the finest SET_2 granularity, yield clearer, more detailed, and more human-aligned interpretations than traditional pixel-based heatmaps. By combining semantic concepts with narrative explanations, the work advances XAI 2.0, aiming to foster trust and acceptance in critical applications of face verification; it also highlights how global concept aggregation and local attributions can be synergistically used to diagnose model decisions.
Abstract
The increasing complexity of machine learning models in computer vision, particularly in face verification, requires the development of explainable artificial intelligence (XAI) to enhance interpretability and transparency. This study extends previous work by integrating semantic concepts derived from human cognitive processes into XAI frameworks to bridge the comprehension gap between model outputs and human understanding. We propose a novel approach combining global and local explanations, using semantic features defined by user-selected facial landmarks to generate similarity maps and textual explanations via large language models (LLMs). The methodology was validated through quantitative experiments and user feedback, demonstrating improved interpretability. Results indicate that our semantic-based approach, particularly the most detailed set, offers a more nuanced understanding of model decisions than traditional methods. User studies highlight a preference for our semantic explanations over traditional pixelbased heatmaps, emphasizing the benefits of human-centric interpretability in AI. This work contributes to the ongoing efforts to create XAI frameworks that align AI models behaviour with human cognitive processes, fostering trust and acceptance in critical applications.
