FaceOracle: Chat with a Face Image Oracle
Wassim Kabbani, Kiran Raja, Raghavendra Ramachandra, Christoph Busch
TL;DR
FaceOracle addresses the need for domain-specific, explainable FIQA analysis against international standards by marrying Retrieval Augmented Generation with LLM-powered autonomous agents to access private standards and the OFIQ FIQA algorithms, ensuring source-grounded results. It formalizes evaluation criteria and introduces a dataset to measure correctness, relevance, faithfulness, and context grounding, demonstrating superior tool selection and source-grounded responses relative to ChatGPT within a FIQA workflow. The work highlights a practical pathway to more efficient, auditable decisions in identity document issuance and lays groundwork for extending to detection of morphing and manipulations, with outputs expressed in a unified quality score ranging from $0$ to $100$ where applicable. Overall, FaceOracle represents a significant step toward trusted, explainable AI-assisted FIQA analysis that can be integrated into issuing authorities’ existing processes.
Abstract
A face image is a mandatory part of ID and travel documents. Obtaining high-quality face images when issuing such documents is crucial for both human examiners and automated face recognition systems. In several international standards, face image quality requirements are intricate and defined in detail. Identifying and understanding non-compliance or defects in the submitted face images is crucial for both issuing authorities and applicants. In this work, we introduce FaceOracle, an LLM-powered AI assistant that helps its users analyze a face image in a natural conversational manner using standard compliant algorithms. Leveraging the power of LLMs, users can get explanations of various face image quality concepts as well as interpret the outcome of face image quality assessment (FIQA) algorithms. We implement a proof-of-concept that demonstrates how experts at an issuing authority could integrate FaceOracle into their workflow to analyze, understand, and communicate their decisions more efficiently, resulting in enhanced productivity.
