Training Data Attribution for Image Generation using Ontology-Aligned Knowledge Graphs
Theodoros Aivalis, Iraklis A. Klampanos, Antonis Troumpoukis, Joemon M. Jose
TL;DR
The paper tackles transparency and copyright attribution in generative image systems by proposing ontology-guided knowledge graphs (KGs) constructed from images via multimodal LLMs. It introduces an end-to-end pipeline that extracts ontology-aligned semantic triples, stores per-image KGs, and performs graph-based retrieval to trace training-data influence, complemented by unlearning experiments. The authors validate the approach with locally trained fashion data and a stylised Ghibli-domain, showing competitive attribution performance and enhanced interpretability over latent embeddings. The work advances dataset transparency and attribution in generative AI and suggests scalable extensions to broader domains and richer semantic reasoning.
Abstract
As generative models become powerful, concerns around transparency, accountability, and copyright violations have intensified. Understanding how specific training data contributes to a model's output is critical. We introduce a framework for interpreting generative outputs through the automatic construction of ontologyaligned knowledge graphs (KGs). While automatic KG construction from natural text has advanced, extracting structured and ontology-consistent representations from visual content remains challenging -- due to the richness and multi-object nature of images. Leveraging multimodal large language models (LLMs), our method extracts structured triples from images, aligned with a domain-specific ontology. By comparing the KGs of generated and training images, we can trace potential influences, enabling copyright analysis, dataset transparency, and interpretable AI. We validate our method through experiments on locally trained models via unlearning, and on large-scale models through a style-specific experiment. Our framework supports the development of AI systems that foster human collaboration, creativity and stimulate curiosity.
