Semiring Provenance for Lightweight Description Logics
Camille Bourgaux, Ana Ozaki, Rafael Peñaloza
TL;DR
This work develops a unified semiring provenance framework for description logics, extending the relational provenance paradigm to annotated ontologies in the lightweight ELHI family. It defines annotated-ontology semantics, canonical models, and provenance computations for axioms and queries, and studies how these notions align with fuzzy, possibilistic, and access-control interpretations under various semiring properties. The paper presents a completion-based approach to compute provenance for annotated assertions and BCQs, analyzes complexity across fragments like ${ m ELHI}^{n}_ot$ and ${ m ELHI}^{n,-}_ot$, and links the DL provenance to Datalog and relational semantics. It also explores specialized semirings (Why[X], PosBool[X], Lin[X]) and their connections to axiom pinpointing, lean kernels, and provenance-inspired explanations, offering both theoretical insights and algorithmic procedures for practical provenance management in ontology-based reasoning. The results illuminate when provenance behaves consistently with database provenance and how to leverage canonical models for efficient reasoning, with implications for explainability and justification in Semantic Web applications. Overall, the framework enables traceable, semantically meaningful explanations of DL entailments and query answers across a spectrum of annotation regimes while preserving favorable computational properties in targeted fragments.
Abstract
We investigate semiring provenance--a successful framework originally defined in the relational database setting--for description logics. In this context, the ontology axioms are annotated with elements of a commutative semiring and these annotations are propagated to the ontology consequences in a way that reflects how they are derived. We define a provenance semantics for a language that encompasses several lightweight description logics and show its relationships with semantics that have been defined for ontologies annotated with a specific kind of annotation (such as fuzzy degrees). We show that under some restrictions on the semiring, the semantics satisfies desirable properties (such as extending the semiring provenance defined for databases). We then focus on the well-known why-provenance, for which we study the complexity of problems related to the provenance of an assertion or a conjunctive query answer. Finally, we consider two more restricted cases which correspond to the so-called positive Boolean provenance and lineage in the database setting. For these cases, we exhibit relationships with well-known notions related to explanations in description logics and complete our complexity analysis. As a side contribution, we provide conditions on an $\mathcal{ELHI}_\bot$ ontology that guarantee tractable reasoning.
