TransBox: EL++-closed Ontology Embedding
Hui Yang, Jiaoyan Chen, Uli Sattler
TL;DR
This paper tackles the challenge of embedding OWL ontologies with DL semantics by introducing EL^{++}-closed ontology embeddings and the TransBox method. TransBox represents concepts as boxes and roles as box-based translations, enabling closure under conjunction, existential quantification, and role composition, and extends box intersections to a space that includes the empty set to preserve meaningful overlaps. The authors also propose semantic and intersection enhancements, along with training objectives (distance, inclusion, negative sampling, regularization) to learn robust embeddings. Empirical results on GALEN, GO, and Anatomy show TransBox often achieves state-of-the-art performance for predicting complex axioms, with theoretical soundness proofs and a detailed ablation study. The work advances ontology learning and reasoning by enabling accurate inference over complex EL^{++} constructs and provides a foundation for future extensions to more expressive DLs and hybrid language-geometry approaches.
Abstract
OWL (Web Ontology Language) ontologies, which are able to represent both relational and type facts as standard knowledge graphs and complex domain knowledge in Description Logic (DL) axioms, are widely adopted in domains such as healthcare and bioinformatics. Inspired by the success of knowledge graph embeddings, embedding OWL ontologies has gained significant attention in recent years. Current methods primarily focus on learning embeddings for atomic concepts and roles, enabling the evaluation based on normalized axioms through specially designed score functions. However, they often neglect the embedding of complex concepts, making it difficult to infer with more intricate axioms. This limitation reduces their effectiveness in advanced reasoning tasks, such as Ontology Learning and ontology-mediated Query Answering. In this paper, we propose EL++-closed ontology embeddings which are able to represent any logical expressions in DL via composition. Furthermore, we develop TransBox, an effective EL++-closed ontology embedding method that can handle many-to-one, one-to-many and many-to-many relations. Our extensive experiments demonstrate that TransBox often achieves state-of-the-art performance across various real-world datasets for predicting complex axioms.
