Ontology Generation using Large Language Models

Anna Sofia Lippolis; Mohammad Javad Saeedizade; Robin Keskisärkkä; Sara Zuppiroli; Miguel Ceriani; Aldo Gangemi; Eva Blomqvist; Andrea Giovanni Nuzzolese

Ontology Generation using Large Language Models

Anna Sofia Lippolis, Mohammad Javad Saeedizade, Robin Keskisärkkä, Sara Zuppiroli, Miguel Ceriani, Aldo Gangemi, Eva Blomqvist, Andrea Giovanni Nuzzolese

TL;DR

This work investigates using large language models to draft OWL ontologies from natural-language requirements, introducing two prompting techniques—Memoryless CQbyCQ and Ontogenia—and evaluating them through a multi-dimensional framework. It presents a benchmark dataset of ten ontologies, 100 competency questions, and 29 user stories, and compares several LLMs across independent and incremental generation settings. The results show that OpenAI o1-preview with Ontogenia often yields higher-quality ontologies than baselines and novice human modellers, while also highlighting persistent issues such as superfluous elements and incorrect domain/range axioms. The study demonstrates the feasibility of LLM-assisted ontology drafting and emphasizes the need for comprehensive, human-in-the-loop evaluation and future work to further reduce errors and enhance practical tooling for ontology engineers.

Abstract

The ontology engineering process is complex, time-consuming, and error-prone, even for experienced ontology engineers. In this work, we investigate the potential of Large Language Models (LLMs) to provide effective OWL ontology drafts directly from ontological requirements described using user stories and competency questions. Our main contribution is the presentation and evaluation of two new prompting techniques for automated ontology development: Memoryless CQbyCQ and Ontogenia. We also emphasize the importance of three structural criteria for ontology assessment, alongside expert qualitative evaluation, highlighting the need for a multi-dimensional evaluation in order to capture the quality and usability of the generated ontologies. Our experiments, conducted on a benchmark dataset of ten ontologies with 100 distinct CQs and 29 different user stories, compare the performance of three LLMs using the two prompting techniques. The results demonstrate improvements over the current state-of-the-art in LLM-supported ontology engineering. More specifically, the model OpenAI o1-preview with Ontogenia produces ontologies of sufficient quality to meet the requirements of ontology engineers, significantly outperforming novice ontology engineers in modelling ability. However, we still note some common mistakes and variability of result quality, which is important to take into account when using LLMs for ontology authoring support. We discuss these limitations and propose directions for future research.

Ontology Generation using Large Language Models

TL;DR

Abstract

Ontology Generation using Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)