Prompt-Time Ontology-Driven Symbolic Knowledge Capture with Large Language Models
Tolga Çöplü, Arto Bendiken, Andrii Skomorokhov, Eduard Bateiko, Stephen Cobb
TL;DR
The paper tackles enabling LLMs to learn personal information from user prompts by leveraging ontology-driven symbolic knowledge capture at prompt-time. It compares in-context learning with a fine-tuning approach and demonstrates that fine-tuning on a predefined ontology (subset of KNOW) efficiently teaches the model to extract and populate subject-predicate-object triples. A dataset of 175 prompts (143 training/test plus 32 generic) serialized in Turtle, focusing on 12 family-related concepts, shows that using eight prompts per concept and 18 fine-tuning epochs yields robust precision, recall, and F1 in knowledge capture. The work highlights feasibility and scalability of combining neural models with symbolic ontologies for personalized AI, with future work aimed at integrating the generated knowledge graph into knowledge utilization by the LLM and providing code and data publicly.
Abstract
In applications such as personal assistants, large language models (LLMs) must consider the user's personal information and preferences. However, LLMs lack the inherent ability to learn from user interactions. This paper explores capturing personal information from user prompts using ontology and knowledge-graph approaches. We use a subset of the KNOW ontology, which models personal information, to train the language model on these concepts. We then evaluate the success of knowledge capture using a specially constructed dataset. Our code and datasets are publicly available at https://github.com/HaltiaAI/paper-PTODSKC
