LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators
Mateusz Lango, Ondřej Dušek
TL;DR
This paper introduces a neurosymbolic framework where multiple LLM agents collectively design a rule-based NLG system in Python to verbalize RDF triples, eliminating the need for supervised in-domain data and enabling fast CPU-only inference. The approach yields an interpretable, maintainable generator that reduces hallucinations compared with neural baselines and remains competitive on standard metrics. Extensive experiments across WebNLG, OpenDialKG, and GEM 2024 show favorable performance, strong speedups, and solid human judgments, while ablations and interpretability studies validate the design choices. The work highlights a practical path to transparent, efficient RDF-to-text generation with potential for manual refinement and safe deployment.
Abstract
We present a novel neurosymbolic framework for RDF-to-text generation, in which the model is "trained" through collaborative interactions among multiple LLM agents rather than traditional backpropagation. The LLM agents produce rule-based Python code for a generator for the given domain, based on RDF triples only, with no in-domain human reference texts. The resulting system is fully interpretable, requires no supervised training data, and generates text nearly instantaneously using only a single CPU. Our experiments on the WebNLG and OpenDialKG data show that outputs produced by our approach reduce hallucination, with only slight fluency penalties compared to finetuned or prompted language models
