LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators

Mateusz Lango, Ondrej Dusek


Abstract
We present a novel neurosymbolic framework for RDF-to-text generation, in which the model is “trained” through collaborative interactions among multiple LLM agents rather than traditional backpropagation. The LLM agents produce rule-based Python code for a generator for the given domain, based on RDF triples only, with no in-domain human reference texts. The resulting system is fully interpretable, requires no supervised training data, and generates text nearly instantaneously using only a single CPU. Our experiments on the WebNLG and OpenDialKG data show that outputs produced by our approach reduce hallucination, with only slight fluency penalties compared to finetuned or prompted language models.
Anthology ID:
2025.emnlp-industry.142
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:
November
Year:
2025
Address:
Suzhou (China)
Editors:
Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2025–2040
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.142/
DOI:
Bibkey:
Cite (ACL):
Mateusz Lango and Ondrej Dusek. 2025. LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 2025–2040, Suzhou (China). Association for Computational Linguistics.
Cite (Informal):
LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators (Lango & Dusek, EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.142.pdf