Exploring Graph Representations of Logical Forms for Language Modeling

Michael Sullivan


Abstract
We make the case for language models over logical forms (LFLMs), arguing that such models are more data-efficient than their textual counterparts. To that end, we introduce the Graph-based Formal-Logical Distributional Semantics (GFoLDS) prototype, a pretrained LM over graph representations of logical forms, as a proof of concept for LFLMs. Using GFoLDS, we present strong experimental evidence that LFLMs can leverage the basic linguistic knowledge built into such models to immediately begin learning more complex patterns. On downstream tasks, we show that GFoLDS vastly outperforms textual transformer LMs (BERT) pretrained on the same data, indicating that LFLMs can learn with substantially less data than models over plain text. Furthermore, we show that the performance of this model is likely to scale with additional parameters and pretraining data, suggesting the viability of LFLMs in real-world applications.
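To make the central idea concrete, the sketch below builds a toy graph of the kind an LFLM might consume: nodes carry semantic predicates and directed edges carry labeled argument roles. The predicate names, role labels, and the logical_form_to_graph helper are hypothetical assumptions for exposition only, not the paper's actual GFoLDS pipeline.

```python
# Minimal sketch (assumed, illustrative): encoding a logical form as a
# labeled directed graph. Predicate and role names are hypothetical,
# loosely modeled on MRS-style predications.
import networkx as nx

def logical_form_to_graph(predications, links):
    """Build a directed graph whose nodes are predicates and whose
    edges are argument-role links between them."""
    g = nx.DiGraph()
    for node_id, predicate in predications:
        g.add_node(node_id, label=predicate)
    for src, dst, role in links:
        g.add_edge(src, dst, label=role)
    return g

# Toy logical form for "the dog chased a cat":
# _chase_v takes _dog_n as ARG1 and _cat_n as ARG2;
# each quantifier restricts its noun via RSTR.
predications = [(0, "_the_q"), (1, "_dog_n"), (2, "_chase_v"),
                (3, "_a_q"), (4, "_cat_n")]
links = [(2, 1, "ARG1"), (2, 4, "ARG2"),
         (0, 1, "RSTR"), (3, 4, "RSTR")]

g = logical_form_to_graph(predications, links)
print(sorted(g.nodes(data="label")))  # [(0, '_the_q'), (1, '_dog_n'), ...]
print(sorted(g.edges(data="label")))  # [(0, 1, 'RSTR'), (2, 1, 'ARG1'), ...]
```

A graph of this shape could then be fed to a graph encoder with a masked-node objective, analogous to masked-token pretraining over text; that pairing is an assumption here, sketched only to show why argument structure is explicit in the input rather than something the model must induce.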
Anthology ID:
2025.findings-acl.635
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
12285–12307
URL:
https://preview.aclanthology.org/landing_page/2025.findings-acl.635/
Cite (ACL):
Michael Sullivan. 2025. Exploring Graph Representations of Logical Forms for Language Modeling. In Findings of the Association for Computational Linguistics: ACL 2025, pages 12285–12307, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Exploring Graph Representations of Logical Forms for Language Modeling (Sullivan, Findings 2025)
PDF:
https://preview.aclanthology.org/landing_page/2025.findings-acl.635.pdf