Logic-Level Evaluation of Logical Table-to-Text Generation

Lena Trigg, Dean F. Hougen, Ahsan Bilal


Abstract
Logical Table-to-Text (LT2T) generation aims to produce natural-language sentences that are logically faithful to structured tabular data. While recent Large Language Models (LLMs) show high performance on aggregate fidelity metrics, these scores provide only a coarse view of performance, obscuring specific logic-type reasoning failures and models’ meta-logical awareness. We propose an operation-aware diagnostic framework that evaluates four core competencies: (1) Logical Form (LF) execution accuracy, (2) fidelity of LF-conditioned generation, (3) logic-type identification, and (4) LF-free generation.We apply this framework to a suite of frontier LLMs and perform fine-grained analysis across logic types such as aggregation, ordinal, and superlative reasoning. Our results show that LT2T fidelity assessment can be unstable; the choice of verifier and logic type can substantially alter conclusions and model rankings. Crucially, we identify a meta-logical gap: models often generate faithful statements while failing to identify the underlying operation.
Anthology ID:
2026.conll-main.41
Volume:
Proceedings of the 30th Conference on Computational Natural Language Learning
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Claire Bonial, Yevgeni Berzak
Venues:
CoNLL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
677–691
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.41/
DOI:
Bibkey:
Cite (ACL):
Lena Trigg, Dean F. Hougen, and Ahsan Bilal. 2026. Logic-Level Evaluation of Logical Table-to-Text Generation. In Proceedings of the 30th Conference on Computational Natural Language Learning, pages 677–691, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Logic-Level Evaluation of Logical Table-to-Text Generation (Trigg et al., CoNLL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.41.pdf