@inproceedings{ravikumar-etal-2026-lost,
    title = "Lost in Formatting: How Output Formats Skew {LLM} Performance on Information Extraction",
    author = "Ravikumar, Rishi and
      Ibrahim, Nuhu and
      Batista-Navarro, Riza",
    editor = "Demberg, Vera and
      Inui, Kentaro and
      Marquez, Llu{\'\i}s",
    booktitle = "Proceedings of the 19th Conference of the {E}uropean Chapter of the {A}ssociation for {C}omputational {L}inguistics (Volume 1: Long Papers)",
    month = mar,
    year = "2026",
    address = "Rabat, Morocco",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.256/",
    pages = "5498--5513",
    isbn = "979-8-89176-380-7",
    abstract = "We investigate how the choice of output format influences the performance of fine-tuned large language models on information extraction tasks. Based on over 280 experiments spanning multiple benchmarks, models and formats, we find that output formatting is a critical yet largely overlooked hyperparameter. Remarkably, in some cases, changing only the output format shifts F1 scores by over 40{\%} despite using the same model. We further observe that no single format consistently dominates across settings, and the optimal choice depends on factors like model family and dataset characteristics. Overall, these results demonstrate that informationally equivalent output formats can produce substantial performance variation, highlighting the need to treat output formatting as a key factor in building accurate and reliable information extraction systems.",
}
Markdown (Informal)
[Lost in Formatting: How Output Formats Skew LLM Performance on Information Extraction](https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.256/) (Ravikumar et al., EACL 2026)
ACL