Diagnosing LLMs via Information Spectrum Analysis: Tail Behavior and the Effects of Side Information

Yuuki Tachioka


Abstract

Large language models (LLMs) exhibit non-stationary generation: their output distributions shift with prompts, retrieved documents, and decoding conditions. Under such variability, average likelihood metrics can obscure heterogeneous behaviors across samples, especially in high-surprisal tails where failures often occur. We propose an information-spectrum-based diagnostic framework that treats LLMs as general sources without assuming stationarity, ergodicity, or the asymptotic equipartition property. We define sequence-level self-information density (coding rate; mean surprisal) and construct an empirical information spectrum from finite samples, enabling operational estimates of spectrum quantiles and width. We further introduce an information gain spectrum, a teacher-forced likelihood-based measure that evaluates the same generated sequence with and without side information. Across multiple Japanese LLMs and QA settings, we observe that correctness differences are often more visible in the high-surprisal tail than in the mean coding rate, and that side information can reshape tail behavior in heterogeneous ways across sequences. We also observe that instruction tuning changes the spectrum structure, making tail statistics and spectrum width more predictive of correctness than the mean coding rate. Overall, our analysis illustrates how spectrum-based diagnostics complement average-based metrics for understanding conditional generation.
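The quantities named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names, the nat-based units, the 5%/95% quantile levels, the definition of width as the difference between two quantiles, and the definition of information gain as a difference of coding rates are all assumptions made for illustration. The inputs are per-token log-probabilities of already generated sequences, as teacher forcing would produce.

```python
import math
from typing import Dict, Sequence


def coding_rate(token_logprobs: Sequence[float]) -> float:
    """Mean surprisal of one sequence in nats per token: -(1/n) * sum(log p)."""
    if not token_logprobs:
        raise ValueError("sequence must contain at least one token")
    return -sum(token_logprobs) / len(token_logprobs)


def empirical_quantile(values: Sequence[float], q: float) -> float:
    """Smallest sample value v such that at least a fraction q of samples are <= v."""
    if not values or not 0.0 < q <= 1.0:
        raise ValueError("need a non-empty sample and 0 < q <= 1")
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def spectrum_summary(rates: Sequence[float], lo: float = 0.05, hi: float = 0.95) -> Dict[str, float]:
    """Summarize an empirical spectrum of per-sequence coding rates.

    Assumption: 'width' is taken as the gap between the hi and lo quantiles.
    """
    q_lo = empirical_quantile(rates, lo)
    q_hi = empirical_quantile(rates, hi)
    return {
        "mean": sum(rates) / len(rates),
        "q_lo": q_lo,
        "q_hi": q_hi,  # high-surprisal tail
        "width": q_hi - q_lo,
    }


def information_gain(logprobs_without: Sequence[float], logprobs_with: Sequence[float]) -> float:
    """Per-sequence gain from side information for the SAME generated tokens.

    Assumption: gain = coding rate without side information minus coding rate
    with it, so a positive value means the side information made the sequence
    less surprising.
    """
    if len(logprobs_without) != len(logprobs_with):
        raise ValueError("both scorings must cover the same tokens")
    return coding_rate(logprobs_without) - coding_rate(logprobs_with)
```

Applying `spectrum_summary` to the coding rates of correct and incorrect answers separately would give a tail-versus-mean comparison of the kind the abstract describes, while collecting `information_gain` over many sequences yields an empirical gain distribution rather than a single average.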

Anthology ID: 2026.findings-acl.594
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 12231–12253
URL: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.594/
Cite (ACL): Yuuki Tachioka. 2026. Diagnosing LLMs via Information Spectrum Analysis: Tail Behavior and the Effects of Side Information. In Findings of the Association for Computational Linguistics: ACL 2026, pages 12231–12253, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Diagnosing LLMs via Information Spectrum Analysis: Tail Behavior and the Effects of Side Information (Tachioka, Findings 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.594.pdf
Checklist: 2026.findings-acl.594.checklist.pdf
