Semantic Information: A Difference That Makes a Difference

J. Nathanael Philipp, Max Kölbl, Michael Richter


Abstract
In the framework of distributional semantics, we introduce a novel notion and operationalisation of semantic information for natural language. The key idea is as follows: a linguistic sign carries semantic information about a document if it reduces the amount of surprisal for a language processor. We consider two systems, an informed one and an uninformed one, and describe semantic information in their terms. Processing effort is quantified via surprisal where the informed system is ‘aware’ of the linguistic sign and the uninformed one is not. On an English fairy tale corpus and on two German news corpora, we tested successfully the prediction that if the linguistic sign in question carries pre-information through semantic surprisal, the current level of surprisal for the language processor is reduced. The conclusion is that the degree of semantic information results from the degree of semantic prior information.
Anthology ID:
2026.lrec-main.883
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
11300–11308
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.883/
DOI:
Bibkey:
Cite (ACL):
J. Nathanael Philipp, Max Kölbl, and Michael Richter. 2026. Semantic Information: A Difference That Makes a Difference. International Conference on Language Resources and Evaluation, main:11300–11308.
Cite (Informal):
Semantic Information: A Difference That Makes a Difference (Philipp et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.883.pdf