Semantic-pragmatic Annotations in the Prague Dependency Treebank

Marie Mikulová, Eva Hajičová, Jiří Mírovský, Anna Nedoluzhko, Michal Novák, Pavlína Synková, Jan Štěpánek, Barbora Štěpánková, Jan Hajič


Abstract
We present the semantic-pragmatic specification and annotation (ellipsis, coreference, bridging and discourse relations, information structure, scope of negation) in the multi-layer, genre-diversified, 3+ million-token Prague Dependency Treebank – Consolidated 2.0. While morphology and syntax work almost exclusively at the sentence level, semantic-pragmatic phenomena often relate to two or more neighbouring sentences and possibly to an extra-linguistic context. In this contribution, we describe these phenomena from both the linguistic perspective (form of expression, relation to syntax and morphology) and the cognitive perspective (relation to context and real-world knowledge, as well as to related processes such as thinking or reasoning), classifying the possible relations between semantic-pragmatic units into cognitively plausible, distinguishable, and human-understandable categories. We have applied our results to the corpus by annotating it in its entirety. The resulting dataset is publicly and freely available for verification and further investigation of these and other phenomena.
Anthology ID:
2026.findings-acl.1060
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
21099–21110
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1060/
Cite (ACL):
Marie Mikulová, Eva Hajičová, Jiří Mírovský, Anna Nedoluzhko, Michal Novák, Pavlína Synková, Jan Štěpánek, Barbora Štěpánková, and Jan Hajič. 2026. Semantic-pragmatic Annotations in the Prague Dependency Treebank. In Findings of the Association for Computational Linguistics: ACL 2026, pages 21099–21110, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Semantic-pragmatic Annotations in the Prague Dependency Treebank (Mikulová et al., Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1060.pdf
Checklist:
 2026.findings-acl.1060.checklist.pdf