Semantic-pragmatic Annotations in the Prague Dependency Treebank
Marie Mikulov\'a, Eva Hajicova, Ji\v{r}{\'\i} M{\'\i}rovsk\'y, Anna Nedoluzhko, Michal Nov\'ak, Pavl{\'\i}na Synkov\'a, Jan \v{S}t\v{e}p\'anek, Barbora \v{S}t\v{e}p\'ankov\'a, Jan Haji\v{c}
Abstract
We present semantic-pragmatic specification and annotation (ellipsis, coreference, bridging and discourse relations, information structure, scope of negation) in the multi-layer, genre-diversified, 3+ million-token Prague Dependency Treebank – Consolidated 2. 0. While morphology and syntax work almost exclusively on sentence level, the semantic-pragmatic phenomena are often related to two or more neighbouring sentences and possibly to an extra-linguistic context. In the contribution, we describe these phenomena from both the linguistic perspective (form of expression, relation to syntax and morphology) and the cognitive perspective (relation to context, real world knowledge, as well as to the related processes such as thinking or reasoning) – classifying the possible relations between the semantic-pragmatic units into cognitively plausible, distinguishable, and human-understandable categories. We have applied our results to the corpus, by annotating it in its entirety. The resulting dataset is publicly and freely available, to serve for verification and further investigation of (not only) these phenomena.- Anthology ID:
- 2026.findings-acl.1060
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 21099–21110
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1060/
- DOI:
- Cite (ACL):
- Marie Mikulov\'a, Eva Hajicova, Ji\v{r}{\'\i} M{\'\i}rovsk\'y, Anna Nedoluzhko, Michal Nov\'ak, Pavl{\'\i}na Synkov\'a, Jan \v{S}t\v{e}p\'anek, Barbora \v{S}t\v{e}p\'ankov\'a, and Jan Haji\v{c}. 2026. Semantic-pragmatic Annotations in the Prague Dependency Treebank. In Findings of the Association for Computational Linguistics: ACL 2026, pages 21099–21110, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Semantic-pragmatic Annotations in the Prague Dependency Treebank (Mikulov'a et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1060.pdf