TreEn: A Multilingual Treebank Project on Environmental Discourse
Adriana Silvina Pagano, Patricia Chiril, Elisa Chierchiello, Cristina Bosco
Abstract
The increasing complexity of environmental discourse is directly proportional to the growing complexity of environmental debates present today in all communication media. While linguistic and communication studies have been pursued on this discourse, the development of computational linguistic tools and resources dedicated to support its analysis and interpretation is still very incipient. For one, no morphosyntactic resources specific to the environmental domain can be found on major platforms and repositories. This paper introduces TreEn, a multilingual treebank project in progress which compiles texts on environmental discourse produced in different conversational and communication contexts. In particular, it reports on the parallel component of the project and discusses issues faced during sentence-level alignment between original and translated texts, annotation of texts following UD guidelines, and labeling entities drawing on an ontology of environmental-related topics. This novel resource is expected to support environmental discourse analysis by providing morphological and syntactical data to enable cross-language and cross-cultural comparison based on the semantics of the entities annotated in the treebank.- Anthology ID:
- 2025.udw-1.9
- Volume:
- Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
- Month:
- August
- Year:
- 2025
- Address:
- Ljubljana, Slovenia
- Editors:
- Gosse Bomma, Çağrı Çöltekin
- Venues:
- UDW | WS | SyntaxFest
- SIG:
- SIGPARSE
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 80–96
- Language:
- URL:
- https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.9/
- DOI:
- Cite (ACL):
- Adriana Silvina Pagano, Patricia Chiril, Elisa Chierchiello, and Cristina Bosco. 2025. TreEn: A Multilingual Treebank Project on Environmental Discourse. In Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 80–96, Ljubljana, Slovenia. Association for Computational Linguistics.
- Cite (Informal):
- TreEn: A Multilingual Treebank Project on Environmental Discourse (Pagano et al., UDW-SyntaxFest 2025)
- PDF:
- https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.9.pdf