Corpus-Based Multilingual Event-type Ontology: Annotation Tools and Principles

Eva Fučíková, Jan Hajič, Zdeňka Urešová


Abstract
In the course of building SynSemClass, a multilingual Event-type Ontology resource, it was necessary to provide maintainers and annotators with a set of tools that facilitate their work, ensure data format consistency, and in general yield high-quality data. We adapted a previously existing tool (Urešová et al., 2018b) that had been developed to assist in capturing bilingual synonymy. The tool had to be substantially expanded with new features and fundamentally changed to support developing the resource for more languages, work that must necessarily proceed in parallel. We therefore present the tool, the new data structure design, which had to change at the same time, and the associated workflow.
Anthology ID:
2023.tlt-1.1
Volume:
Proceedings of the 21st International Workshop on Treebanks and Linguistic Theories (TLT, GURT/SyntaxFest 2023)
Month:
March
Year:
2023
Address:
Washington, D.C.
Editors:
Daniel Dakota, Kilian Evang, Sandra Kübler, Lori Levin
Venues:
TLT | SyntaxFest
SIG:
SIGPARSE
Publisher:
Association for Computational Linguistics
Pages:
1–10
URL:
https://aclanthology.org/2023.tlt-1.1
Cite (ACL):
Eva Fučíková, Jan Hajič, and Zdeňka Urešová. 2023. Corpus-Based Multilingual Event-type Ontology: Annotation Tools and Principles. In Proceedings of the 21st International Workshop on Treebanks and Linguistic Theories (TLT, GURT/SyntaxFest 2023), pages 1–10, Washington, D.C. Association for Computational Linguistics.
Cite (Informal):
Corpus-Based Multilingual Event-type Ontology: Annotation Tools and Principles (Fučíková et al., TLT-SyntaxFest 2023)
PDF:
https://preview.aclanthology.org/ingest-acl-2023-videos/2023.tlt-1.1.pdf
Video:
https://preview.aclanthology.org/ingest-acl-2023-videos/2023.tlt-1.1.mp4