Contemplata, a Free Platform for Constituency Treebank Annotation

Jakub Waszczuk, Ilaine Wang, Jean-Yves Antoine, Anaïs Halftermeyer


Abstract
This paper describes Contemplata, an annotation platform that offers a generic solution for treebank building as well as treebank enrichment with relations between syntactic nodes. Contemplata is dedicated to the annotation of constituency trees. The framework includes support for syntactic parsers, which provide automatic annotations to be manually revised. This balance between automatic parsing and manual revision reduces the annotator workload, which favours data reliability. The paper presents the software architecture of Contemplata, describes its practical use, and finally gives two examples of annotation projects that were conducted on the platform.
Anthology ID:
2020.lrec-1.892
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
7222–7229
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.892
Cite (ACL):
Jakub Waszczuk, Ilaine Wang, Jean-Yves Antoine, and Anaïs Halftermeyer. 2020. Contemplata, a Free Platform for Constituency Treebank Annotation. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 7222–7229, Marseille, France. European Language Resources Association.
Cite (Informal):
Contemplata, a Free Platform for Constituency Treebank Annotation (Waszczuk et al., LREC 2020)
PDF:
https://preview.aclanthology.org/nschneid-patch-5/2020.lrec-1.892.pdf