Contemplata, a Free Platform for Constituency Treebank Annotation
Jakub Waszczuk, Ilaine Wang, Jean-Yves Antoine, Anaïs Halftermeyer
Abstract
This paper describes Contemplata, an annotation platform that offers a generic solution for treebank building as well as treebank enrichment with relations between syntactic nodes. Contemplata is dedicated to the annotation of constituency trees. The framework includes support for syntactic parsers, which provide automatic annotations to be manually revised. The balanced strategy of annotation between automatic parsing and manual revision allows to reduce the annotator workload, which favours data reliability. The paper presents the software architecture of Contemplata, describes its practical use and eventually gives two examples of annotation projects that were conducted on the platform.- Anthology ID:
- 2020.lrec-1.892
- Volume:
- Proceedings of the Twelfth Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Editors:
- Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 7222–7229
- Language:
- English
- URL:
- https://aclanthology.org/2020.lrec-1.892
- DOI:
- Cite (ACL):
- Jakub Waszczuk, Ilaine Wang, Jean-Yves Antoine, and Anaïs Halftermeyer. 2020. Contemplata, a Free Platform for Constituency Treebank Annotation. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 7222–7229, Marseille, France. European Language Resources Association.
- Cite (Informal):
- Contemplata, a Free Platform for Constituency Treebank Annotation (Waszczuk et al., LREC 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/2020.lrec-1.892.pdf