Abstract
This paper reports the annotation of a Brazilian Portuguese Treebank with semantic role labels following Propbank guidelines. A different language and a different parser output affect the task and require decisions on how to annotate the corpus. Therefore, a new annotation guide, called Propbank-Br, has been created to deal with language-specific phenomena and parser problems. In this phase of the project, the corpus was annotated by a single linguist. The annotation task reported here is part of a larger project for the Brazilian Portuguese language. This project aims to build frame files for Brazilian Portuguese verbs and to produce a broader, distributed annotation of semantic role labels in Brazilian Portuguese, allowing inter-annotator agreement measures. The corpus, available on the web, is already being used to build a semantic tagger for Portuguese.
- Anthology ID:
- L12-1114
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 1862–1867
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/272_Paper.pdf
- Cite (ACL):
- Magali Sanches Duran and Sandra Maria Aluísio. 2012. Propbank-Br: a Brazilian Treebank annotated with semantic role labels. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1862–1867, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- Propbank-Br: a Brazilian Treebank annotated with semantic role labels (Duran & Aluísio, LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/272_Paper.pdf