Abstract
This paper presents CINTIL-QATreebank, a treebank composed of Portuguese sentences that can be used to support the development of Question Answering systems. To create this treebank, we take declarative sentences from the pre-existing CINTIL-Treebank and manually transform their syntactic structure into that of a non-declarative sentence. Our corpus includes two clause types: interrogative and imperative clauses. CINTIL-QATreebank can be used in general language science and technology research, but it was developed particularly to support automatic Question Answering systems. The non-declarative sentences are annotated with several layers of linguistic information, namely (i) trees with information on constituency and grammatical function; (ii) sentence type; (iii) interrogative pronoun; (iv) question type; and (v) semantic type of expected answer. Moreover, these non-declarative sentences are paired with their declarative counterparts and associated with the expected answer snippets.
- Anthology ID:
- L12-1244
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 1895–1901
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/460_Paper.pdf
- Cite (ACL):
- Patrícia Gonçalves, Rita Santos, and António Branco. 2012. Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1895–1901, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese (Gonçalves et al., LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/460_Paper.pdf