Abstract
This paper presents some work on direct and indirect speech in Portuguese using corpus-based methods: we report on a study whose aim was to identify (i) Portuguese verbs used to introduce reported speech and (ii) syntactic patterns used to convey reported speech, in order to enhance the performance of a quotation extraction system, dubbed QUEMDISSE?. In addition, (iii) we present a Portuguese corpus annotated with reported speech, using the lexicon and rules provided by (i) and (ii), and discuss the process of their annotation and what was learned.- Anthology ID:
- L16-1698
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 4410–4416
- Language:
- URL:
- https://aclanthology.org/L16-1698
- DOI:
- Cite (ACL):
- Cláudia Freitas, Bianca Freitas, and Diana Santos. 2016. QUEMDISSE? Reported speech in Portuguese. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4410–4416, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- QUEMDISSE? Reported speech in Portuguese (Freitas et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/L16-1698.pdf