Corpora and Baselines for Humour Recognition in Portuguese

Hugo Gonçalo Oliveira, André Clemêncio, Ana Alves


Abstract
Having in mind the lack of work on the automatic recognition of verbal humour in Portuguese, a topic connected with fluency in a natural language, we describe the creation of three corpora, covering two styles of humour and four sources of non-humorous text, that may be used for related studies. We then report on some experiments where the created corpora were used for training and testing computational models that exploit content and linguistic features for humour recognition. The obtained results helped us taking some conclusions about this challenge and may be seen as baselines for those willing to tackle it in the future, using the same corpora.
Anthology ID:
2020.lrec-1.160
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1278–1285
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.160
DOI:
Bibkey:
Cite (ACL):
Hugo Gonçalo Oliveira, André Clemêncio, and Ana Alves. 2020. Corpora and Baselines for Humour Recognition in Portuguese. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 1278–1285, Marseille, France. European Language Resources Association.
Cite (Informal):
Corpora and Baselines for Humour Recognition in Portuguese (Gonçalo Oliveira et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.lrec-1.160.pdf