André Clemêncio
2020
Corpora and Baselines for Humour Recognition in Portuguese
Hugo Gonçalo Oliveira | André Clemêncio | Ana Alves
Proceedings of the Twelfth Language Resources and Evaluation Conference
Given the lack of work on the automatic recognition of verbal humour in Portuguese, a topic connected with fluency in a natural language, we describe the creation of three corpora, covering two styles of humour and four sources of non-humorous text, that may be used for related studies. We then report on experiments in which the created corpora were used for training and testing computational models that exploit content and linguistic features for humour recognition. The results allowed us to draw some conclusions about this challenge and may serve as baselines for those who tackle it in the future using the same corpora.