Cecilia Domingo


2022

pdf bib
Discourse annotation — Towards a dialogue system for pair programming
Cecilia Domingo | Paul Piwek | Svetlana Stoyanchev | Michel Wermelinger
Traitement Automatique des Langues, Volume 63, Numéro 3 : Etats de l'art en TAL [Review articles in NLP]

2021

pdf
What is on Social Media that is not in WordNet? A Preliminary Analysis on the TwitterAAE Corpus
Cecilia Domingo | Tatiana Gonzalez-Ferrero | Itziar Gonzalez-Dios
Proceedings of the 11th Global Wordnet Conference

Natural Language Processing tools and resources have been so far mainly created and trained for standard varieties of language. Nowadays, with the use of large amounts of data gathered from social media, other varieties and registers need to be processed, which may present other challenges and difficulties. In this work, we focus on English and we present a preliminary analysis by comparing the TwitterAAE corpus, which is annotated for ethnicity, and WordNet by quantifying and explaining the online language that WordNet misses.