Nicolas Foucault
2016
Automatic Classification of Tweets for Analyzing Communication Behavior of Museums
Nicolas Foucault
|
Antoine Courtin
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
In this paper, we present a study on tweet classification which aims to define the communication behavior of the 103 French museums that participated in 2014 in the Twitter operation: MuseumWeek. The tweets were automatically classified in four communication categories: sharing experience, promoting participation, interacting with the community, and promoting-informing about the institution. Our classification is multi-class. It combines Support Vector Machines and Naive Bayes methods and is supported by a selection of eighteen subtypes of features of four different kinds: metadata information, punctuation marks, tweet-specific and lexical features. It was tested against a corpus of 1,095 tweets manually annotated by two experts in Natural Language Processing and Information Communication and twelve Community Managers of French museums. We obtained an state-of-the-art result of F1-score of 72% by 10-fold cross-validation. This result is very encouraging since is even better than some state-of-the-art results found in the tweet classification literature.
2013
Web pages segmentation for document selection in Question Answering (Pré-segmentation de pages web et sélection de documents pertinents en Questions-Réponses) [in French]
Nicolas Foucault
|
Sophie Rosset
|
Gilles Adda
Proceedings of TALN 2013 (Volume 1: Long Papers)
2011
Language Modeling for Document Selection in Question Answering
Nicolas Foucault
|
Gilles Adda
|
Sophie Rosset
Proceedings of the International Conference Recent Advances in Natural Language Processing 2011
Search