Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System

Ari Klein, Abeed Sarker, Masoud Rouhizadeh, Karen O’Connor, Graciela Gonzalez


Abstract
Social media sites (e.g., Twitter) have been used for surveillance of drug safety at the population level, but studies that focus on the effects of medications on specific sets of individuals have had to rely on other sources of data. Mining social media data for this in-formation would require the ability to distinguish indications of personal medication in-take in this media. Towards that end, this paper presents an annotated corpus that can be used to train machine learning systems to determine whether a tweet that mentions a medication indicates that the individual posting has taken that medication at a specific time. To demonstrate the utility of the corpus as a training set, we present baseline results of supervised classification.
Anthology ID:
W17-2316
Volume:
BioNLP 2017
Month:
August
Year:
2017
Address:
Vancouver, Canada,
Editors:
Kevin Bretonnel Cohen, Dina Demner-Fushman, Sophia Ananiadou, Junichi Tsujii
Venue:
BioNLP
SIG:
SIGBIOMED
Publisher:
Association for Computational Linguistics
Note:
Pages:
136–142
Language:
URL:
https://aclanthology.org/W17-2316
DOI:
10.18653/v1/W17-2316
Bibkey:
Cite (ACL):
Ari Klein, Abeed Sarker, Masoud Rouhizadeh, Karen O’Connor, and Graciela Gonzalez. 2017. Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System. In BioNLP 2017, pages 136–142, Vancouver, Canada,. Association for Computational Linguistics.
Cite (Informal):
Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System (Klein et al., BioNLP 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-5/W17-2316.pdf