Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System
Ari Klein, Abeed Sarker, Masoud Rouhizadeh, Karen O’Connor, Graciela Gonzalez
Abstract
Social media sites (e.g., Twitter) have been used for surveillance of drug safety at the population level, but studies that focus on the effects of medications on specific sets of individuals have had to rely on other sources of data. Mining social media data for this in-formation would require the ability to distinguish indications of personal medication in-take in this media. Towards that end, this paper presents an annotated corpus that can be used to train machine learning systems to determine whether a tweet that mentions a medication indicates that the individual posting has taken that medication at a specific time. To demonstrate the utility of the corpus as a training set, we present baseline results of supervised classification.- Anthology ID:
- W17-2316
- Volume:
- BioNLP 2017
- Month:
- August
- Year:
- 2017
- Address:
- Vancouver, Canada,
- Editors:
- Kevin Bretonnel Cohen, Dina Demner-Fushman, Sophia Ananiadou, Junichi Tsujii
- Venue:
- BioNLP
- SIG:
- SIGBIOMED
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 136–142
- Language:
- URL:
- https://aclanthology.org/W17-2316
- DOI:
- 10.18653/v1/W17-2316
- Cite (ACL):
- Ari Klein, Abeed Sarker, Masoud Rouhizadeh, Karen O’Connor, and Graciela Gonzalez. 2017. Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System. In BioNLP 2017, pages 136–142, Vancouver, Canada,. Association for Computational Linguistics.
- Cite (Informal):
- Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System (Klein et al., BioNLP 2017)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/W17-2316.pdf