Generating Ethnographic Models from Communities’ Online Data

Tomek Strzalkowski, Anna Newheiser, Nathan Kemper, Ning Sa, Bharvee Acharya, Gregorios Katsios


Abstract
In this paper we describe computational ethnography study to demonstrate how machine learning techniques can be utilized to exploit bias resident in language data produced by communities with online presence. Specifically, we leverage the use of figurative language (i.e., the choice of metaphors) in online text (e.g., news media, blogs) produced by distinct communities to obtain models of community worldviews that can be shown to be distinctly biased and thus different from other communities’ models. We automatically construct metaphor-based community models for two distinct scenarios: gun rights and marriage equality. We then conduct a series of experiments to validate the hypothesis that the metaphors found in each community’s online language convey the bias in the community’s worldview.
Anthology ID:
2020.figlang-1.23
Volume:
Proceedings of the Second Workshop on Figurative Language Processing
Month:
July
Year:
2020
Address:
Online
Venue:
Fig-Lang
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
165–175
Language:
URL:
https://aclanthology.org/2020.figlang-1.23
DOI:
10.18653/v1/2020.figlang-1.23
Bibkey:
Cite (ACL):
Tomek Strzalkowski, Anna Newheiser, Nathan Kemper, Ning Sa, Bharvee Acharya, and Gregorios Katsios. 2020. Generating Ethnographic Models from Communities’ Online Data. In Proceedings of the Second Workshop on Figurative Language Processing, pages 165–175, Online. Association for Computational Linguistics.
Cite (Informal):
Generating Ethnographic Models from Communities’ Online Data (Strzalkowski et al., Fig-Lang 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.figlang-1.23.pdf
Software:
 2020.figlang-1.23.Software.zip
Dataset:
 2020.figlang-1.23.Dataset.pdf
Video:
 http://slideslive.com/38929711