Exploring the Relationship Between Intrinsic Stigma in Masked Language Models and Training Data Using the Stereotype Content Model

Mario Mina, Júlia Falcão, Aitor Gonzalez-Agirre


Abstract
Much work has gone into developing language models of increasing size, but only recently have we begun to examine them for pernicious behaviour that could lead to harming marginalised groups. Following Lin et al. (2022) in rooting our work in psychological research, we prompt two masked language models (MLMs) of different specialisations in English and Spanish with statements from a questionnaire developed to measure stigma to determine if they treat physical and mental illnesses equally. In both models we find a statistically significant difference in the treatment of physical and mental illnesses across most if not all latent constructs as measured by the questionnaire, and thus they are more likely to associate mental illnesses with stigma. We then examine their training data or data retrieved from the same domain using a computational implementation of the Stereotype Content Model (SCM) (Fiske et al., 2002; Fraser et al., 2021) to interpret the questionnaire results based on the SCM values as reflected in the data. We observe that model behaviour can largely be explained by the distribution of the mentions of illnesses according to their SCM values.
Anthology ID:
2024.rapid-1.7
Volume:
Proceedings of the Fifth Workshop on Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive/psychiatric/developmental impairments @LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Dimitrios Kokkinakis, Kathleen C. Fraser, Charalambos K. Themistocleous, Kristina Lundholm Fors, Athanasios Tsanas, Fredrik Ohman
Venues:
RaPID | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
54–67
Language:
URL:
https://preview.aclanthology.org/icon-24-ingestion/2024.rapid-1.7/
DOI:
Bibkey:
Cite (ACL):
Mario Mina, Júlia Falcão, and Aitor Gonzalez-Agirre. 2024. Exploring the Relationship Between Intrinsic Stigma in Masked Language Models and Training Data Using the Stereotype Content Model. In Proceedings of the Fifth Workshop on Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive/psychiatric/developmental impairments @LREC-COLING 2024, pages 54–67, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Exploring the Relationship Between Intrinsic Stigma in Masked Language Models and Training Data Using the Stereotype Content Model (Mina et al., RaPID 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/icon-24-ingestion/2024.rapid-1.7.pdf