Trigger Warnings Are Grounded in a Shared Vocabulary: A Corpus Analysis with User-Generated Labels

Sebastian Heineking, Matti Wiegmann, Magdalena Wolska, Benno Stein, Martin Potthast


Abstract
Trigger warnings advise of potentially disturbing content. On that note: This document discusses abuse. But can we trust trigger warnings? For a warning to be credible, independent authors must have a shared understanding of the type of content that advises caution. We investigate for the first time whether trigger warnings are aligned with the vocabulary of texts written by uncoordinated authors. To quantify the lexical alignment of trigger warnings, we conduct a series of statistical tests on the texts of fan fiction authors who used warnings relating to emotional, physical, or sexual abuse. We find that the vocabulary of texts with these warnings is aligned with a curated dictionary of terms related to abuse. However, a high frequency of a term in texts with a warning does not necessarily indicate a semantic relation.
Anthology ID:
2026.lrec-main.565
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
7106–7125
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.565/
DOI:
Bibkey:
Cite (ACL):
Sebastian Heineking, Matti Wiegmann, Magdalena Wolska, Benno Stein, and Martin Potthast. 2026. Trigger Warnings Are Grounded in a Shared Vocabulary: A Corpus Analysis with User-Generated Labels. International Conference on Language Resources and Evaluation, main:7106–7125.
Cite (Informal):
Trigger Warnings Are Grounded in a Shared Vocabulary: A Corpus Analysis with User-Generated Labels (Heineking et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.565.pdf