Abstract
We introduce a large corpus of comments extracted from an Italian online incel (‘involuntary celibate’) forum, a community of men who build a collective identity and anti-feminist ideology centered around their inability to find a sexual or romantic partner and who frequently use explicitly misogynistic language. Our corpus consists of 2.4K comments that have been manually collected, analyzed and annotated with topic labels, and a further 32K threads (300K comments) that have been automatically scraped and automatically annotated with FrameNet annotations. We show how large-scale frame semantic analysis can shed light on what is discussed in the community, and introduce incel topic classification as a new NLP task and benchmark.
- Anthology ID:
- 2024.rfp-1.4
- Volume:
- Proceedings of the First Workshop on Reference, Framing, and Perspective @ LREC-COLING 2024
- Month:
- May
- Year:
- 2024
- Address:
- Torino, Italia
- Editors:
- Pia Sommerauer, Tommaso Caselli, Malvina Nissim, Levi Remijnse, Piek Vossen
- Venues:
- rfp | WS
- Publisher:
- ELRA and ICCL
- Pages:
- 28–39
- URL:
- https://aclanthology.org/2024.rfp-1.4
- Cite (ACL):
- Sara Gemelli and Gosse Minnema. 2024. Manosphrames: exploring an Italian incel community through the lens of NLP and Frame Semantics. In Proceedings of the First Workshop on Reference, Framing, and Perspective @ LREC-COLING 2024, pages 28–39, Torino, Italia. ELRA and ICCL.
- Cite (Informal):
- Manosphrames: exploring an Italian incel community through the lens of NLP and Frame Semantics (Gemelli & Minnema, rfp-WS 2024)
- PDF:
- https://preview.aclanthology.org/naacl24-info/2024.rfp-1.4.pdf