McMaster at LeWiDi-2025: Demographic-Aware RoBERTa
Mandira Sawkar, Samay U. Shetty, Deepak Pandita, Tharindu Cyril Weerasooriya, Christopher M. Homan
Abstract
We present our submission to the Learning With Disagreements (LeWiDi) 2025 shared task. Our team implemented a variety of BERT-based models that encode annotator meta-data in combination with text to predict soft-label distributions and individual annotator labels. We show across four tasks that a combination of demographic factors leads to improved performance, however through ablations across all demographic variables we find that in some cases, a single variable performs best. Our approach placed 4th in the overall competition.- Anthology ID:
- 2025.nlperspectives-1.18
- Volume:
- Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Gavin Abercrombie, Valerio Basile, Simona Frenda, Sara Tonelli, Shiran Dudy
- Venues:
- NLPerspectives | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 208–218
- Language:
- URL:
- https://preview.aclanthology.org/ingest-emnlp/2025.nlperspectives-1.18/
- DOI:
- Cite (ACL):
- Mandira Sawkar, Samay U. Shetty, Deepak Pandita, Tharindu Cyril Weerasooriya, and Christopher M. Homan. 2025. McMaster at LeWiDi-2025: Demographic-Aware RoBERTa. In Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP, pages 208–218, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- McMaster at LeWiDi-2025: Demographic-Aware RoBERTa (Sawkar et al., NLPerspectives 2025)
- PDF:
- https://preview.aclanthology.org/ingest-emnlp/2025.nlperspectives-1.18.pdf