Culture Matters in Toxic Language Detection in Persian

Zahra Bokaei, Walid Magdy, Bonnie Webber


Abstract
Toxic language detection is crucial for creating safer online environments and limiting the spread of harmful content. While toxic language detection has been under-explored in Persian, the current work compares different methods for this task, including fine-tuning, data enrichment, zero-shot and few-shot learning, and cross-lingual transfer learning. What is especially compelling is the impact of cultural context on transfer learning for this task: We show that the language of a country with cultural similarities to Persian yields better results in transfer learning. Conversely, the improvement is lower when the language comes from a culturally distinct country.
Anthology ID:
2025.acl-long.456
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9290–9304
Language:
URL:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-long.456/
DOI:
Bibkey:
Cite (ACL):
Zahra Bokaei, Walid Magdy, and Bonnie Webber. 2025. Culture Matters in Toxic Language Detection in Persian. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9290–9304, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Culture Matters in Toxic Language Detection in Persian (Bokaei et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-long.456.pdf