KOAS: Korean Text Offensiveness Analysis System
San-Hee Park, Kang-Min Kim, Seonhee Cho, Jun-Hyung Park, Hyuntae Park, Hyuna Kim, Seongwon Chung, SangKeun Lee
Abstract
Warning: This manuscript contains a certain level of offensive expression. As communication through social media platforms has grown immensely, the increasing prevalence of offensive language online has become a critical problem. Notably in Korea, one of the countries with the highest Internet usage, automatic detection of offensive expressions has recently been brought to attention. However, morphological richness and complex syntax of Korean causes difficulties in neural model training. Furthermore, most of previous studies mainly focus on the detection of abusive language, disregarding implicit offensiveness and underestimating a different degree of intensity. To tackle these problems, we present KOAS, a system that fully exploits both contextual and linguistic features and estimates an offensiveness score for a text. We carefully designed KOAS with a multi-task learning framework and constructed a Korean dataset for offensive analysis from various domains. Refer for a detailed demonstration.- Anthology ID:
- 2021.emnlp-demo.9
- Volume:
- Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
- Month:
- November
- Year:
- 2021
- Address:
- Online and Punta Cana, Dominican Republic
- Editors:
- Heike Adel, Shuming Shi
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 72–78
- Language:
- URL:
- https://aclanthology.org/2021.emnlp-demo.9
- DOI:
- 10.18653/v1/2021.emnlp-demo.9
- Cite (ACL):
- San-Hee Park, Kang-Min Kim, Seonhee Cho, Jun-Hyung Park, Hyuntae Park, Hyuna Kim, Seongwon Chung, and SangKeun Lee. 2021. KOAS: Korean Text Offensiveness Analysis System. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 72–78, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Cite (Informal):
- KOAS: Korean Text Offensiveness Analysis System (Park et al., EMNLP 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2021.emnlp-demo.9.pdf