Sanghoun Song
2026
KOCOH: Korean Context-Dependent Hate Speech Dataset
Eunah Park | Sanghoun Song
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Eunah Park | Sanghoun Song
Proceedings of the Fifteenth Language Resources and Evaluation Conference
We introduce the KOrean COntext-dependent Hate speech dataset (KOCOH) to evaluate large language models’ ability to detect context-dependent hate speech in Korean. KOCOH consists of 3,000 context-comment pairs collected from Korean online communities (Dcinside, FMkorea) with detailed annotations, including labels for hate speech and hate target groups. We assess the context-dependent hate speech detection capabilities of both humans and 11 state-of-the-art large language models, including GPT-5, Claude Sonnet 4, and Gemini 2.5 Flash. Our results show that humans outperform language models, with GPT-5 achieving the highest performance among the evaluated models. While humans demonstrate balanced recall and specificity, language models generally show significantly higher specificity compared to recall. The performance of both humans and models is affected by factors such as Honam-related vocabulary and sentiment polarity. This study contributes resources to Korean hate speech research and empirically demonstrates the performance gap between humans and language models. Through both quantitative and qualitative analyses, we explore the similarities and differences between humans and language models, offering insights for future developments in language models and AI ethics research. KOCOH is available at https://github.com/eparkatgithub/KOCOH.
2025
Assessing GPT models’ Sensitivity to Epistemic Meanings in Korean Periphrastic Construction
Yebin Lee | Sanghoun Song | Arum Kang
Proceedings of the 39th Pacific Asia Conference on Language, Information and Computation
Yebin Lee | Sanghoun Song | Arum Kang
Proceedings of the 39th Pacific Asia Conference on Language, Information and Computation
2020
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Minh Le Nguyen | Mai Chi Luong | Sanghoun Song
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Minh Le Nguyen | Mai Chi Luong | Sanghoun Song
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Plausibility and Well-formedness Acceptability Test on Deep Neural Nativeness Classification
Kwonsik Park | Sanghoun Song
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Kwonsik Park | Sanghoun Song
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
2015
Building an HPSG-based Indonesian Resource Grammar (INDRA)
David Moeljadi | Francis Bond | Sanghoun Song
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
David Moeljadi | Francis Bond | Sanghoun Song
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
Representing Honorifics via Individual Constraints
Sanghoun Song
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
Sanghoun Song
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
An HPSG-based Shared-Grammar for the Chinese Languages: ZHONG [|]
Zhenzhen Fan | Sanghoun Song | Francis Bond
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
Zhenzhen Fan | Sanghoun Song | Francis Bond
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
2012
Calculating Selectional Preferences of Transitive Verbs in Korean
Sanghoun Song | Jae-Woong Choe
Proceedings of the 26th Pacific Asia Conference on Language, Information, and Computation
Sanghoun Song | Jae-Woong Choe
Proceedings of the 26th Pacific Asia Conference on Language, Information, and Computation
2010
Development of the Korean Resource Grammar: Towards Grammar Customization
Sanghoun Song | Jong-Bok Kim | Francis Bond | Jaehyung Yang
Proceedings of the Eighth Workshop on Asian Language Resouces
Sanghoun Song | Jong-Bok Kim | Francis Bond | Jaehyung Yang
Proceedings of the Eighth Workshop on Asian Language Resouces
A Computational Treatment of Korean Serial Verb Constructions
Jong-Bok Kim | Jaehyung Yang | Sanghoun Song
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation
Jong-Bok Kim | Jaehyung Yang | Sanghoun Song
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation
2009
Online Search Interface for the Sejong Korean-Japanese Bilingual Corpus and Auto-interpolation of Phrase Alignment
Sanghoun Song | Francis Bond
Proceedings of the Third Linguistic Annotation Workshop (LAW III)
Sanghoun Song | Francis Bond
Proceedings of the Third Linguistic Annotation Workshop (LAW III)
2008
The Relationship between Semantic Similarity and Subcategorization Frames in English: A Stochastic Test Using ICE-GB and WordNet
Sanghoun Song | Jae-Woong Choe
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
Sanghoun Song | Jae-Woong Choe
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation