Sunwoo Kim
2026
Pre-Deployment Advertisement Ranking under Data Scarcity via Context-Aware Criteria Generation with VLMs
Kyungho Kim | Yeonje Choi | Gyurim Hwang | Sejin Chung | Hongseok Lee | Myeong Ho Song | Yeongho Kim | Sunwoo Kim | Jongha Lee | Juyeon Kim | Kijung Shin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Kyungho Kim | Yeonje Choi | Gyurim Hwang | Sejin Chung | Hongseok Lee | Myeong Ho Song | Yeongho Kim | Sunwoo Kim | Jongha Lee | Juyeon Kim | Kijung Shin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Vision-Language Models (VLMs) perform well on general multimodal tasks, yet applying them to real-world advertisement (ad) evaluation is challenging due to strong brand specificity and limited labeled data. We introduce a new practical task, brand-specific ad ranking, which aims to rank ads for a target brand prior to deployment by modeling brand-specific effectiveness. To this end, we propose ADvisor, which derives explicit brand-aware decision criteria using VLMs, augments limited brand context with ads from similar brands, and applies reflection-based scoring for ranking. Experiments on real-world advertising data from 10 brands, collected through actual ad campaigns, show that ADvisor outperforms strong baselines by up to 7.2%. Further analyses show the generated criteria capture meaningful brand specificity, and ADvisor also performs strongly in online A/B testing. Our code is available at https://github.com/K-Kyungho/ADvisor.
2025
PapersPlease: A Benchmark for Evaluating Motivational Values of Large Language Models Based on ERG Theory
Junho Myung | Yeon Su Park | Sunwoo Kim | Shin Yoo | Alice Oh
Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²)
Junho Myung | Yeon Su Park | Sunwoo Kim | Shin Yoo | Alice Oh
Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²)
Evaluating the performance and biases of large language models (LLMs) through role-playing scenarios is becoming increasingly common, as LLMs often exhibit biased behaviors in these contexts. Building on this line of research, we introduce PapersPlease, a benchmark consisting of 3,700 moral dilemmas designed to investigate LLMs’ decision-making in prioritizing various levels of human needs. In our setup, LLMs act as immigration inspectors deciding whether to approve or deny entry based on the short narratives of people. These narratives are constructed using the Existence, Relatedness, and Growth (ERG) theory, which categorizes human needs into three hierarchical levels. Our analysis of six LLMs reveals statistically significant patterns in decision-making, suggesting that LLMs encode implicit preferences. Additionally, our evaluation of the impact of incorporating social identities into the narratives shows varying responsiveness based on both motivational needs and identity cues, with some models exhibiting higher denial rates for marginalized identities. All data is publicly available at https://github.com/yeonsuuuu28/papers-please.
‘Hello, World!’: Making GNNs Talk with LLMs
Sunwoo Kim | Soo Yong Lee | Jaemin Yoo | Kijung Shin
Findings of the Association for Computational Linguistics: EMNLP 2025
Sunwoo Kim | Soo Yong Lee | Jaemin Yoo | Kijung Shin
Findings of the Association for Computational Linguistics: EMNLP 2025
While graph neural networks (GNNs) have shown remarkable performance across diverse graph-related tasks, their high-dimensional hidden representations render them black boxes. In this work, we propose Graph Lingual Network (GLN), a GNN built on large language models (LLMs), with hidden representations in the form of human-readable text. Through careful prompt design, GLN incorporates not only the message passing module of GNNs but also advanced GNN techniques, including graph attention and initial residual connection. The comprehensibility of GLN’s hidden representations enables an intuitive analysis of how node representations change (1) across layers and (2) under advanced GNN techniques, shedding light on the inner workings of GNNs. Furthermore, we demonstrate that GLN achieves strong zero-shot performance on node classification and link prediction, outperforming existing LLM-based baseline methods.