Kijung Shin
2026
Hybrid-Vector Retrieval for Visually Rich Documents: Combining Single-Vector Efficiency and Multi-Vector Accuracy
Juyeon Kim | Geon Lee | Dongwon Choi | Taeuk Kim | Kijung Shin
Findings of the Association for Computational Linguistics: ACL 2026
Juyeon Kim | Geon Lee | Dongwon Choi | Taeuk Kim | Kijung Shin
Findings of the Association for Computational Linguistics: ACL 2026
Retrieval over visually rich documents is essential for tasks such as legal discovery, scientific search, and enterprise knowledge management. Existing approaches fall into two paradigms: single-vector retrieval, which is efficient but coarse, and multi-vector retrieval, which is accurate but computationally expensive. To address this trade-off, we propose HEAVEN, a plug-and-play two-stage hybrid-vector framework. In the first stage, HEAVEN efficiently retrieves candidate pages using a single-vector method over Visually-Summarized Pages (VS-Pages), which assemble representative visual layouts from multiple pages. In the second stage, it reranks candidates with a multi-vector method while filtering query tokens by linguistic importance to reduce redundant computations. To evaluate retrieval systems under realistic conditions, we also introduce ViMDoc, a benchmark for visually rich, multi-document, and long-document retrieval. Across four benchmarks, HEAVEN attains 99.87% of the Recall@1 performance of multi-vector models on average while reducing per-query computation by 99.82%, achieving efficiency and accuracy. Our code and datasets are available at: https://github.com/juyeonnn/HEAVEN
Pre-Deployment Advertisement Ranking under Data Scarcity via Context-Aware Criteria Generation with VLMs
Kyungho Kim | Yeonje Choi | Gyurim Hwang | Sejin Chung | Hongseok Lee | Myeong Ho Song | Yeongho Kim | Sunwoo Kim | Jongha Lee | Juyeon Kim | Kijung Shin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
Kyungho Kim | Yeonje Choi | Gyurim Hwang | Sejin Chung | Hongseok Lee | Myeong Ho Song | Yeongho Kim | Sunwoo Kim | Jongha Lee | Juyeon Kim | Kijung Shin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
Vision-Language Models (VLMs) perform well on general multimodal tasks, yet applying them to real-world advertisement (ad) evaluation is challenging due to strong brand specificity and limited labeled data. We introduce a new practical task, brand-specific ad ranking, which aims to rank ads for a target brand prior to deployment by modeling brand-specific effectiveness. To this end, we propose ADvisor, which derives explicit brand-aware decision criteria using VLMs, augments limited brand context with ads from similar brands, and applies reflection-based scoring for ranking. Experiments on real-world advertising data from 10 brands, collected through actual ad campaigns, show that ADvisor outperforms strong baselines by up to 7.2%. Further analyses show the generated criteria capture meaningful brand specificity, and ADvisor also performs strongly in online A/B testing. Our code is available at https://github.com/K-Kyungho/ADvisor.
2025
‘Hello, World!’: Making GNNs Talk with LLMs
Sunwoo Kim | Soo Yong Lee | Jaemin Yoo | Kijung Shin
Findings of the Association for Computational Linguistics: EMNLP 2025
Sunwoo Kim | Soo Yong Lee | Jaemin Yoo | Kijung Shin
Findings of the Association for Computational Linguistics: EMNLP 2025
While graph neural networks (GNNs) have shown remarkable performance across diverse graph-related tasks, their high-dimensional hidden representations render them black boxes. In this work, we propose Graph Lingual Network (GLN), a GNN built on large language models (LLMs), with hidden representations in the form of human-readable text. Through careful prompt design, GLN incorporates not only the message passing module of GNNs but also advanced GNN techniques, including graph attention and initial residual connection. The comprehensibility of GLN’s hidden representations enables an intuitive analysis of how node representations change (1) across layers and (2) under advanced GNN techniques, shedding light on the inner workings of GNNs. Furthermore, we demonstrate that GLN achieves strong zero-shot performance on node classification and link prediction, outperforming existing LLM-based baseline methods.