YongTaek Lim
2026
STAR-Teaming: A Strategy-Response Multiplex Network Approach to Automated LLM Red Teaming
Min Jae Jung | YongTaek Lim | Chaeyun Kim | Junghwan Kim | Kihyun Kim | Minwoo Kim
Findings of the Association for Computational Linguistics: ACL 2026
Min Jae Jung | YongTaek Lim | Chaeyun Kim | Junghwan Kim | Kihyun Kim | Minwoo Kim
Findings of the Association for Computational Linguistics: ACL 2026
While Large Language Models (LLMs) are widely used, they remain susceptible to jailbreak prompts that can elicit harmful or inappropriate responses. This paper introduces STAR-Teaming, a novel black-box framework for automated red teaming that effectively generates such prompts. STAR-Teaming integrates a Multi-Agent System (MAS) with a Strategy-Response Multiplex Network and employs network-driven optimization to sample effective attack strategies. This network-based approach recasts the intractable high-dimensional embedding space into a tractable structure, yielding two key advantages: it enhances the interpretability of the LLM’s strategic vulnerabilities, and it streamlines the search for effective strategies by organizing the search space into semantic communities, thereby preventing redundant exploration. Empirical results demonstrate that STAR-Teaming significantly surpasses existing methods, achieving a higher attack success rate (ASR) at a lower computational cost. Extensive experiments validate the effectiveness and explainability of the Multiplex Network. The code is available at https://github.com/selectstar-ai/STAR-Teaming-paper.
2025
TIDES: Technical Information Discovery and Extraction System
Jihee Kim | Subeen Park | Hakyung Lee | YongTaek Lim | Hyo-won Suh | Kyungwoo Song
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Jihee Kim | Subeen Park | Hakyung Lee | YongTaek Lim | Hyo-won Suh | Kyungwoo Song
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Addressing the challenges in QA for specific technical domains requires identifying relevant portions of extensive documents and generating answers based on this focused content. Traditional pre-trained LLMs often struggle with domain-specific terminology, while fine-tuned LLMs demand substantial computational resources. To overcome these limitations, we propose TIDES, Technical Information Distillation and Extraction System. TIDES is a training-free approach that combines traditional TF-IDF techniques with prompt-based LLMs in a hybrid process, effectively addressing complex technical questions. It uses TF-IDF to identify and prioritize domain-specific words that are rare in other documents and LLMs to refine the candidate pool by focusing on the most relevant segments in documents through multiple stages. Our approach improves the precision and efficiency of QA systems in technical contexts without LLM retraining.