Yuting Huang
2026
"I Don’t Know What to Say": A Fact-Filling Questionnaire Method to Help Non-Experts Talk to LegalAI Assistant
Yuting Huang | Yiquan Wu | Meitong Guo | Ang Li | Xiaozhong Liu | Keting Yin | Fei Wu | Kun Kuang
Findings of the Association for Computational Linguistics: ACL 2026
Yuting Huang | Yiquan Wu | Meitong Guo | Ang Li | Xiaozhong Liu | Keting Yin | Fei Wu | Kun Kuang
Findings of the Association for Computational Linguistics: ACL 2026
Artificial intelligence has become increasingly prevalent in the legal domain. However, LegalAI systems often struggle with vague user queries that lack essential legal details, leading to suboptimal performance in practical applications. To address this challenge, we propose FactFiller, a novel approach that dynamically generates questionnaires to help users refine their input queries. Our method leverages an iterative training process that collects valuable questionnaires, eliminating the need for human annotation. Additionally, we introduce a "case-law-quiz” cascading retrieval process, ensuring that the generated questions and answer options are directly linked to specific legal provisions. Through the user study and the downstream task experiments, we demonstrate that FactFiller, while remaining easy for non-experts to understand, not only improves the completeness of queries but also ensures the performance of various domain-specific models in downstream legal tasks.
SplitThenMerge: Token-Level Skill-Compositional Sparse Mixture-of-Experts for Complex Domain-Specific Tasks
Yuting Huang | Jiawen Zhang | Yiquan Wu | Yinghao Hu | Fei Wu | Kun Kuang
Findings of the Association for Computational Linguistics: ACL 2026
Yuting Huang | Jiawen Zhang | Yiquan Wu | Yinghao Hu | Fei Wu | Kun Kuang
Findings of the Association for Computational Linguistics: ACL 2026
Large language models have demonstrated strong performance on general-purpose tasks but often fail to satisfy the accuracy requirements of knowledge-intensive domains such as law, medicine, and finance. Complex domain-specific generation is inherently compositional, involving multiple atomic skills such as reasoning, knowledge grounding, and numerical computation that are frequently interleaved at the token level. Existing domain adaptation methods typically train these heterogeneous skills jointly within a single objective, which makes it difficult for models to reliably coordinate multiple skills when solving complex tasks. In this work, we explicitly incorporate atomic skills into domain-specific model training and propose SplitThenMerge, a framework that decomposes domain competence into atomic skills, trains them independently, and composes them dynamically during generation. SplitThenMerge adopts a token-level sparse Mixture-of-Experts architecture to enable fine-grained skill routing and coordination while implementing each skill as a lightweight LoRA expert to achieve parameter-efficient specialization. Experimental results demonstrate that our method consistently achieves superior performance in both legal and medical domains under the same training parameter budget.
2025
Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction
Yuting Huang | Chengyuan Liu | Yifeng Feng | Yiquan Wu | Chao Wu | Fei Wu | Kun Kuang
Findings of the Association for Computational Linguistics: ACL 2025
Yuting Huang | Chengyuan Liu | Yifeng Feng | Yiquan Wu | Chao Wu | Fei Wu | Kun Kuang
Findings of the Association for Computational Linguistics: ACL 2025
As Large Language Models (LLMs) are widely applied in various domains, the safety of LLMs is increasingly attracting attention to avoid their powerful capabilities being misused. Existing jailbreak methods create a forced instruction-following scenario, or search adversarial prompts with prefix or suffix tokens to achieve a specific representation manually or automatically. However, they suffer from low efficiency and explicit jailbreak patterns, far from the real deployment of mass attacks to LLMs. In this paper, we point out that simply rewriting the original instruction can achieve a jailbreak, and we find that this rewriting approach is learnable and transferable. We propose the **R**ewrite to **J**ailbreak (R2J) approach, a transferable black-box jailbreak method to attack LLMs by iteratively exploring the weakness of the LLMs and automatically improving the attacking strategy. The jailbreak is more efficient and hard to identify since no additional features are introduced. Extensive experiments and analysis demonstrate the effectiveness of R2J, and we find that the jailbreak is also transferable to multiple datasets and various types of models with only a few queries. We hope our work motivates further investigation of LLM safety. The code can be found at [https://github.com/ythuang02/R2J/.](https://github.com/ythuang02/R2J/)