Sicheng Zhou
2026
SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection
Zhiyong Cao | Dunqiang Liu | Qi Dai | Haojun Xu | Huai Yuen Khor | Hao Wang | Huan He | Yafei Liu | Ke Ma | Ruqian Shi | Sicheng Zhou | Sijia Yao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Zhiyong Cao | Dunqiang Liu | Qi Dai | Haojun Xu | Huai Yuen Khor | Hao Wang | Huan He | Yafei Liu | Ke Ma | Ruqian Shi | Sicheng Zhou | Sijia Yao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Task-oriented proactive dialogue agents play a pivotal role in recruitment, particularly for steering conversations towards specific business outcomes, such as acquiring social-media contacts for private-channel conversion. Although supervised fine-tuning and reinforcement learning have proven effective for training such agents, their performance is heavily constrained by the scarcity of high-quality, goal-oriented domain-specific training data. To address this challenge, we propose SimRPD, a three-stage framework for training recruitment proactive dialogue agents. First, we develop a high-fidelity user simulator to synthesize large-scale conversational data through multi-turn online dialogue. Then we introduce a multi-dimensional evaluation framework based on Chain-of-Intention (CoI) to comprehensively assess the simulator and effectively select high-quality data, incorporating both global-level and instance-level metrics. Finally, we train the recruitment proactive dialogue agent on the selected dataset. Experiments in a real-world recruitment scenario demonstrate that SimRPD outperforms existing simulator-based data selection strategies, highlighting its practical value for industrial deployment and its potential applicability to other business-oriented dialogue scenarios.
2025
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs
Zhicheng Guo | Sijie Cheng | Yuchen Niu | Hao Wang | Sicheng Zhou | Wenbing Huang | Yang Liu
Findings of the Association for Computational Linguistics: ACL 2025
Zhicheng Guo | Sijie Cheng | Yuchen Niu | Hao Wang | Sicheng Zhou | Wenbing Huang | Yang Liu
Findings of the Association for Computational Linguistics: ACL 2025
The rapid advancement of large language models (LLMs) has spurred significant interest in tool learning, where LLMs are augmented with external tools to tackle complex tasks. However, existing tool environments face challenges in balancing stability, scale, and realism, particularly for benchmarking purposes. To address this, we propose MirrorAPI, a novel framework that trains specialized LLMs to accurately simulate real API responses, effectively acting as “mirrors” to tool environments. Using a comprehensive dataset of request-response pairs from 7,000+ APIs, we employ supervised fine-tuning and chain-of-thought reasoning to enhance simulation fidelity. MirrorAPI achieves superior accuracy and stability compared to state-of-the-art methods, as demonstrated by its performance on the newly constructed MirrorAPI-Bench and its integration into StableToolBench.