Wenjie Wang
Other people with similar names: Wenjie Wang, Wenjie Wang
Unverified author pages with similar names: Wenjie Wang
2026
RiskLab: A Controlled Toolkit for Probing Emergent Risks in LLM-Based Multi-Agent Systems
Yu Jiang | Wenjie Wang | Yue Huang | Yanbo Wang | Zhenhong Zhou | Xiuying Chen | Yang Liu | Pin-Yu Chen | Wei Wang | Xiangliang Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Yu Jiang | Wenjie Wang | Yue Huang | Yanbo Wang | Zhenhong Zhou | Xiuying Chen | Yang Liu | Pin-Yu Chen | Wei Wang | Xiangliang Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Large language model (LLM) agents increasingly operate in multi-agent settings where failures emerge from interaction dynamics rather than isolated model errors. We introduce RiskLab, an open-source toolkit for instantiating, probing, and measuring emergent risks in LLM-based multi-agent systems under controlled conditions. Each experiment is defined as a structured topology–environment–protocol–agent–task quintuple, enabling reproducible studies of how communication structure, coordination mechanisms, and incentives shape system-level risks. RiskLab provides flexible communication topologies, swappable interaction protocols, trajectory-grounded evaluation, and extensible registries for risk detectors and agent backends. We demonstrate the toolkit across representative risks, including collusion, resource overreach, semantic drift, and strategic misreporting, and support one-file reproducibility via configuration.