VishBox v2: A Multi-Agent System for Adaptive Voice Phishing Simulation

Sungmi Park, Daon Choi, Yoonmo Yang, Hong Yunyi, Heedou Kim


Abstract
Voice phishing is a multi-round social engineering attack in which strategy and victim psychology co-evolve, yet real transcripts are rarely accessible for systematic analysis. We present VishBox v2, a multi-agent architecture that generates structured phishing simulations grounded in crime-script procedures and persuasion principles. A Main Agent orchestrates a Dialogue Agent and a Tactic Search Agent, combining multi-round dialogue generation, web-based tactic mining, and emotion-driven vulnerability tracking. Across 571 rounds, results including police-expert evaluation support procedural realism and show that VishBox v2 captures tactic concentration, vulnerability transitions, and web-search-induced procedural disruptions. The framework provides a controlled foundation for safer red-teaming and security training research.
Anthology ID:
2026.acl-industry.145
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Yunyao Li, Georg Rehm, Mei Tu
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2168–2182
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.145/
DOI:
Bibkey:
Cite (ACL):
Sungmi Park, Daon Choi, Yoonmo Yang, Hong Yunyi, and Heedou Kim. 2026. VishBox v2: A Multi-Agent System for Adaptive Voice Phishing Simulation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 2168–2182, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
VishBox v2: A Multi-Agent System for Adaptive Voice Phishing Simulation (Park et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.145.pdf