Yilun Zhang
2026
Web Fraud Attacks Against LLM-Driven Multi-Agent Systems
Dezhang Kong | Hujin Peng | Yilun Zhang | Lele Zhao | Zhenhua Xu | Shi Lin | Changting Lin | Meng Han
Findings of the Association for Computational Linguistics: ACL 2026
Dezhang Kong | Hujin Peng | Yilun Zhang | Lele Zhao | Zhenhua Xu | Shi Lin | Changting Lin | Meng Han
Findings of the Association for Computational Linguistics: ACL 2026
With the proliferation of LLM-driven multi-agent systems (MAS), the security of Web links has become a critical concern. Once MAS is induced to trust a malicious link, attackers can use it as a springboard to expand the attack surface. In this paper, we propose Web Fraud Attacks, a novel type of attack manipulating unique structures of web links to deceive MAS. We design 12 representative attack variants that encompass various methods, such as homoglyph deception, sub-directory nesting, and parameter obfuscation. Through extensive experiments on these attack vectors, we demonstrate that Web fraud attacks not only exhibit significant destructive potential across different MAS architectures but also possess a distinct advantage in evasion: they circumvent the need for complex input design, lowering the threshold for attacks significantly. These results underscore the importance of addressing Web fraud attacks, providing new insights into MAS safety.
2025
SENTRA: Selected-Next-Token Transformer for LLM Text Detection
Mitchell Plyler | Yilun Zhang | Alexander Tuzhilin | Saoud Khalifah | Sen Tian
Findings of the Association for Computational Linguistics: EMNLP 2025
Mitchell Plyler | Yilun Zhang | Alexander Tuzhilin | Saoud Khalifah | Sen Tian
Findings of the Association for Computational Linguistics: EMNLP 2025
LLMs are becoming increasingly capable and widespread. Consequently, the potential and reality of their misuse is also growing. In this work, we address the problem of detecting LLM-generated text that is not explicitly declared as such. We present a novel, general-purpose, and supervised LLM text detector, SElected-Next-Token tRAnsformer (SENTRA). SENTRA is a Transformer-based encoder leveraging selected-next-token-probability sequences and utilizing contrastive pre-training on large amounts of unlabeled data. Our experiments on three popular public datasets across 24 domains of text demonstrate SENTRA is a general-purpose classifier that significantly outperforms popular baselines in the out-of-domain setting.