DIDECO: An Annotated Dataset for Intent Detection in Digital Communications
Senaid Popovic, Damien Riquet, Maxime Meyer, Fabien Lauer, Yannick Parmentier
Abstract
This paper presents DIDECO, the first annotated dataset specifically designed for detecting both explicit and implicit intents in digital communications. We address a critical gap in cybersecurity research by developing a comprehensive taxonomy that distinguishes between explicit communicative goals (what is requested) and implicit persuasion mechanisms (how compliance is engineered). Grounded in Speech Act Theory and persuasion psychology principles, our taxonomy encompasses 20 distinct intent categories across explicit and implicit intents. We annotated 220 LLM-generated spear-phishing emails using a multi-label protocol with six trained annotators, yielding 2,162 intent annotations that reveal the layered complexity of malicious communications. Our analysis demonstrates that sophisticated attacks employ multiple concurrent intents, combining explicit communicative goals with implicit persuasion strategies. This dataset provides resources for developing intent-aware detection systems capable of identifying sophisticated social engineering attacks through semantic analysis.- Anthology ID:
- 2026.lrec-main.542
- Volume:
- Proceedings of the Fifteenth Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2026
- Address:
- Palma de Mallorca, Spain
- Editors:
- Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
- Venue:
- LREC
- SIG:
- Publisher:
- ELRA Language Resource Association
- Note:
- Pages:
- 6808–6822
- Language:
- URL:
- https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.542/
- DOI:
- Cite (ACL):
- Senaid Popovic, Damien Riquet, Maxime Meyer, Fabien Lauer, and Yannick Parmentier. 2026. DIDECO: An Annotated Dataset for Intent Detection in Digital Communications. International Conference on Language Resources and Evaluation, main:6808–6822.
- Cite (Informal):
- DIDECO: An Annotated Dataset for Intent Detection in Digital Communications (Popovic et al., LREC 2026)
- PDF:
- https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.542.pdf