Team Vivek Dhayaal at SemEval-2026 Task 13 Subtask B: Multi-Class Authorship Detection

David Rodriguez, Mario Graff


Abstract
This paper describes the system for SemEval-2026 Task 10 Subtask 2 on conspiracy detection. We explore a progressive modeling strategy comparing traditional lexical representations with contextual transformer models. Lexical baselines include Bag-of-Words and TF-IDF features combined with Logistic Regression and Ridge classifiers. We then fine-tune a DistilRoBERTa transformer model for binary classification.All experiments were conducted using only the official task data in a CPU-only environment without external datasets or data augmentation. Our objective was to achieve acceptable performance while minimizing computational resources and model complexity. Results show that the transformer model improves the best lexical baseline from 0.67 to 0.75. The work highlights that competitive performance in conspiracy detection can be obtained with lightweight and reproducible configurations.
Anthology ID:
2026.semeval-1.434
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3520–3523
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.434/
DOI:
Bibkey:
Cite (ACL):
David Rodriguez and Mario Graff. 2026. Team Vivek Dhayaal at SemEval-2026 Task 13 Subtask B: Multi-Class Authorship Detection. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 3520–3523, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Team Vivek Dhayaal at SemEval-2026 Task 13 Subtask B: Multi-Class Authorship Detection (Rodriguez & Graff, SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.434.pdf