CodeHunters at SemEval-2026 Task 13: Detecting Machine-Generated Code with Multiple Programming Languages, Generators, and Application Scenarios

Daniel-Antoniu Dumitru, Simina Lazăr, Nicoleta Danilă (amargheoalei), Daniela Gîfu, Diana Trăndăbăț


Abstract
We participated in Subtasks A and B, where we fine-tuned 3 different pre-trained models (UniXCoder, CodeT5 and codeBERT). The paper describes the detailed approach for both of the subtasks.
Anthology ID:
2026.semeval-1.39
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
270–276
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.39/
DOI:
Bibkey:
Cite (ACL):
Daniel-Antoniu Dumitru, Simina Lazăr, Nicoleta Danilă (amargheoalei), Daniela Gîfu, and Diana Trăndăbăț. 2026. CodeHunters at SemEval-2026 Task 13: Detecting Machine-Generated Code with Multiple Programming Languages, Generators, and Application Scenarios. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 270–276, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
CodeHunters at SemEval-2026 Task 13: Detecting Machine-Generated Code with Multiple Programming Languages, Generators, and Application Scenarios (Dumitru et al., SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.39.pdf
Supplementarymaterial:
 2026.semeval-1.39.SupplementaryMaterial.tex