CodeHunters at SemEval-2026 Task 13: Detecting Machine-Generated Code with Multiple Programming Languages, Generators, and Application Scenarios
Daniel-Antoniu Dumitru, Simina Lazăr, Nicoleta Danilă (amargheoalei), Daniela Gîfu, Diana Trăndăbăț
Abstract
We participated in Subtasks A and B, where we fine-tuned 3 different pre-trained models (UniXCoder, CodeT5 and codeBERT). The paper describes the detailed approach for both of the subtasks.- Anthology ID:
- 2026.semeval-1.39
- Volume:
- Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
- Venues:
- SemEval | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 270–276
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.39/
- DOI:
- Cite (ACL):
- Daniel-Antoniu Dumitru, Simina Lazăr, Nicoleta Danilă (amargheoalei), Daniela Gîfu, and Diana Trăndăbăț. 2026. CodeHunters at SemEval-2026 Task 13: Detecting Machine-Generated Code with Multiple Programming Languages, Generators, and Application Scenarios. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 270–276, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- CodeHunters at SemEval-2026 Task 13: Detecting Machine-Generated Code with Multiple Programming Languages, Generators, and Application Scenarios (Dumitru et al., SemEval 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.39.pdf