UIT-AMMC at SemEval-2026 Task 13: Exploiting Structural Formatting Signatures for Robust AI-Generated Code Detection

Cuong Pham; Minh Nguyen; Minh Lê; An Nguyen; Chinh Nguyen

UIT-AMMC at SemEval-2026 Task 13: Exploiting Structural Formatting Signatures for Robust AI-Generated Code Detection

Cuong Pham, Minh Nguyen, Minh Le, An Nguyen, Chinh Nguyen

Abstract

We participated in Subtask A with our Structure-Aware Contrastive Cascade, a multi-stage architecture designed to distinguish between human-authored and machine-generated code by integrating generative reasoning with explicit structural linguistic features. Our system focuses on exploiting structural formatting signatures that frequently emerge in AI-generated code as a byproduct of post-training alignment and readability optimization. The pipeline utilizes a Qwen-2.5-Coder 14B model fine-tuned via QLoRA, incorporating stochastic data augmentation techniques to ensure robustness across unseen programming languages. Final classification is achieved through a late-fusion mechanism that combines contrastive probability scores with statistical metrics of code presentation density. For samples exhibiting high epistemic uncertainty, we implement a multi-agent adversarial debate step to refine the final verdict. This approach enabled our system to achieve a Macro F1 score of 0.802, ranking 3rd on the official leaderboard.

Anthology ID:: 2026.semeval-1.60
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 418–425
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.60/
DOI:
Bibkey:
Cite (ACL):: Cuong Pham, Minh Nguyen, Minh Le, An Nguyen, and Chinh Nguyen. 2026. UIT-AMMC at SemEval-2026 Task 13: Exploiting Structural Formatting Signatures for Robust AI-Generated Code Detection. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 418–425, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: UIT-AMMC at SemEval-2026 Task 13: Exploiting Structural Formatting Signatures for Robust AI-Generated Code Detection (Pham et al., SemEval 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.60.pdf
Supplementarymaterial:: 2026.semeval-1.60.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data