SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing

Sai Aravind C; Sunil Saumya; C Pothan Reddy

SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing

Sai Aravind C, Sunil Saumya, C Pothan Reddy

Abstract

We describe MLA-CI (Multi-Layer Adversarial for Content Invariance), a DeBERTa-v3-base system for SemEval-2026 Task 11 Subtask 1 on content-invariant syllogistic reasoning. MLA-CI combines multi-layer feature extraction, gradient-reversal adversarial training, structure-preserving template augmentation, implausible-class oversampling, and focal loss. Our principal contribution is a systematic ablation study, confirmed across three random seeds, showing that adversarial training at standard strength is counterproductive: removing gradient reversal improves the mean validation score from 26.41 ± 0.99 to 38.15 ± 5.32. Per-condition analysis reveals that gradient reversal over-suppresses plausibility-correlated features, creating an inverted content effect that disproportionately harms plausible-valid accuracy. A sweep over seven adversarial pressure values reveal that only very light adversarial pressure value (≤ 0.1) preserves accuracy, while the submitted adversarial pressure value (1.0 or above) cause severe degradation. We conclude that data-level debiasing through structure-preserving augmentation is more effective and robust than model-level adversarial debiasing for this task.

Anthology ID:: 2026.semeval-1.317
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2512–2519
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.317/
DOI:
Bibkey:
Cite (ACL):: Sai Aravind C, Sunil Saumya, and C Pothan Reddy. 2026. SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2512–2519, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing (C et al., SemEval 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.317.pdf

PDF Cite Search Fix data