SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing

Sai Aravind C, Sunil Saumya, C Pothan Reddy


Abstract
We describe MLA-CI (Multi-Layer Adversarial for Content Invariance), a DeBERTa-v3-base system for SemEval-2026 Task 11 Subtask 1 on content-invariant syllogistic reasoning. MLA-CI combines multi-layer feature extraction, gradient-reversal adversarial training, structure-preserving template augmentation, implausible-class oversampling, and focal loss. Our principal contribution is a systematic ablation study, confirmed across three random seeds, showing that adversarial training at standard strength is counterproductive: removing gradient reversal improves the mean validation score from 26.41 ± 0.99 to 38.15 ± 5.32. Per-condition analysis reveals that gradient reversal over-suppresses plausibility-correlated features, creating an inverted content effect that disproportionately harms plausible-valid accuracy. A sweep over seven adversarial pressure values reveal that only very light adversarial pressure value (≤ 0.1) preserves accuracy, while the submitted adversarial pressure value (1.0 or above) cause severe degradation. We conclude that data-level debiasing through structure-preserving augmentation is more effective and robust than model-level adversarial debiasing for this task.
Anthology ID:
2026.semeval-1.317
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2512–2519
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.317/
DOI:
Bibkey:
Cite (ACL):
Sai Aravind C, Sunil Saumya, and C Pothan Reddy. 2026. SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2512–2519, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
SpyComet at SemEval-2026 Task 11: When Adversarial Debiasing Backfires - A Comparison of Data-Level and Model-Level Debiasing (C et al., SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.317.pdf