Krishna Tewari

2026

IReLIIT(BHU) at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization
Soumadip Majumder | Arjun Mukherjee | Krishna Tewari | Sanjaya Lenka | Sukomal Pal
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)

This paper presents the IReLIIT(BHU) submission to SemEval-2026 Task 9 for the Chinese language track. We participated in all three subtasks: binary polarization detection,multi-label polarization type classification, and multi-label manifestation identification. Our approach is based on a unified transformer based framework with cross-validation, prediction aggregation, and threshold optimization to improve robustness across tasks. On the official evaluation, our systems achieved Macro-F1 scores of 0.9081, 0.7962, and 0.6484 for Subtasks 1, 2, and 3, respectively on test data.

pdf bib abs

IReL_IIT(BHU)@LTEDI 2026: Fine-Tuning Instruction-Tuned Transformers for Gender-Inclusive Rewriting and Counterfactual Bias Mitigation
Anurag Balaji | Arjun Mukherjee | Krishna Tewari | Sukomal Pal
Proceedings of the Sixth Workshop on Language Technology for Equality, Diversity, Inclusion

This paper presents our submissions to the LT-EDI@ACL 2026 Shared Task on Gender Inclusive Language Generation. The task focuses on controlled text rewriting that reduces gender bias while keeping the original meaning and fluency intact. We participated in boththe subtasks and treated them independently, training separate instances of the instruction-tuned encoder–decoder model on the respective training datasets. Scores are calculated based on averages across different rubrics, including Gender Assumption (GA), Gender Neutrality (GN), and Quality Relevance (QR) for Task A, and Politeness and Respectful (PR), Contextual Counter-Narrative Coherence (CCNC), and Quality and Relevance (QR) for Task B.For Subtask A (Gender-Inclusive Language Generation) in the English dataset, an average score of 43.7917 could be achieved. For Subtask B (Counterfactual Generation), we achieved an average score of 82.6241. Overall, the experiments indicate that full finetuning of instruction-tuned transformers provides an effective way to produce sentence in gender-neutral form and also producing counter-factual sentences for biased one, wheneach subtask is optimized on its own data.

Co-authors

Venues

Fix author