Rishikant Chigrupaatii
2026
Systematic Performance Degradation in Indic Vision-Language Models: Evidence from Hindi and Telugu
Rishikant Chigrupaatii | Ponnada Sai Tulasi Kanishka | Lalit Chandra Routhu | Martin Patel | Sama Supratheek Reddy | Divyam Gupta | Rajiv Misra | Rohun Tripathi
Proceedings of the 4th Workshop on Advances in Language and Vision Research (ALVR)
Rishikant Chigrupaatii | Ponnada Sai Tulasi Kanishka | Lalit Chandra Routhu | Martin Patel | Sama Supratheek Reddy | Divyam Gupta | Rajiv Misra | Rohun Tripathi
Proceedings of the 4th Workshop on Advances in Language and Vision Research (ALVR)
With 1.5 billion people speaking over 120 major languages, India exemplifies the challenges of multilingual AI evaluation. Current multilingual VLM benchmarks suffer from unverified auto-translations, narrow task coverage, small sample sizes, and lack of culturally grounded content. We present HinTel-AlignBench, a comprehensive evaluation framework and benchmark for Hindi and Telugu vision-language models with English-aligned samples. Our framework combines semi-automated translation with human verification to generate 4k QA pairs per language across five domains: adapted English datasets (VQAv2, RealWorldQA, CLEVR-Math) and native Indic sets (JEE for STEM, VAANI for cultural grounding). Evaluation of state-of-the-art open and closed-source VLMs reveals consistent performance regression from English to Indic languages, with average drops of 8.3 points for Hindi and 5.5 points for Telugu across four of five tasks. We identify key failure modes and establish reproducible baselines for multilingual multimodal evaluation.
2024
BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability
Mamta | Rishikant Chigrupaatii | Asif Ekbal
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Mamta | Rishikant Chigrupaatii | Asif Ekbal
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Toxic content detection plays a vital role in addressing the misuse of social media platforms to harm people or groups due to their race, gender or ethnicity. However, due to the nature of the datasets, systems develop an unintended bias due to the over-generalization of the model to the training data. This compromises the fairness of the systems, which can impact certain groups due to their race, gender, etc.Existing methods mitigate bias using data augmentation, adversarial learning, etc., which require re-training and adding extra parameters to the model.In this work, we present a robust and generalizable technique BiasWipe to mitigate unintended bias in language models. BiasWipe utilizes model interpretability using Shapley values, which achieve fairness by pruning the neuron weights responsible for unintended bias. It first identifies the neuron weights responsible for unintended bias and then achieves fairness by pruning them without loss of original performance. It does not require re-training or adding extra parameters to the model. To show the effectiveness of our proposed technique for bias unlearning, we perform extensive experiments for Toxic content detection for BERT, RoBERTa, and GPT models. .