Pritam Kadasi
2026
LingoResearchGroup at SemEval-2026 Task 9: Evaluating Prompt Variants for Polarization Detection
Pritam Kadasi | Anuj Tiwari | Mayank Singh
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Pritam Kadasi | Anuj Tiwari | Mayank Singh
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Our submission presented in this paper is for SemEval-2026 Task 9: Multilingual Text Classification Challenge - Polarization Detection and it covers all three subtasks: (1) binary polarization detection, (2) polarization type classification and (3) polarization manifestation identification. We adopt a systematic approach of research on short designed prompts by considering twelve designed prompts that are different in terminology clarity, detail of the definition, guidance of reasoning and in-context examples use. The experiments are conducted using aya-101 and Gemma3-27B, with the latter chosen for the submission at the end of the development through performance considerations. Our system has an average macro level \textbf{F1-score of 0.762 on Subtask 1, 0.587 on Subtask 2 and 0.444 on Subtask 3} with the average accuracy of 0.819, 0.678 and 0.498, respectively, on the official test set averaged among 22 languages, respectively. With cross-task and cross-lingual analysis, we demonstrate that prompt-based approaches can be used effectively to detect coarse-grained polarization but encounter more and more difficulties as far as fine-grained and multi-label sociolinguistic classification is concerned.
2023
Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance
Pritam Kadasi | Mayank Singh
Findings of the Association for Computational Linguistics: EMNLP 2023
Pritam Kadasi | Mayank Singh
Findings of the Association for Computational Linguistics: EMNLP 2023
The NLP community has long advocated for the construction of multi-annotator datasets to better capture the nuances of language interpretation, subjectivity, and ambiguity. This paper conducts a retrospective study to show how performance scores can vary when a dataset expands from a single annotation per instance to multiple annotations. We propose a novel multi-annotator simulation process to generate datasets with varying annotation budgets. We show that similar datasets with the same annotation budget can lead to varying performance gains. Our findings challenge the popular belief that models trained on multi-annotation examples always lead to better performance than models trained on single or few-annotation examples.