Poojitha Nandigam

2025

pdf bib abs
How Inclusively do LMs Perceive Social and Moral Norms?
Michael Galarnyk | Agam Shah | Dipanwita Guhathakurta | Poojitha Nandigam | Sudheer Chava
Findings of the Association for Computational Linguistics: NAACL 2025

**This paper discusses and contains offensive content.** Language models (LMs) are used in decision-making systems and as interactive assistants. However, how well do these models making judgements align with the diversity of human values, particularly regarding social and moral norms? In this work, we investigate how inclusively LMs perceive norms across demographic groups (e.g., gender, age, and income). We prompt 11 LMs on rules-of-thumb (RoTs) and compare their outputs with the existing responses of 100 human annotators. We introduce the Absolute Distance Alignment Metric (ADA-Met) to quantify alignment on ordinal questions. We find notable disparities in LM responses, with younger, higher-income groups showing closer alignment, raising concerns about the representation of marginalized perspectives. Our findings highlight the importance of further efforts to make LMs more inclusive of diverse human values. The code and prompts are available on GitHub under the CC BY-NC 4.0 license.

2022

pdf bib abs
Diverse Multi-Answer Retrieval with Determinantal Point Processes
Poojitha Nandigam | Nikhil Rayaprolu | Manish Shrivastava
Proceedings of the 29th International Conference on Computational Linguistics

Often questions provided to open-domain question answering systems are ambiguous. Traditional QA systems that provide a single answer are incapable of answering ambiguous questions since the question may be interpreted in several ways and may have multiple distinct answers. In this paper, we address multi-answer retrieval which entails retrieving passages that can capture majority of the diverse answers to the question. We propose a re-ranking based approach using Determinantal point processes utilizing BERT as kernels. Our method jointly considers query-passage relevance and passage-passage correlation to retrieve passages that are both query-relevant and diverse. Results demonstrate that our re-ranking technique outperforms state-of-the-art method on the AmbigQA dataset.

pdf bib abs
Named Entity Recognition for Code-Mixed Kannada-English Social Media Data
Poojitha Nandigam | Abhinav Appidi | Manish Shrivastava
Proceedings of the 19th International Conference on Natural Language Processing (ICON)

Named Entity Recognition (NER) is a critical task in the field of Natural Language Processing (NLP) and is also a sub-task of Information Extraction. There has been a significant amount of work done in entity extraction and Named Entity Recognition for resource-rich languages. Entity extraction from code-mixed social media data like tweets from twitter complicates the problem due to its unstructured, informal, and incomplete information available in tweets. Here, we present work on NER in Kannada-English code-mixed social media corpus with corresponding named entity tags referring to Organisation (Org), Person (Pers), and Location (Loc). We experimented with machine learning classification models like Conditional Random Fields (CRF), Bi-LSTM, and Bi-LSTM-CRF models on our corpus.

Co-authors

Nikhil Rayaprolu 1

Agam Shah 1

Venues

Fix data