Nidhi Arora


2026

Word sense disambiguation in narrative contexts requires systems to reason about subtle semantic relationships between candidate senses and discourse context. This paper addresses SemEval 2026 Task 5, which reformulates WSD as a graded plausibility prediction problem on a 1–5 Likert scale using the AmbiStory dataset. We present two complementary approaches: (1) a DeBERTa-v3-Large encoder with attention-weighted pooling and ordinal regression, achieving a Spearman correlation of 0.718, and (2) a rank-based ensemble combining FLAN-T5 and RoBERTa, achieving 0.692. Through ablation studies, we show that explicitly modeling ordinal structure improves performance over standard regression by 17.3%. We further analyze the strengths of each approach, showing that fine-tuned encoders capture fine-grained semantic relationships, while ensemble methods provide robustness through complementary modeling biases. Our results provide a detailed empirical analysis of design choices for graded plausibility prediction in narrative understanding.

2021

Designing robust conversation systems with great customer experience requires a team of design experts to think of all probable ways a customer can interact with the system and then author responses for each use case individually. The responses are authored from scratch for each new client and application even though similar responses have been created in the past. This happens largely because the responses are encoded using domain specific set of intents and entities. In this paper, we present preliminary work to define a dialog act schema to merge and map responses from different domains and applications using a consistent domain-independent representation. These representations are stored and maintained using an Elasticsearch system to facilitate generation of responses through a search and retrieval process. We experimented generating different surface realizations for a response given a desired information state of the dialog.