Manoj Balaji Jagadeeshan


2025

pdf bib
Mahānāma: A Unique Testbed for Literary Entity Discovery and Linking
Sujoy Sarkar | Gourav Sarkar | Manoj Balaji Jagadeeshan | Jivnesh Sandhan | Amrith Krishna | Pawan Goyal
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

High lexical variation, ambiguous references, and long-range dependencies make entity resolution in literary texts particularly challenging. We present Mahānāma, the first large-scale dataset for end-to-end Entity Discovery and Linking (EDL) in Sanskrit, a morphologically rich and under-resourced language. Derived from the Mahābhārata , the world’s longest epic, the dataset comprises over 109K named entity mentions mapped to 5.5K unique entities, and is aligned with an English knowledge base to support cross-lingual linking. The complex narrative structure of Mahānāma, coupled with extensive name variation and ambiguity, poses significant challenges to resolution systems. Our evaluation reveals that current coreference and entity linking models struggle when evaluated on the global context of the test set. These results highlight the limitations of current approaches in resolving entities within such complex discourse. Mahānāma thus provides a unique benchmark for advancing entity resolution, especially in literary domains.

pdf bib
Agentic LLMs for Analyst-Style Financial Insights: An LLM Pipeline for Persuasive Financial Analysis
Gaurangi Sinha | Rajarajeswari Palacharla | Manoj Balaji Jagadeeshan
Proceedings of The 10th Workshop on Financial Technology and Natural Language Processing

pdf bib
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry
Sujeet Kumar | Pretam Ray | Abhinay Beerukuri | Shrey Kamoji | Manoj Balaji Jagadeeshan | Pawan Goyal
Computational Sanskrit and Digital Humanities - World Sanskrit Conference 2025

pdf bib
Anveshana: A New Benchmark Dataset for Cross-Lingual Information Retrieval on English Queries and Sanskrit Documents
Manoj Balaji Jagadeeshan | Prince Raj | Pawan Goyal
Computational Sanskrit and Digital Humanities - World Sanskrit Conference 2025