Yatin Katyal
2026
Uncovering Currency Bias and Syntax Gap in Text Embedding Models
Saurav Sudevan | Harsh Pal | Yatin Katyal | Sahil Manchanda
Findings of the Association for Computational Linguistics: ACL 2026
Saurav Sudevan | Harsh Pal | Yatin Katyal | Sahil Manchanda
Findings of the Association for Computational Linguistics: ACL 2026
Text-embedding models frequently inherit societal biases, yet the influence of socio-economic markers remains largely unexplored. This paper identifies Currency Bias as a systemic representational limitation in financial AI, where models exhibit associative sensitivity to economic hierarchies. We analyze this through three dimensions: (1) the Syntax Gap, where models fail to align currency names, symbols, and acronyms; (2) Associative Sensitivity, where embeddings disproportionately link specific currency identifiers to narratives of risk or poverty; and (3) Downstream volatility, where currency substitutions induce predictive entropy, sentence misunderstanding, sentiment shifts, and credit default prediction flips in downstream tasks. Benchmarking 14 state-of-the-art architectures reveals a pervasive phenomenon of representational disparity, affecting several currencies. These findings suggest that current embedding practices inadvertently encode inequalities, posing significant risks for the fairness and reliability of global financial NLP applications.
2020
Information Retrieval and Extraction on COVID-19 Clinical Articles Using Graph Community Detection and Bio-BERT Embeddings
Debasmita Das | Yatin Katyal | Janu Verma | Shashank Dubey | AakashDeep Singh | Kushagra Agarwal | Sourojit Bhaduri | RajeshKumar Ranjan
Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020
Debasmita Das | Yatin Katyal | Janu Verma | Shashank Dubey | AakashDeep Singh | Kushagra Agarwal | Sourojit Bhaduri | RajeshKumar Ranjan
Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020
In this paper, we present an information retrieval system on a corpus of scientific articles related to COVID-19. We build a similarity network on the articles where similarity is determined via shared citations and biological domain-specific sentence embeddings. Ego-splitting community detection on the article network is employed to cluster the articles and then the queries are matched with the clusters. Extractive summarization using BERT and PageRank methods is used to provide responses to the query. We also provide a Question-Answer bot on a small set of intents to demonstrate the efficacy of our model for an information extraction module.