Karan Gupta
Papers on this page may belong to the following people: Karan Gupta
2025
Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities
Bhagesh Gaur | Karan Gupta | Aseem Srivastava | Manish Gupta | Md Shad Akhtar
Findings of the Association for Computational Linguistics: EMNLP 2025
Bhagesh Gaur | Karan Gupta | Aseem Srivastava | Manish Gupta | Md Shad Akhtar
Findings of the Association for Computational Linguistics: EMNLP 2025
Online Mental Health Communities (OMHCs) provide crucial peer and expert support, yet many posts remain unanswered due to missing support attributes that signal the need for help. We present a novel framework that identifies these gaps and prompts users to enrich their posts, thereby improving engagement. To support this, we introduce REDDME, a new dataset of 4,760 posts from mental health subreddits annotated for the span and intensity of three key support attributes: event what happened?, effect what did the user experience?, and requirement what support they need?. Next, we devise a hierarchical taxonomy, CueTaxo, of support attributes for controlled question generation. Further, we propose MH-COPILOT, a reinforcement learning-based system that integrates (a) contextual attribute-span identification, (b) support attribute intensity classification, (c) controlled question generation via a hierarchical taxonomy, and (d) a verifier for reward modeling. Our model dynamically assesses posts for the presence/absence of support attributes, and generates targeted prompts to elicit missing information. Empirical results across four notable language models demonstrate significant improvements in attribute elicitation and user engagement. A human evaluation further validates the model’s effectiveness in real-world OMHC settings.
Generative or Discriminative? Revisiting Text Classification in the Era of Transformers
Siva Rajesh Kasa | Karan Gupta | Sumegh Roychowdhury | Ashutosh Kumar | Yaswanth Biruduraju | Santhosh Kumar Kasa | Pattisapu Nikhil Priyatam | Arindam Bhattacharya | Shailendra Agarwal | Vijay Huddar
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Siva Rajesh Kasa | Karan Gupta | Sumegh Roychowdhury | Ashutosh Kumar | Yaswanth Biruduraju | Santhosh Kumar Kasa | Pattisapu Nikhil Priyatam | Arindam Bhattacharya | Shailendra Agarwal | Vijay Huddar
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
*The comparison between discriminative and generative classifiers has intrigued researchers since [Efron (1975)’s](https://www.jstor.org/stable/2285453) seminal analysis of logistic regression versus discriminant analysis. While early theoretical work established that generative classifiers exhibit lower sample complexity but higher asymptotic error in simple linear settings, these trade-offs remain unexplored in the transformer era. We present the first comprehensive evaluation of modern generative and discriminative architectures—Auto-regressive, Masked Language Modeling, Discrete Diffusion, and Encoders for text classification. Our study reveals that the classical “two regimes” phenomenon manifests distinctly across different architectures and training paradigms. Beyond accuracy, we analyze sample efficiency, calibration, noise robustness, and ordinality across diverse scenarios. Our findings offer practical guidance for selecting the most suitable modeling approach based on real-world constraints such as latency and data limitations.*
2024
Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques
Siva Rajesh Kasa | Aniket Goel | Karan Gupta | Sumegh Roychowdhury | Pattisapu Priyatam | Anish Bhanushali | Prasanna Srinivasa Murthy
Findings of the Association for Computational Linguistics: ACL 2024
Siva Rajesh Kasa | Aniket Goel | Karan Gupta | Sumegh Roychowdhury | Pattisapu Priyatam | Anish Bhanushali | Prasanna Srinivasa Murthy
Findings of the Association for Computational Linguistics: ACL 2024
Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have primarily focused on modifying existing or creating novel loss functions that explicitly account for the ordinal nature of labels. However, with the advent of Pre-trained Language Models (PLMs), it became possible to tackle ordinality through the implicit semantics of the labels as well. This paper provides a comprehensive theoretical and empirical examination of both these approaches. Furthermore, we also offer strategic recommendations regarding the most effective approach to adopt based on specific settings.