Aditya Vempaty
2025
Introducing Spotlight: A Novel Approach for Generating Captivating Key Information from Documents
Ankan Mullick
|
Sombit Bose
|
Rounak Saha
|
Ayan Kumar Bhowmick
|
Aditya Vempaty
|
Prasenjit Dey
|
Ravi Kokku
|
Pawan Goyal
|
Niloy Ganguly
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Analyzing and processing vast amounts of textual data presents significant challenges in efficiently extracting key information.In this paper, we introduce '***Spotlight***’, a novel paradigm for information extraction that produces concise, engaging narratives by highlighting the most compelling aspects of a document. Unlike highlights (fragmented key points) and traditional summaries, which prioritize comprehensive coverage, spotlights selectively emphasize intriguing content to foster deeper reader engagement with the source material. We formally differentiate spotlights from related constructs and support our analysis with a detailed benchmarking study using new datasets curated for this work. To generate high-quality spotlights, we propose a two-stage approach: fine-tuning a large language model on our benchmark data, followed by alignment via Direct Preference Optimization (DPO). Our comprehensive evaluation demonstrates that the resulting model not only identifies key elements with precision but also enhances readability and boosts the engagement value of the original document. Datasets and code are available at https://github.com/ankan2/Spotlight-EMNLP2025.
2019
Content Customization for Micro Learning using Human Augmented AI Techniques
Ayush Shah
|
Tamer Abuelsaad
|
Jae-Wook Ahn
|
Prasenjit Dey
|
Ravi Kokku
|
Ruhi Sharma Mittal
|
Aditya Vempaty
|
Mourvi Sharma
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications
Visual content has been proven to be effective for micro-learning compared to other media. In this paper, we discuss leveraging this observation in our efforts to build audio-visual content for young learners’ vocabulary learning. We attempt to tackle two major issues in the process of traditional visual curation tasks. Generic learning videos do not necessarily satisfy the unique context of a learner and/or an educator, and hence may not result in maximal learning outcomes. Also, manual video curation by educators is a highly labor-intensive process. To this end, we present a customizable micro-learning audio-visual content curation tool that is designed to reduce the human (educator) effort in creating just-in-time learning videos from a textual description (learning script). This provides educators with control of the content while preparing the learning scripts, and in turn can also be customized to capture the desired learning objectives and outcomes. As a use case, we automatically generate learning videos with British National Corpus’ (BNC) frequently spoken vocabulary words and evaluate them with experts. They positively recommended the generated learning videos with an average rating of 4.25 on a Likert scale of 5 points. The inter-annotator agreement between the experts for the video quality was substantial (Fleiss Kappa=0.62) with an overall agreement of 81%.
Search
Fix author
Co-authors
- Prasenjit Dey 2
- Ravi Kokku 2
- Tamer Abuelsaad 1
- Jae-Wook Ahn 1
- Ayan Kumar Bhowmick 1
- show all...