2021
pdf
abs
Generating An Optimal Interview Question Plan Using A Knowledge Graph And Integer Linear Programming
Soham Datta
|
Prabir Mallick
|
Sangameshwar Patil
|
Indrajit Bhattacharya
|
Girish Palshikar
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Given the diversity of the candidates and complexity of job requirements, and since interviewing is an inherently subjective process, it is an important task to ensure consistent, uniform, efficient and objective interviews that result in high quality recruitment. We propose an interview assistant system to automatically, and in an objective manner, select an optimal set of technical questions (from question banks) personalized for a candidate. This set can help a human interviewer to plan for an upcoming interview of that candidate. We formalize the problem of selecting a set of questions as an integer linear programming problem and use standard solvers to get a solution. We use knowledge graph as background knowledge in this formulation, and derive our objective functions and constraints from it. We use candidate’s resume to personalize the selection of questions. We propose an intrinsic evaluation to compare a set of suggested questions with actually asked questions. We also use expert interviewers to comparatively evaluate our approach with a set of reasonable baselines.
pdf
abs
Temporal Question Generation from History Text
Harsimran Bedi
|
Sangameshwar Patil
|
Girish Palshikar
Proceedings of the 18th International Conference on Natural Language Processing (ICON)
Temporal analysis of history text has always held special significance to students, historians and the Social Sciences community in general. We observe from experimental data that existing deep learning (DL) models of ProphetNet and UniLM for question generation (QG) task do not perform satisfactorily when used directly for temporal QG from history text. We propose linguistically motivated templates for generating temporal questions that probe different aspects of history text and show that finetuning the DL models using the temporal questions significantly improves their performance on temporal QG task. Using automated metrics as well as human expert evaluation, we show that performance of the DL models finetuned with the template-based questions is better than finetuning done with temporal questions from SQuAD.
pdf
abs
Extracting Events from Industrial Incident Reports
Nitin Ramrakhiyani
|
Swapnil Hingmire
|
Sangameshwar Patil
|
Alok Kumar
|
Girish Palshikar
Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021)
Incidents in industries have huge social and political impact and minimizing the consequent damage has been a high priority. However, automated analysis of repositories of incident reports has remained a challenge. In this paper, we focus on automatically extracting events from incident reports. Due to absence of event annotated datasets for industrial incidents we employ a transfer learning based approach which is shown to outperform several baselines. We further provide detailed analysis regarding effect of increase in pre-training data and provide explainability of why pre-training improves the performance.
2020
pdf
abs
Extracting Message Sequence Charts from Hindi Narrative Text
Swapnil Hingmire
|
Nitin Ramrakhiyani
|
Avinash Kumar Singh
|
Sangameshwar Patil
|
Girish Palshikar
|
Pushpak Bhattacharyya
|
Vasudeva Varma
Proceedings of the First Joint Workshop on Narrative Understanding, Storylines, and Events
In this paper, we propose the use of Message Sequence Charts (MSC) as a representation for visualizing narrative text in Hindi. An MSC is a formal representation allowing the depiction of actors and interactions among these actors in a scenario, apart from supporting a rich framework for formal inference. We propose an approach to extract MSC actors and interactions from a Hindi narrative. As a part of the approach, we enrich an existing event annotation scheme where we provide guidelines for annotation of the mood of events (realis vs irrealis) and guidelines for annotation of event arguments. We report performance on multiple evaluation criteria by experimenting with Hindi narratives from Indian History. Though Hindi is the fourth most-spoken first language in the world, from the NLP perspective it has comparatively lesser resources than English. Moreover, there is relatively less work in the context of event processing in Hindi. Hence, we believe that this work is among the initial works for Hindi event processing.
2019
pdf
abs
Extraction of Message Sequence Charts from Narrative History Text
Girish Palshikar
|
Sachin Pawar
|
Sangameshwar Patil
|
Swapnil Hingmire
|
Nitin Ramrakhiyani
|
Harsimran Bedi
|
Pushpak Bhattacharyya
|
Vasudeva Varma
Proceedings of the First Workshop on Narrative Understanding
In this paper, we advocate the use of Message Sequence Chart (MSC) as a knowledge representation to capture and visualize multi-actor interactions and their temporal ordering. We propose algorithms to automatically extract an MSC from a history narrative. For a given narrative, we first identify verbs which indicate interactions and then use dependency parsing and Semantic Role Labelling based approaches to identify senders (initiating actors) and receivers (other actors involved) for these interaction verbs. As a final step in MSC extraction, we employ a state-of-the art algorithm to temporally re-order these interactions. Our evaluation on multiple publicly available narratives shows improvements over four baselines.
pdf
abs
Extraction of Message Sequence Charts from Software Use-Case Descriptions
Girish Palshikar
|
Nitin Ramrakhiyani
|
Sangameshwar Patil
|
Sachin Pawar
|
Swapnil Hingmire
|
Vasudeva Varma
|
Pushpak Bhattacharyya
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers)
Software Requirement Specification documents provide natural language descriptions of the core functional requirements as a set of use-cases. Essentially, each use-case contains a set of actors and sequences of steps describing the interactions among them. Goals of use-case reviews and analyses include their correctness, completeness, detection of ambiguities, prototyping, verification, test case generation and traceability. Message Sequence Chart (MSC) have been proposed as a expressive, rigorous yet intuitive visual representation of use-cases. In this paper, we describe a linguistic knowledge-based approach to extract MSCs from use-cases. Compared to existing techniques, we extract richer constructs of the MSC notation such as timers, conditions and alt-boxes. We apply this tool to extract MSCs from several real-life software use-case descriptions and show that it performs better than the existing techniques. We also discuss the benefits and limitations of the extracted MSCs to meet the above goals.
2018
pdf
Resolving Actor Coreferences in Hindi Narrative Text
Nitin Ramrakhiyani
|
Swapnil Hingmire
|
Sachin Pawar
|
Sangameshwar Patil
|
Girish K. Palshikar
|
Pushpak Bhattacharyya
|
Vasudeva Verma
Proceedings of the 15th International Conference on Natural Language Processing
pdf
abs
Identification of Alias Links among Participants in Narratives
Sangameshwar Patil
|
Sachin Pawar
|
Swapnil Hingmire
|
Girish Palshikar
|
Vasudeva Varma
|
Pushpak Bhattacharyya
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Identification of distinct and independent participants (entities of interest) in a narrative is an important task for many NLP applications. This task becomes challenging because these participants are often referred to using multiple aliases. In this paper, we propose an approach based on linguistic knowledge for identification of aliases mentioned using proper nouns, pronouns or noun phrases with common noun headword. We use Markov Logic Network (MLN) to encode the linguistic knowledge for identification of aliases. We evaluate on four diverse history narratives of varying complexity. Our approach performs better than the state-of-the-art approach as well as a combination of standard named entity recognition and coreference resolution techniques.
2017
pdf
abs
Event Timeline Generation from History Textbooks
Harsimran Bedi
|
Sangameshwar Patil
|
Swapnil Hingmire
|
Girish Palshikar
Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017)
Event timeline serves as the basic structure of history, and it is used as a disposition of key phenomena in studying history as a subject in secondary school. In order to enable a student to understand a historical phenomenon as a series of connected events, we present a system for automatic event timeline generation from history textbooks. Additionally, we propose Message Sequence Chart (MSC) and time-map based visualization techniques to visualize an event timeline. We also identify key computational challenges in developing natural language processing based applications for history textbooks.
2013
pdf
Named Entity Extraction using Information Distance
Sangameshwar Patil
|
Sachin Pawar
|
Girish Palshikar
Proceedings of the Sixth International Joint Conference on Natural Language Processing