2023
pdf
abs
Speaker Role Identification in Call Centre Dialogues: Leveraging Opening Sentences and Large Language Models
Minh-Quoc Nghiem
|
Nichola Roberts
|
Dmitry Sityaev
Proceedings of the 24th Annual Meeting of the Special Interest Group on Discourse and Dialogue
This paper addresses the task of speaker role identification in call centre dialogues, focusing on distinguishing between the customer and the agent. We propose a text-based approach that utilises the identification of the agent’s opening sentence as a key feature for role classification. The opening sentence is identified using a model trained through active learning. By combining this information with a large language model, we accurately classify the speaker roles. The proposed approach is evaluated on a dataset of call centre dialogues and achieves 93.61% accuracy. This work contributes to the field by providing an effective solution for speaker role identification in call centre settings, with potential applications in interaction analysis and information retrieval.
2022
pdf
abs
Text Classification and Prediction in the Legal Domain
Minh-Quoc Nghiem
|
Paul Baylis
|
André Freitas
|
Sophia Ananiadou
Proceedings of the Thirteenth Language Resources and Evaluation Conference
We present a case study on the application of text classification and legal judgment prediction for flight compensation. We combine transformer-based classification models to classify responses from airlines and incorporate text data with other data types to predict a legal claim being successful. Our experimental evaluations show that our models achieve consistent and significant improvements over baselines and even outperformed human prediction when predicting a claim being successful. These models were integrated into an existing claim management system, providing substantial productivity gains for handling the case lifecycle, currently supporting several thousands of monthly processes.
2021
pdf
abs
Paladin: an annotation tool based on active and proactive learning
Minh-Quoc Nghiem
|
Paul Baylis
|
Sophia Ananiadou
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
In this paper, we present Paladin, an open-source web-based annotation tool for creating high-quality multi-label document-level datasets. By integrating active learning and proactive learning to the annotation task, Paladin makes the task less time-consuming and requiring less human effort. Although Paladin is designed for multi-label settings, the system is flexible and can be adapted to other tasks in single-label settings.
2018
pdf
abs
APLenty: annotation tool for creating high-quality datasets using active and proactive learning
Minh-Quoc Nghiem
|
Sophia Ananiadou
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
In this paper, we present APLenty, an annotation tool for creating high-quality sequence labeling datasets using active and proactive learning. A major innovation of our tool is the integration of automatic annotation with active learning and proactive learning. This makes the task of creating labeled datasets easier, less time-consuming and requiring less human effort. APLenty is highly flexible and can be adapted to various other tasks.
2013
pdf
Sense Disambiguation: From Natural Language Words to Mathematical Terms
Minh-Quoc Nghiem
|
Giovanni Yoko Kristianto
|
Goran Topić
|
Akiko Aizawa
Proceedings of the Sixth International Joint Conference on Natural Language Processing