Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering

Angus Addlesee; Weronika Sieińska; Nancie Gunson; Daniel Hernández Garcia; Christian Dondrup; Oliver Lemon

doi:10.18653/v1/2023.sigdial-1.22

Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering

Angus Addlesee, Weronika Sieińska, Nancie Gunson, Daniel Hernandez Garcia, Christian Dondrup, Oliver Lemon

Abstract

This paper evaluates the extent to which current LLMs can capture task-oriented multi-party conversations (MPCs). We have recorded and transcribed 29 MPCs between patients, their companions, and a social robot in a hospital. We then annotated this corpus for multi-party goal-tracking and intent-slot recognition. People share goals, answer each other’s goals, and provide other people’s goals in MPCs - none of which occur in dyadic interactions. To understand user goals in MPCs, we compared three methods in zero-shot and few-shot settings: we fine-tuned T5, created pre-training tasks to train DialogLM using LED, and employed prompt engineering techniques with GPT-3.5-turbo, to determine which approach can complete this novel task with limited data. GPT-3.5-turbo significantly outperformed the others in a few-shot setting. The ‘reasoning’ style prompt, when given 7% of the corpus as example annotated conversations, was the best performing method. It correctly annotated 62.32% of the goal tracking MPCs, and 69.57% of the intent-slot recognition MPCs. A ‘story’ style prompt increased model hallucination, which could be detrimental if deployed in safety-critical settings. We conclude that multi-party conversations still challenge state-of-the-art LLMs.

Anthology ID:: 2023.sigdial-1.22
Volume:: Proceedings of the 24th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Month:: September
Year:: 2023
Address:: Prague, Czechia
Editors:: Svetlana Stoyanchev, Shafiq Joty, David Schlangen, Ondrej Dusek, Casey Kennington, Malihe Alikhani
Venue:: SIGDIAL
SIG:: SIGDIAL
Publisher:: Association for Computational Linguistics
Note:
Pages:: 229–241
Language:
URL:: https://aclanthology.org/2023.sigdial-1.22
DOI:: 10.18653/v1/2023.sigdial-1.22
Bibkey:
Cite (ACL):: Angus Addlesee, Weronika Sieińska, Nancie Gunson, Daniel Hernandez Garcia, Christian Dondrup, and Oliver Lemon. 2023. Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering. In Proceedings of the 24th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 229–241, Prague, Czechia. Association for Computational Linguistics.
Cite (Informal):: Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering (Addlesee et al., SIGDIAL 2023)
Copy Citation:
PDF:: https://preview.aclanthology.org/ml4al-ingestion/2023.sigdial-1.22.pdf

PDF Search