Multi-Task Learning for Ambiguous Candidate Identification with Pre-trained Model

Daesik Jang; Hyewon Choi

Multi-Task Learning for Ambiguous Candidate Identification with Pre-trained Model

Abstract

Recently, research using multimodal datasets containing image and text information has been conducted actively. One of them is the SIMMC2.1 dataset. It is a more complicated dataset than answering a conversation using only text because it should predict an answer after understanding the relationship between images and text. Therefore, there are limitations to answering a conversation only using text-based models such as BERT or GPT-2, so models with both image and language understanding abilities should be considered. We propose a new model that is effective for the ambiguous candidate identification task in DSTC11 SIMMC2.1 Tark. It consists of a simple pipeline model structure, which has two steps. The first step is to check whether there is ambiguity in the current user utterance, and the second step is to extract objects mentioned in the ambiguous utterance of the user. We suggest a new learning framework with a pre-trained image model and text model that is effective for the ambiguous candidate identification task. Experiments show that the proposed method can improve the model performance, and our model achieved 3rd place in sub-task 1 of the SIMMC2.1 track.

Anthology ID:: 2023.dstc-1.2
Volume:: Proceedings of The Eleventh Dialog System Technology Challenge
Month:: September
Year:: 2023
Address:: Prague, Czech Republic
Editors:: Yun-Nung Chen, Paul Crook, Michel Galley, Sarik Ghazarian, Chulaka Gunasekara, Raghav Gupta, Behnam Hedayatnia, Satwik Kottur, Seungwhan Moon, Chen Zhang
Venues:: DSTC | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 9–14
Language:
URL:: https://aclanthology.org/2023.dstc-1.2
DOI:
Bibkey:
Cite (ACL):: Daesik Jang and Hyewon Choi. 2023. Multi-Task Learning for Ambiguous Candidate Identification with Pre-trained Model. In Proceedings of The Eleventh Dialog System Technology Challenge, pages 9–14, Prague, Czech Republic. Association for Computational Linguistics.
Cite (Informal):: Multi-Task Learning for Ambiguous Candidate Identification with Pre-trained Model (Jang & Choi, DSTC-WS 2023)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-2/2023.dstc-1.2.pdf

PDF Cite Search