Multi-Turn Target-Guided Topic Prediction with Monte Carlo Tree Search

Jingxuan Yang, Si Li, Jun Guo


Abstract
This paper addresses topic prediction in target-guided conversation, which requires the system to proactively and naturally steer the topic thread of a conversation until it reaches a designated target subject. Existing studies usually treat the task as a sequence of single-turn topic predictions. Because the single-turn mechanism cannot explore topics in future turns, a greedy decision is made at each turn, and these methods often produce sub-optimal topic threads. In this paper, we formulate target-guided conversation as a multi-turn topic prediction problem and model it as a Markov decision process (MDP). To alleviate the problem of sub-optimal topic threads, Monte Carlo tree search (MCTS) is employed to improve topic prediction through long-term planning. During online topic prediction, given a target and a start utterance, our proposed MM-TP (MCTS-enhanced MDP for Topic Prediction) first performs MCTS to enhance the policy for predicting the topic of each turn. Then, two retrieval models are used to generate the responses of the agent and the user, respectively. Quantitative evaluation and a qualitative study show that MM-TP significantly outperforms state-of-the-art baselines.
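To illustrate the difference between greedy single-turn prediction and planning with look-ahead, the following is a minimal sketch of MCTS choosing the next topic on a toy topic graph. It is not the authors' MM-TP implementation: the graph `TOPIC_GRAPH`, the reward `0.9 ** depth`, the turn limit, and the function name `mcts_next_topic` are all hypothetical, and plain UCT with random rollouts stands in for whatever policy and value estimates the paper actually uses.

```python
import math
import random

# Hypothetical toy topic graph: each topic maps to the topics reachable in one turn.
TOPIC_GRAPH = {
    "weather": ["travel", "sports"],
    "travel": ["food", "music"],
    "sports": ["music"],
    "food": ["cooking"],
    "music": ["concert"],
    "cooking": [],
    "concert": [],
}


class Node:
    """One search-tree node: a topic reached after `depth` turns."""

    def __init__(self, topic, depth, parent=None):
        self.topic = topic
        self.depth = depth
        self.parent = parent
        self.children = {}  # next topic -> Node
        self.visits = 0
        self.value = 0.0    # sum of rollout rewards


def rollout(topic, depth, target, max_turns, rng):
    """Random playout; the (assumed) reward decays with the number of turns used."""
    while topic != target and depth < max_turns and TOPIC_GRAPH[topic]:
        topic = rng.choice(TOPIC_GRAPH[topic])
        depth += 1
    return 0.9 ** depth if topic == target else 0.0


def uct_child(node, c=1.4):
    """Pick the child maximizing the UCT score (mean reward plus exploration bonus)."""
    return max(
        node.children.values(),
        key=lambda ch: ch.value / ch.visits
        + c * math.sqrt(math.log(node.visits) / ch.visits),
    )


def mcts_next_topic(start, target, max_turns=4, simulations=200, seed=0):
    rng = random.Random(seed)
    root = Node(start, 0)
    for _ in range(simulations):
        node = root
        # Selection: descend while the node is non-terminal and fully expanded.
        while (node.topic != target and node.depth < max_turns
               and TOPIC_GRAPH[node.topic]
               and len(node.children) == len(TOPIC_GRAPH[node.topic])):
            node = uct_child(node)
        # Expansion: add one untried next topic, if any.
        if node.topic != target and node.depth < max_turns:
            untried = [t for t in TOPIC_GRAPH[node.topic] if t not in node.children]
            if untried:
                nxt = rng.choice(untried)
                node.children[nxt] = Node(nxt, node.depth + 1, node)
                node = node.children[nxt]
        # Simulation: estimate the value of the new node with a random playout.
        reward = rollout(node.topic, node.depth, target, max_turns, rng)
        # Backpropagation: update statistics along the path to the root.
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    # Return the most-visited next topic from the start topic.
    return max(root.children.values(), key=lambda ch: ch.visits).topic


next_topic = mcts_next_topic("weather", "cooking")
```

In this toy graph, "sports" can never lead to "cooking", so simulations through it earn no reward and the search concentrates its visits on "travel". A greedy single-turn predictor has no such look-ahead signal unless its scoring function happens to encode it.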
Anthology ID:
2021.icon-main.39
Volume:
Proceedings of the 18th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2021
Address:
National Institute of Technology Silchar, Silchar, India
Editors:
Sivaji Bandyopadhyay, Sobha Lalitha Devi, Pushpak Bhattacharyya
Venue:
ICON
Publisher:
NLP Association of India (NLPAI)
Pages:
324–334
URL:
https://aclanthology.org/2021.icon-main.39
Cite (ACL):
Jingxuan Yang, Si Li, and Jun Guo. 2021. Multi-Turn Target-Guided Topic Prediction with Monte Carlo Tree Search. In Proceedings of the 18th International Conference on Natural Language Processing (ICON), pages 324–334, National Institute of Technology Silchar, Silchar, India. NLP Association of India (NLPAI).
Cite (Informal):
Multi-Turn Target-Guided Topic Prediction with Monte Carlo Tree Search (Yang et al., ICON 2021)
PDF:
https://preview.aclanthology.org/improve-issue-templates/2021.icon-main.39.pdf