Abstract
Learning from sparse and delayed reward is a central issue in reinforcement learning. In this paper, to tackle reward sparseness problem of task oriented dialogue management, we propose a curriculum based approach on the number of slots of user goals. This curriculum makes it possible to learn dialogue management for sets of user goals with large number of slots. We also propose a dialogue policy based on progressive neural networks whose modules with parameters are appended with previous parameters fixed as the curriculum proceeds, and this policy improves performances over the one with single set of parameters.- Anthology ID:
- W18-5707
- Volume:
- Proceedings of the 2018 EMNLP Workshop SCAI: The 2nd International Workshop on Search-Oriented Conversational AI
- Month:
- October
- Year:
- 2018
- Address:
- Brussels, Belgium
- Editors:
- Aleksandr Chuklin, Jeff Dalton, Julia Kiseleva, Alexey Borisov, Mikhail Burtsev
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 46–51
- Language:
- URL:
- https://aclanthology.org/W18-5707
- DOI:
- 10.18653/v1/W18-5707
- Cite (ACL):
- Atsushi Saito. 2018. Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management. In Proceedings of the 2018 EMNLP Workshop SCAI: The 2nd International Workshop on Search-Oriented Conversational AI, pages 46–51, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management (Saito, EMNLP 2018)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/W18-5707.pdf