Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems
Pei-Hao Su, David Vandyke, Milica Gašić, Nikola Mrkšić, Tsung-Hsien Wen, Steve Young
- Anthology ID:
- W15-4655
- Volume:
- Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue
- Month:
- September
- Year:
- 2015
- Address:
- Prague, Czech Republic
- Editors:
- Alexander Koller, Gabriel Skantze, Filip Jurcicek, Masahiro Araki, Carolyn Penstein Rose
- Venue:
- SIGDIAL
- SIG:
- SIGDIAL
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 417–421
- Language:
- URL:
- https://aclanthology.org/W15-4655
- DOI:
- 10.18653/v1/W15-4655
- Cite (ACL):
- Pei-Hao Su, David Vandyke, Milica Gašić, Nikola Mrkšić, Tsung-Hsien Wen, and Steve Young. 2015. Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems. In Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 417–421, Prague, Czech Republic. Association for Computational Linguistics.
- Cite (Informal):
- Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems (Su et al., SIGDIAL 2015)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/W15-4655.pdf