Xiaoping Chen
2020
Learning and Reasoning for Robot Dialog and Navigation Tasks
Keting Lu
|
Shiqi Zhang
|
Peter Stone
|
Xiaoping Chen
Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Reinforcement learning and probabilistic reasoning algorithms aim at learning from interaction experiences and reasoning with probabilistic contextual knowledge respectively. In this research, we develop algorithms for robot task completions, while looking into the complementary strengths of reinforcement learning and probabilistic reasoning techniques. The robots learn from trial-and-error experiences to augment their declarative knowledge base, and the augmented knowledge can be used for speeding up the learning process in potentially different tasks. We have implemented and evaluated the developed algorithms using mobile robots conducting dialog and navigation tasks. From the results, we see that our robot’s performance can be improved by both reasoning with human knowledge and learning from task-completion experience. More interestingly, the robot was able to learn from navigation tasks to improve its dialog strategies.
Adaptive Dialog Policy Learning with Hindsight and User Modeling
Yan Cao
|
Keting Lu
|
Xiaoping Chen
|
Shiqi Zhang
Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Reinforcement learning (RL) methods have been widely used for learning dialog policies. Sample efficiency, i.e., the efficiency of learning from limited dialog experience, is particularly important in RL-based dialog policy learning, because interacting with people is costly and low-quality dialog policies produce very poor user experience. In this paper, we develop LHUA (Learning with Hindsight, User modeling, and Adaptation) that, for the first time, enables dialog agents to adaptively learn with hindsight from both simulated and real users. Simulation and hindsight provide the dialog agent with more experience and more (positive) reinforcement respectively. Experimental results suggest that LHUA outperforms competitive baselines from the literature, including its no-simulation, no-adaptation, and no-hindsight counterparts.
2015
Revisiting Word Embedding for Contrasting Meaning
Zhigang Chen
|
Wei Lin
|
Qian Chen
|
Xiaoping Chen
|
Si Wei
|
Hui Jiang
|
Xiaodan Zhu
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Search
Co-authors
- Keting Lu 2
- Shiqi Zhang 2
- Zhigang Chen 1
- Wei Lin 1
- Qian Chen 1
- show all...