TY - GEN
T1 - Learning and reasoning for robot dialog and navigation tasks
AU - Lu, Keting
AU - Zhang, Shiqi
AU - Stone, Peter
AU - Chen, Xiaoping
N1 - Publisher Copyright: © 2020 Association for Computational Linguistics.
PY - 2020
Y1 - 2020
N2 - Reinforcement learning and probabilistic reasoning algorithms aim at learning from interaction experiences and reasoning with probabilistic contextual knowledge respectively. In this research, we develop algorithms for robot task completions, while looking into the complementary strengths of reinforcement learning and probabilistic reasoning techniques. The robots learn from trial-and-error experiences to augment their declarative knowledge base, and the augmented knowledge can be used for speeding up the learning process in potentially different tasks. We have implemented and evaluated the developed algorithms using mobile robots conducting dialog and navigation tasks. From the results, we see that our robot's performance can be improved by both reasoning with human knowledge and learning from task-completion experience. More interestingly, the robot was able to learn from navigation tasks to improve its dialog strategies.
AB - Reinforcement learning and probabilistic reasoning algorithms aim at learning from interaction experiences and reasoning with probabilistic contextual knowledge respectively. In this research, we develop algorithms for robot task completions, while looking into the complementary strengths of reinforcement learning and probabilistic reasoning techniques. The robots learn from trial-and-error experiences to augment their declarative knowledge base, and the augmented knowledge can be used for speeding up the learning process in potentially different tasks. We have implemented and evaluated the developed algorithms using mobile robots conducting dialog and navigation tasks. From the results, we see that our robot's performance can be improved by both reasoning with human knowledge and learning from task-completion experience. More interestingly, the robot was able to learn from navigation tasks to improve its dialog strategies.
UR - https://www.scopus.com/pages/publications/85102756941
U2 - 10.18653/v1/2020.sigdial-1.14
DO - 10.18653/v1/2020.sigdial-1.14
M3 - Conference contribution
T3 - SIGDIAL 2020 - 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference
SP - 107
EP - 117
BT - SIGDIAL 2020 - 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2020
Y2 - 1 July 2020 through 3 July 2020
ER -