Skip to main navigation Skip to search Skip to main content

Learning and reasoning for robot dialog and navigation tasks

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Reinforcement learning and probabilistic reasoning algorithms aim at learning from interaction experiences and reasoning with probabilistic contextual knowledge respectively. In this research, we develop algorithms for robot task completions, while looking into the complementary strengths of reinforcement learning and probabilistic reasoning techniques. The robots learn from trial-and-error experiences to augment their declarative knowledge base, and the augmented knowledge can be used for speeding up the learning process in potentially different tasks. We have implemented and evaluated the developed algorithms using mobile robots conducting dialog and navigation tasks. From the results, we see that our robot's performance can be improved by both reasoning with human knowledge and learning from task-completion experience. More interestingly, the robot was able to learn from navigation tasks to improve its dialog strategies.

Original languageEnglish
Title of host publicationSIGDIAL 2020 - 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages107-117
Number of pages11
ISBN (Electronic)9781952148026
DOIs
StatePublished - 2020
Event21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2020 - Virtual, Online
Duration: Jul 1 2020Jul 3 2020

Publication series

NameSIGDIAL 2020 - 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference

Conference

Conference21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2020
CityVirtual, Online
Period07/1/2007/3/20

Fingerprint

Dive into the research topics of 'Learning and reasoning for robot dialog and navigation tasks'. Together they form a unique fingerprint.

Cite this