TY - GEN
T1 - Learning Quadruped Locomotion Policies Using Logical Rules
AU - DeFazio, David
AU - Hayamizu, Yohei
AU - Zhang, Shiqi
N1 - Publisher Copyright: Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2024/5/30
Y1 - 2024/5/30
N2 - Quadruped animals are capable of exhibiting a diverse range of locomotion gaits. While progress has been made in demonstrating such gaits on robots, current methods rely on motion priors, dynamics models, or other forms of extensive manual efforts. People can use natural language to describe dance moves. Could one use a formal language to specify quadruped gaits? To this end, we aim to enable easy gait specification and efficient policy learning. Leveraging Reward Machines (RMs) for high-level gait specification over foot contacts, our approach is called RM-based Locomotion Learning (RMLL), and supports adjusting gait frequency at execution time. Gait specification is enabled through the use of a few logical rules per gait (e.g., alternate between moving front feet and back feet) and does not require labor-intensive motion priors. Experimental results in simulation highlight the diversity of learned gaits (including two novel gaits), their energy consumption and stability across different terrains, and the superior sample-efficiency when compared to baselines. We also demonstrate these learned policies with a real quadruped robot. Video and supplementary materials: https://sites.google.com/view/rm-locomotion-learning/home.
AB - Quadruped animals are capable of exhibiting a diverse range of locomotion gaits. While progress has been made in demonstrating such gaits on robots, current methods rely on motion priors, dynamics models, or other forms of extensive manual efforts. People can use natural language to describe dance moves. Could one use a formal language to specify quadruped gaits? To this end, we aim to enable easy gait specification and efficient policy learning. Leveraging Reward Machines (RMs) for high-level gait specification over foot contacts, our approach is called RM-based Locomotion Learning (RMLL), and supports adjusting gait frequency at execution time. Gait specification is enabled through the use of a few logical rules per gait (e.g., alternate between moving front feet and back feet) and does not require labor-intensive motion priors. Experimental results in simulation highlight the diversity of learned gaits (including two novel gaits), their energy consumption and stability across different terrains, and the superior sample-efficiency when compared to baselines. We also demonstrate these learned policies with a real quadruped robot. Video and supplementary materials: https://sites.google.com/view/rm-locomotion-learning/home.
UR - https://www.scopus.com/pages/publications/85195978625
U2 - 10.1609/icaps.v34i1.31470
DO - 10.1609/icaps.v34i1.31470
M3 - Conference contribution
T3 - Proceedings International Conference on Automated Planning and Scheduling, ICAPS
SP - 142
EP - 150
BT - Proceedings of the 34th International Conference on Automated Planning and Scheduling, ICAPS 2024
A2 - Bernardini, Sara
A2 - Muise, Christian
PB - Association for the Advancement of Artificial Intelligence
T2 - 34th International Conference on Automated Planning and Scheduling, ICAPS 2024
Y2 - 1 June 2024 through 6 June 2024
ER -