TY - GEN
T1 - A CUDA-based parallel adaptive dynamic programming algorithm
AU - Li, Lu
AU - Chen, Xin
AU - Wang, Wei
N1 - Publisher Copyright: © 2017 Technical Committee on Control Theory, CAA.
PY - 2017/9/7
Y1 - 2017/9/7
N2 - Adaptive Dynamic Programming (ADP) with critic-actor architecture is a useful way to achieve online learning control. The algorithm Gaussian-Kernel Adaptive Dynamic Programming (GK-ADP) that has been developed before has a kind of two-phase iteration, which not only approximates value function, but also optimizes hyper-parameters simultaneously. However, just like most iteration algorithms are applied in practice, the scale of sample set will increase as the complexity of the system increases, and it will induce a high computation cost. In order to speed up computation, a practical acceleration method using parallel computation for GK-ADP is presented in this paper. To realize parallel computation, a high efficient configuration based on CUDA is designed, in which a group of GPUs work in parallel to compute the most complex part of GK-ADP. The comparison test illustrates that the computation burden which hinders GK-ADP's application is reduced to a large extent when the parallel computing is introduced.
AB - Adaptive Dynamic Programming (ADP) with critic-actor architecture is a useful way to achieve online learning control. The algorithm Gaussian-Kernel Adaptive Dynamic Programming (GK-ADP) that has been developed before has a kind of two-phase iteration, which not only approximates value function, but also optimizes hyper-parameters simultaneously. However, just like most iteration algorithms are applied in practice, the scale of sample set will increase as the complexity of the system increases, and it will induce a high computation cost. In order to speed up computation, a practical acceleration method using parallel computation for GK-ADP is presented in this paper. To realize parallel computation, a high efficient configuration based on CUDA is designed, in which a group of GPUs work in parallel to compute the most complex part of GK-ADP. The comparison test illustrates that the computation burden which hinders GK-ADP's application is reduced to a large extent when the parallel computing is introduced.
KW - CUDA
KW - Gaussian-Kernel Adaptive Dynamic Programming
KW - GPU
KW - Parallel Computing
UR - https://www.scopus.com/pages/publications/85032208508
U2 - 10.23919/ChiCC.2017.8027901
DO - 10.23919/ChiCC.2017.8027901
M3 - Conference contribution
T3 - Chinese Control Conference, CCC
SP - 3510
EP - 3515
BT - Proceedings of the 36th Chinese Control Conference, CCC 2017
A2 - Liu, Tao
A2 - Zhao, Qianchuan
PB - IEEE Computer Society
T2 - 36th Chinese Control Conference, CCC 2017
Y2 - 26 July 2017 through 28 July 2017
ER -