TY - GEN
T1 - Optimal control of distributed Markov decision processes with network delays
AU - Adlakha, Sachin
AU - Madan, Ritesh
AU - Lall, Sanjay
AU - Goldsmith, Andrea
PY - 2007
Y1 - 2007
N2 - We consider the problem of finding an optimal feedback controller for a network of interconnected subsystems, each of which is a Markov decision process. Each subsystem is coupled to its neighbors via communication links by which signals are delayed but are otherwise transmitted noise-free. One of the subsystems receives input from a controller, and the controller receives delayed state measurements from all of the subsystems. We show that an optimal controller requires only a finite amount of memory which does not grow with time, and obtain a bound on the amount of memory that a controller needs to have for each subsystem. This makes the computation of an optimal controller through dynamic programming tractable. We illustrate our result by a numerical example, and show that it generalizes previous results on Markov decision processes with delayed state measurements.
AB - We consider the problem of finding an optimal feedback controller for a network of interconnected subsystems, each of which is a Markov decision process. Each subsystem is coupled to its neighbors via communication links by which signals are delayed but are otherwise transmitted noise-free. One of the subsystems receives input from a controller, and the controller receives delayed state measurements from all of the subsystems. We show that an optimal controller requires only a finite amount of memory which does not grow with time, and obtain a bound on the amount of memory that a controller needs to have for each subsystem. This makes the computation of an optimal controller through dynamic programming tractable. We illustrate our result by a numerical example, and show that it generalizes previous results on Markov decision processes with delayed state measurements.
UR - https://www.scopus.com/pages/publications/62749192571
U2 - 10.1109/CDC.2007.4434792
DO - 10.1109/CDC.2007.4434792
M3 - Conference contribution
SN - 1424414989
SN - 9781424414987
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 3308
EP - 3314
BT - Proceedings of the 46th IEEE Conference on Decision and Control 2007, CDC
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 46th IEEE Conference on Decision and Control 2007, CDC
Y2 - 12 December 2007 through 14 December 2007
ER -