TY - GEN
T1 - Reinforcement Learning for Adaptive Periodic Linear Quadratic Control
AU - Pang, Bo
AU - Jiang, Zhong Ping
AU - Mareels, Iven
N1 - Funding Information:
*This work has been supported in part by the U.S. National Science Foundation grant ECCS-1501044.
Publisher Copyright:
© 2019 IEEE.
PY - 2019/12
Y1 - 2019/12
N2 - This paper presents a first solution to the problem of adaptive LQR for continuous-time linear periodic systems. Specifically, reinforcement learning and adaptive dynamic programming (ADP) techniques are used to develop two algorithms to obtain near-optimal controllers. Firstly, the policy iteration (PI) and value iteration (VI) methods are proposed when the model is known. Then, PI-based and VI-based off-policy ADP algorithms are derived to find near-optimal solutions directly from input/state data collected along the system trajectories, without the exact knowledge of system dynamics. The effectiveness of the derived algorithms is validated using the well-known lossy Mathieu equation.
AB - This paper presents a first solution to the problem of adaptive LQR for continuous-time linear periodic systems. Specifically, reinforcement learning and adaptive dynamic programming (ADP) techniques are used to develop two algorithms to obtain near-optimal controllers. Firstly, the policy iteration (PI) and value iteration (VI) methods are proposed when the model is known. Then, PI-based and VI-based off-policy ADP algorithms are derived to find near-optimal solutions directly from input/state data collected along the system trajectories, without the exact knowledge of system dynamics. The effectiveness of the derived algorithms is validated using the well-known lossy Mathieu equation.
UR - http://www.scopus.com/inward/record.url?scp=85082504963&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082504963&partnerID=8YFLogxK
U2 - 10.1109/CDC40024.2019.9030252
DO - 10.1109/CDC40024.2019.9030252
M3 - Conference contribution
AN - SCOPUS:85082504963
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 3322
EP - 3327
BT - 2019 IEEE 58th Conference on Decision and Control, CDC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 58th IEEE Conference on Decision and Control, CDC 2019
Y2 - 11 December 2019 through 13 December 2019
ER -