Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems

Bo Pang, Zhong Ping Jiang, Iven Mareels

Research output: Contribution to journalArticlepeer-review

Abstract

This paper studies the infinite-horizon adaptive optimal control of continuous-time linear periodic (CTLP) systems, using reinforcement learning techniques. By means of policy iteration (PI) for CTLP systems, both on-policy and off-policy adaptive dynamic programming (ADP) algorithms are derived, such that the solution of the optimal control problem can be found without the exact knowledge of the system dynamics. Starting with initial stabilizing controllers, the proposed PI-based ADP algorithms converge to the optimal solutions under mild conditions. Application to the adaptive optimal control of the lossy Mathieu equation demonstrates the efficacy of the proposed learning-based adaptive optimal control algorithm.

Original languageEnglish (US)
Article number109035
JournalAutomatica
Volume118
DOIs
StatePublished - Aug 2020

Keywords

  • Adaptive dynamic programming (ADP)
  • Optimal control
  • Policy iteration (PI)
  • Reinforcement learning (RL)

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems'. Together they form a unique fingerprint.

Cite this