Learning-based adaptive optimal control of linear time-delay systems: A value iteration approach

Leilei Cui, Bo Pang, Miroslav Krstić, Zhong Ping Jiang

Research output: Contribution to journalArticlepeer-review

Abstract

This paper proposes a novel learning-based adaptive optimal controller design method for a class of continuous-time linear time-delay systems. A key strategy is to exploit the state-of-the-art reinforcement learning (RL) techniques and adaptive dynamic programming (ADP), and propose a data-driven method to learn the near-optimal controller without the precise knowledge of system dynamics. Specifically, a value iteration (VI) algorithm is proposed to solve the infinite-dimensional Riccati equation for the linear quadratic optimal control problem of time-delay systems using finite samples of input-state trajectory data. It is rigorously proved that the proposed VI algorithm converges to the near-optimal solution. Compared with the previous literature, the nice features of the proposed VI algorithm are that it is directly developed for continuous-time systems without discretization and an initial admissible controller is not required for implementing the algorithm. The efficacy of the proposed methodology is demonstrated by two practical examples of metal cutting and autonomous driving.

Original languageEnglish (US)
Article number111944
JournalAutomatica
Volume171
DOIs
StatePublished - Jan 2025

Keywords

  • Adaptive dynamic programming (ADP)
  • Learning-based control
  • Linear time-delay systems
  • Value iteration (VI)

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Learning-based adaptive optimal control of linear time-delay systems: A value iteration approach'. Together they form a unique fingerprint.

Cite this