Value iteration, adaptive dynamic programming, and optimal control of nonlinear systems

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper presents a complete answer to the longstanding unanswered question of what value iteration (VI) is for continuous-time, continuous-state-action space nonlinear systems. Based on this proposed VI, we develop a new data-driven adaptive optimal control methodology for unknown nonlinear systems. Unlike the existing literature on adaptive dynamic programming (ADP) for continuous-time systems, which often uses policy iteration (PI), the proposed methodology does not require an initial admissible control policy. By means of the obtained result, a non-model-based adaptive optimal control design is given. The effectiveness of the proposed methodology is also illustrated by an example.

Original language: English (US)
Title of host publication: 2016 IEEE 55th Conference on Decision and Control, CDC 2016
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 3375-3380
Number of pages: 6
ISBN (Electronic): 9781509018376
State: Published - Dec 27 2016
Event: 55th IEEE Conference on Decision and Control, CDC 2016 - Las Vegas, United States
Duration: Dec 12 2016 – Dec 14 2016

Publication series

Name: 2016 IEEE 55th Conference on Decision and Control, CDC 2016

Other

Other: 55th IEEE Conference on Decision and Control, CDC 2016
Country: United States
City: Las Vegas
Period: 12/12/16 – 12/14/16

ASJC Scopus subject areas

  • Artificial Intelligence
  • Decision Sciences (miscellaneous)
  • Control and Optimization
