Robust Stability and Near-Optimality for Policy Iteration: For Want of Recursive Feasibility, All is Not Lost

Mathieu Granzotto, Olivier Lindamulage De Silva, Romain Postoyan, Dragan Nesic, Zhong Ping Jiang

Research output: Contribution to journalArticlepeer-review

Abstract

We consider deterministic nonlinear discrete-time systems whose inputs are generated by policy iteration (PI) for undiscounted cost functions. We first assume that PI is recursively feasible, in the sense that the optimization problems solved at each iteration admit a solution. In this case, we provide novel conditions to establish recursive robust stability properties for a general attractor, meaning that the policies generated at each iteration ensure a robust KL-stability property with respect to a general state measure. We then derive novel explicit bounds on the mismatch between the (suboptimal) value function returned by PI at each iteration and the optimal one. However, we show by a counterexample that PI may fail to be recursively feasible, disallowing the mentioned stability and near-optimality guarantees. We therefore also present a modification of PI so that recursive feasibility is guaranteed a priori under mild conditions. This modified algorithm, called PI+ is shown to preserve the recursive robust stability when the attractor is compact. In addition, PI+ enjoys the same near-optimality properties as its PI counterpart under the same assumptions.

Original languageEnglish (US)
Pages (from-to)8247-8262
Number of pages16
JournalIEEE Transactions on Automatic Control
Volume69
Issue number12
DOIs
StatePublished - 2024

Keywords

  • Algorithm design and analysis
  • Lyapunov methods
  • asymptotic stability
  • control system synthesis
  • cost function
  • dynamic programming
  • nonlinear systems
  • optimal control
  • predictive control
  • robustness

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Computer Science Applications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Robust Stability and Near-Optimality for Policy Iteration: For Want of Recursive Feasibility, All is Not Lost'. Together they form a unique fingerprint.

Cite this