iRiSC: Iterative Risk Sensitive Control for Nonlinear Systems with Imperfect Observations

Bilal Hammoud, Armand Jordana, Ludovic Righetti

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This work addresses the problem of risk-sensitive control for nonlinear systems with imperfect state observations, extending results for the linear case. In particular, we derive an algorithm that can compute local solutions with computational complexity similar to the iterative linear quadratic regulator algorithm. The proposed algorithm introduces feasibility gaps to allow the initialization with non-feasible trajectories. Moreover, an approximation for the expectation of the general nonlinear cost is proposed to enable an iterative line search solution to the planning problem. The optimal estimator is also derived along with the controls minimizing the general stochastic nonlinear cost. Finally extensive simulations are carried out to show the increased robustness the proposed framework provides when compared to the risk neutral iLQG counter part. To the authors' best knowledge, this is the first algorithm that computes risk aware optimal controls that are a function of both the process noise and measurement uncertainty.

Original languageEnglish (US)
Title of host publication2022 American Control Conference, ACC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3550-3557
Number of pages8
ISBN (Electronic)9781665451963
DOIs
StatePublished - 2022
Event2022 American Control Conference, ACC 2022 - Atlanta, United States
Duration: Jun 8 2022Jun 10 2022

Publication series

NameProceedings of the American Control Conference
Volume2022-June
ISSN (Print)0743-1619

Conference

Conference2022 American Control Conference, ACC 2022
Country/TerritoryUnited States
CityAtlanta
Period6/8/226/10/22

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'iRiSC: Iterative Risk Sensitive Control for Nonlinear Systems with Imperfect Observations'. Together they form a unique fingerprint.

Cite this