TY - JOUR
T1 - State constrained stochastic optimal control for continuous and hybrid dynamical systems using DFBSDE
AU - Dai, Bolun
AU - Krishnamurthy, Prashanth
AU - Papanicolaou, Andrew
AU - Khorrami, Farshad
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/9
Y1 - 2023/9
N2 - We develop a computationally efficient learning-based forward–backward stochastic differential equations (FBSDE) controller for both continuous and hybrid dynamical (HD) systems subject to stochastic noise and state constraints. Solutions to stochastic optimal control (SOC) problems satisfy the Hamilton–Jacobi–Bellman (HJB) equation. Using current FBSDE-based solutions, the optimal control can be obtained from the HJB equations using deep neural networks (e.g., long short-term memory (LSTM) networks). To ensure the learned controller respects the constraint boundaries, we enforce the state constraints using a soft penalty function. In addition to previous works, we adapt the deep FBSDE (DFBSDE) control framework to handle HD systems consisting of continuous dynamics and a deterministic discrete state change. We demonstrate our proposed algorithm in simulation on a continuous nonlinear system (cart–pole) and a hybrid nonlinear system (five-link biped).
AB - We develop a computationally efficient learning-based forward–backward stochastic differential equations (FBSDE) controller for both continuous and hybrid dynamical (HD) systems subject to stochastic noise and state constraints. Solutions to stochastic optimal control (SOC) problems satisfy the Hamilton–Jacobi–Bellman (HJB) equation. Using current FBSDE-based solutions, the optimal control can be obtained from the HJB equations using deep neural networks (e.g., long short-term memory (LSTM) networks). To ensure the learned controller respects the constraint boundaries, we enforce the state constraints using a soft penalty function. In addition to previous works, we adapt the deep FBSDE (DFBSDE) control framework to handle HD systems consisting of continuous dynamics and a deterministic discrete state change. We demonstrate our proposed algorithm in simulation on a continuous nonlinear system (cart–pole) and a hybrid nonlinear system (five-link biped).
KW - Forward and backward stochastic differential equations
KW - Optimal control
KW - Stochastic control
UR - http://www.scopus.com/inward/record.url?scp=85163469119&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163469119&partnerID=8YFLogxK
U2 - 10.1016/j.automatica.2023.111146
DO - 10.1016/j.automatica.2023.111146
M3 - Article
AN - SCOPUS:85163469119
SN - 0005-1098
VL - 155
JO - Automatica
JF - Automatica
M1 - 111146
ER -