TY - GEN
T1 - Extremum Seeking and Adaptive Dynamic Programming for Distributed Feedback Optimization
AU - Liu, Tong
AU - Krstic, Miroslav
AU - Jiang, Zhong Ping
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper studies the distributed feedback optimization problem for linear multi-agent systems without precise knowledge of local costs and agent dynamics. The proposed solution is based on a hierarchical approach that uses upper-level coordinators to adjust reference signals toward the global optimum and lower-level controllers to regulate agents' outputs toward the reference signals. In the absence of precise information on local gradients and agent dynamics, an extremum-seeking mechanism is used to enforce a gradient descent optimization strategy, and an adaptive dynamic programming approach is taken to synthesize an internal-model-based optimal tracking controller. The whole procedure relies only on measurements of local costs and input-state data along agents' trajectories. Moreover, under appropriate conditions, the closed-loop signals are bounded and the output of the agents exponentially converges to a small neighborhood of the desired extremum. A numerical example is conducted to validate the efficacy of the proposed method.
AB - This paper studies the distributed feedback optimization problem for linear multi-agent systems without precise knowledge of local costs and agent dynamics. The proposed solution is based on a hierarchical approach that uses upper-level coordinators to adjust reference signals toward the global optimum and lower-level controllers to regulate agents' outputs toward the reference signals. In the absence of precise information on local gradients and agent dynamics, an extremum-seeking mechanism is used to enforce a gradient descent optimization strategy, and an adaptive dynamic programming approach is taken to synthesize an internal-model-based optimal tracking controller. The whole procedure relies only on measurements of local costs and input-state data along agents' trajectories. Moreover, under appropriate conditions, the closed-loop signals are bounded and the output of the agents exponentially converges to a small neighborhood of the desired extremum. A numerical example is conducted to validate the efficacy of the proposed method.
UR - http://www.scopus.com/inward/record.url?scp=86000669950&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=86000669950&partnerID=8YFLogxK
U2 - 10.1109/CDC56724.2024.10886832
DO - 10.1109/CDC56724.2024.10886832
M3 - Conference contribution
AN - SCOPUS:86000669950
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 7662
EP - 7667
BT - 2024 IEEE 63rd Conference on Decision and Control, CDC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 63rd IEEE Conference on Decision and Control, CDC 2024
Y2 - 16 December 2024 through 19 December 2024
ER -