Abstract
In this article, a data-driven distributed control method is proposed to solve the cooperative optimal output regulation problem of leader-follower multiagent systems. Different from traditional studies on cooperative output regulation, a distributed adaptive internal model is originally developed, which includes a distributed internal model and a distributed observer to estimate the leader's dynamics. Without relying on the dynamics of multiagent systems, we have proposed two reinforcement learning algorithms, policy iteration and value iteration, to learn the optimal controller through online input and state data, and estimated values of the leader's state. By combining these methods, we have established a basis for connecting data-distributed control methods with adaptive dynamic programming approaches in general since these are the theoretical foundation from which they are built.
Original language | English (US) |
---|---|
Pages (from-to) | 5229-5240 |
Number of pages | 12 |
Journal | IEEE transactions on neural networks and learning systems |
Volume | 33 |
Issue number | 10 |
DOIs | |
State | Published - Oct 1 2022 |
Keywords
- Adaptive optimal control
- cooperative output regulation
- distributed adaptive internal model
- reinforcement learning
ASJC Scopus subject areas
- Software
- Computer Science Applications
- Computer Networks and Communications
- Artificial Intelligence