Stackelberg Meta-Learning Based Control for Guided Cooperative LQG Systems

Yuhan Zhao, Quanyan Zhu

Research output: Chapter in Book/Report/Conference proceedingConference contribution


Guided cooperation allows intelligent agents with heterogeneous capabilities to work together by following a leader-follower type of interaction. However, the associated control problem becomes challenging when the leader agent does not have complete information about follower agents. There is a need for learning and adaptation of cooperation plans. To this end, we develop a meta-learning-based Stackelberg game-theoretic framework to address the challenges in the guided cooperative control for linear systems. We first formulate the guided cooperation between agents as a dynamic Stackelberg game and use the feedback Stackelberg equilibrium as the agent-wise cooperation strategy. We further leverage meta-learning to address the incomplete information of follower agents, where the leader agent learns a meta-response model from a prescribed set of followers offline and adapts to a new coming cooperation task with a small amount of learning data. We use a case study in robot teaming to corroborate the effectiveness of our framework. Comparison with other learning approaches also shows that our learned cooperation strategy provides better transferability for different cooperation tasks.

Original languageEnglish (US)
Title of host publicationIFAC-PapersOnLine
EditorsHideaki Ishii, Yoshio Ebihara, Jun-ichi Imura, Masaki Yamakita
PublisherElsevier B.V.
Number of pages6
ISBN (Electronic)9781713872344
StatePublished - Jul 1 2023
Event22nd IFAC World Congress - Yokohama, Japan
Duration: Jul 9 2023Jul 14 2023

Publication series

ISSN (Electronic)2405-8963


Conference22nd IFAC World Congress


  • Adaptive control of multi-agent systems
  • Data-driven control
  • Dynamic games
  • Intelligent robotics
  • Learning for control
  • Linear systems

ASJC Scopus subject areas

  • Control and Systems Engineering


Dive into the research topics of 'Stackelberg Meta-Learning Based Control for Guided Cooperative LQG Systems'. Together they form a unique fingerprint.

Cite this