Reinforcement learning and non-zero-sum game output regulation for multi-player linear uncertain systems

Adedapo Odekunle, Weinan Gao, Masoud Davari, Zhong Ping Jiang

Research output: Contribution to journalArticle


This paper studies the non-zero-sum game output regulation problem (GORP) for a class of continuous-time multi-player linear systems. Without the knowledge of state and input matrices, the Nash equilibrium solution, N-tuple of feedback control policy, is learned through online data collected along the system trajectories. A key strategy is, for the first time, to combine techniques from reinforcement learning (RL), differential game theory, and output regulation for data-driven control design. Different from the existing literature of adaptive optimal output regulation, the feedforward matrices are considered nontrivial. Theoretical analysis shows the disturbance rejection and tracking ability of the closed-loop system. Simulation results demonstrate the efficacy of the developed data-driven control approach.

Original languageEnglish (US)
Article number108672
StatePublished - Feb 2020



  • Adaptive optimal control
  • Data-Driven control
  • Game theory
  • Output regulation
  • Reinforcement learning (RL)

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this