Discrete-time dynamic graphical games: model-free reinforcement learning solution

Mohammed I. Abouheaf*, Frank L. Lewis, Magdi S. Mahmoud, Dariusz G. Mikulski

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

71 Scopus citations

Abstract

This paper introduces a model-free reinforcement learning technique that is used to solve a class of dynamic games known as dynamic graphical games. The graphical game results from multi-agent dynamical systems, where pinning control is used to make all the agents synchronize to the state of a command generator or a leader agent. Novel coupled Bellman equations and Hamiltonian functions are developed for the dynamic graphical games. The Hamiltonian mechanics are used to derive the necessary conditions for optimality. The solution for the dynamic graphical game is given in terms of the solution to a set of coupled Hamilton-Jacobi-Bellman equations developed herein. Nash equilibrium solution for the graphical game is given in terms of the solution to the underlying coupled Hamilton-Jacobi-Bellman equations. An online model-free policy iteration algorithm is developed to learn the Nash solution for the dynamic graphical game. This algorithm does not require any knowledge of the agents’ dynamics. A proof of convergence for this multi-agent learning algorithm is given under mild assumption about the inter-connectivity properties of the graph. A gradient descent technique with critic network structures is used to implement the policy iteration algorithm to solve the graphical game online in real-time.

Original languageEnglish
Pages (from-to)55-69
Number of pages15
JournalControl Theory and Technology
Volume13
Issue number1
DOIs
StatePublished - 1 Feb 2015

Bibliographical note

Publisher Copyright:
© 2015, South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.

Keywords

  • Dynamic graphical games
  • Nash equilibrium
  • discrete mechanics
  • model-free reinforcement learning
  • optimal control
  • policy iteration

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Signal Processing
  • Information Systems
  • Modeling and Simulation
  • Aerospace Engineering
  • Control and Optimization
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Discrete-time dynamic graphical games: model-free reinforcement learning solution'. Together they form a unique fingerprint.

Cite this