Closed-loop stability analysis of deep reinforcement learning controlled systems with experimental validation

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

Trained deep reinforcement learning (DRL) controllers can effectively control dynamic systems for which classical controllers can be ineffective and difficult to tune. However, the lack of closed-loop stability guarantees for systems controlled by trained DRL agents hinders their adoption in practical applications. This study investigates the closed-loop stability of dynamic systems controlled by trained DRL agents using Lyapunov analysis based on a linear-quadratic polynomial approximation of the trained agent. In addition, this work develops an understanding of the system's stability margin to determine operational boundaries and critical thresholds of the system's physical parameters for effective operation. The proposed analysis is verified on a DRL-controlled system in several simulated and experimental scenarios. The DRL agent is trained using a detailed dynamic model of a non-linear system and then tested on the corresponding real-world hardware platform without any fine-tuning. Experiments are conducted over a wide range of system states and physical parameters, and the results confirm the validity of the proposed stability analysis (https://youtu.be/QlpeD5sTlPU).
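
As a rough illustration of the kind of analysis the abstract describes, the sketch below fits a linear-plus-quadratic polynomial to a policy and then applies a Lyapunov test to the closed loop formed with the linear part of the fit. It does not reproduce the paper's procedure. The plant matrices `A` and `B`, the stand-in `policy` function, the sampling radius, and every helper name are assumptions chosen only to make the example self-contained.

```python
import numpy as np

# Hypothetical discrete-time plant (double integrator, dt = 0.1); not from the paper.
dt = 0.1
A = np.array([[1.0, dt], [0.0, 1.0]])
B = np.array([[0.5 * dt**2], [dt]])

def policy(x):
    """Stand-in for a trained DRL agent: a smooth, saturating state feedback."""
    return 2.0 * np.tanh(-(1.0 * x[0] + 1.5 * x[1]) / 2.0)

def features(x):
    """Linear and quadratic monomials of a 2-D state."""
    x1, x2 = x
    return np.array([x1, x2, x1 * x1, x1 * x2, x2 * x2])

def fit_polynomial(policy_fn, n_samples=2000, radius=0.5, seed=0):
    """Least-squares fit of the policy output on the monomials above."""
    rng = np.random.default_rng(seed)
    X = rng.uniform(-radius, radius, size=(n_samples, 2))
    Phi = np.array([features(x) for x in X])
    u = np.array([policy_fn(x) for x in X])
    coeffs, *_ = np.linalg.lstsq(Phi, u, rcond=None)
    return coeffs

def solve_discrete_lyapunov(A_cl, Q):
    """Solve A_cl^T P A_cl - P = -Q by vectorizing the equation."""
    n = A_cl.shape[0]
    M = np.eye(n * n) - np.kron(A_cl.T, A_cl.T)
    P = np.linalg.solve(M, Q.reshape(-1)).reshape(n, n)
    return 0.5 * (P + P.T)  # symmetrize against round-off

coeffs = fit_polynomial(policy)
K = coeffs[:2].reshape(1, 2)          # linear part of the fitted polynomial
A_cl = A + B @ K                      # closed loop under the linear part only
rho = max(abs(np.linalg.eigvals(A_cl)))
Q = np.eye(2)
P = solve_discrete_lyapunov(A_cl, Q)
is_stable = bool(rho < 1.0 and np.all(np.linalg.eigvalsh(P) > 0))

print("fitted coefficients:", coeffs)
print("spectral radius:", rho)
print("locally stable by the Lyapunov test:", is_stable)
```

If `P` is positive definite, V(x) = xᵀPx is a Lyapunov function for the closed loop formed with the linear part of the fit, which indicates stability only in a neighbourhood of the origin. The fitted quadratic terms are not used by this test. A stability margin in the sense of the abstract could be explored by repeating the test while sweeping a plant parameter such as `dt`.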

Original language: English
Pages (from-to): 1649-1668
Number of pages: 20
Journal: IET Control Theory and Applications
Volume: 18
Issue number: 13
State: Published - Sep 2024
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2024 The Author(s). IET Control Theory & Applications published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.

Keywords

  • control system analysis
  • cranes
  • iterative learning control
  • learning (artificial intelligence)
  • learning systems
  • neural nets
  • neurocontrollers

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Human-Computer Interaction
  • Computer Science Applications
  • Control and Optimization
  • Electrical and Electronic Engineering
