Conversion of MDP problems into heuristics based planning problems using temporal decomposition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This paper presents an approach for recasting Markov Decision Process (MDP) problems into heuristics based planning problems. The basic idea is to use temporal decomposition of the state space based on a subset of state space referred to as termination sample space. Specifically, the recasting of MDP problems is done in three steps. First step is to define a state space adaptation criterion based on the termination sample space. Second step is to define an action selection heuristic from each state. Third and final step is to define a recursion or backtracking methodology to avoid dead ends and infinite loops. All three steps have been described and discussed. A case study involving fault detection and alarm generation for the reaction wheels of a satellite mission has been discussed. The proposed approach has been compared with existing approaches for recasting MDP problems using the case study. Computational reduction achieved by the proposed approach is evident from the results.

Original languageEnglish
Title of host publicationProceedings of 2016 13th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2016
EditorsMuhammad Zafar-uz-Zaman
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages179-184
Number of pages6
ISBN (Electronic)9781467391276
DOIs
StatePublished - 9 Mar 2016
Event13th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2016 - Islamabad, Pakistan
Duration: 12 Jan 201616 Jan 2016

Publication series

NameProceedings of 2016 13th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2016

Conference

Conference13th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2016
Country/TerritoryPakistan
CityIslamabad
Period12/01/1616/01/16

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

Keywords

  • Approximate Dynamic Programming
  • Heuristics Based Planning
  • Markov Decision Processes
  • Temporal Decomposition

ASJC Scopus subject areas

  • Civil and Structural Engineering
  • Control and Systems Engineering
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Signal Processing
  • Materials Science (miscellaneous)
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Conversion of MDP problems into heuristics based planning problems using temporal decomposition'. Together they form a unique fingerprint.

Cite this