MDPG: Markov Decision Process with Graph Representation in Reinforcement Learning

Yide Yu, Dennis Wong, Yan Ma, Yue Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The Markov Decision Process (MDP) is a widely used framework for modeling decision-making problems. However, its traditional representation as a set of states and actions may be limited in its ability to capture complex dependencies between variables. This study proposes an alternative representation of MDP, called MDP represented by the graph (MDPG), which leverages Graph Theory to represent and address decision-making problems. In this paper, we provide the definition of MDPG and valid walks, and explore its degree and induced subgraph properties. Additionally, we prove that MDPG is capable of detecting non-stationary and partially observable processes, which is an important advantage over traditional MDP models. To enhance the practical application of MDPG, we redefine the state-value and state-action-value functions. Overall, our study demonstrates the potential of MDPG as a promising framework for modeling decision-making problems, and offers a new perspective on the use of Graph Theory in the field of decision-making.

Original languageEnglish
Title of host publicationProceedings - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages121-125
Number of pages5
ISBN (Electronic)9798350326680
DOIs
Publication statusPublished - 2023
Event2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023 - Virtual, Online, China
Duration: 23 Jun 202325 Jun 2023

Publication series

NameProceedings - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023

Conference

Conference2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
Country/TerritoryChina
CityVirtual, Online
Period23/06/2325/06/23

Keywords

  • Graph Theory
  • Markov Decision Process
  • partially observable

Fingerprint

Dive into the research topics of 'MDPG: Markov Decision Process with Graph Representation in Reinforcement Learning'. Together they form a unique fingerprint.

Cite this