MDPG: Markov Decision Process with Graph Representation in Reinforcement Learning

Yide Yu, Dennis Wong, Yan Ma, Yue Liu

研究成果: Conference contribution同行評審

摘要

The Markov Decision Process (MDP) is a widely used framework for modeling decision-making problems. However, its traditional representation as a set of states and actions may be limited in its ability to capture complex dependencies between variables. This study proposes an alternative representation of MDP, called MDP represented by the graph (MDPG), which leverages Graph Theory to represent and address decision-making problems. In this paper, we provide the definition of MDPG and valid walks, and explore its degree and induced subgraph properties. Additionally, we prove that MDPG is capable of detecting non-stationary and partially observable processes, which is an important advantage over traditional MDP models. To enhance the practical application of MDPG, we redefine the state-value and state-action-value functions. Overall, our study demonstrates the potential of MDPG as a promising framework for modeling decision-making problems, and offers a new perspective on the use of Graph Theory in the field of decision-making.

原文English
主出版物標題Proceedings - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面121-125
頁數5
ISBN(電子)9798350326680
DOIs
出版狀態Published - 2023
事件2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023 - Virtual, Online, China
持續時間: 23 6月 202325 6月 2023

出版系列

名字Proceedings - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023

Conference

Conference2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
國家/地區China
城市Virtual, Online
期間23/06/2325/06/23

指紋

深入研究「MDPG: Markov Decision Process with Graph Representation in Reinforcement Learning」主題。共同形成了獨特的指紋。

引用此