TY - GEN
T1 - MDPG
T2 - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
AU - Yu, Yide
AU - Wong, Dennis
AU - Ma, Yan
AU - Liu, Yue
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The Markov Decision Process (MDP) is a widely used framework for modeling decision-making problems. However, its traditional representation as a set of states and actions may be limited in its ability to capture complex dependencies between variables. This study proposes an alternative representation of MDP, called MDP represented by the graph (MDPG), which leverages Graph Theory to represent and address decision-making problems. In this paper, we provide the definition of MDPG and valid walks, and explore its degree and induced subgraph properties. Additionally, we prove that MDPG is capable of detecting non-stationary and partially observable processes, which is an important advantage over traditional MDP models. To enhance the practical application of MDPG, we redefine the state-value and state-action-value functions. Overall, our study demonstrates the potential of MDPG as a promising framework for modeling decision-making problems, and offers a new perspective on the use of Graph Theory in the field of decision-making.
AB - The Markov Decision Process (MDP) is a widely used framework for modeling decision-making problems. However, its traditional representation as a set of states and actions may be limited in its ability to capture complex dependencies between variables. This study proposes an alternative representation of MDP, called MDP represented by the graph (MDPG), which leverages Graph Theory to represent and address decision-making problems. In this paper, we provide the definition of MDPG and valid walks, and explore its degree and induced subgraph properties. Additionally, we prove that MDPG is capable of detecting non-stationary and partially observable processes, which is an important advantage over traditional MDP models. To enhance the practical application of MDPG, we redefine the state-value and state-action-value functions. Overall, our study demonstrates the potential of MDPG as a promising framework for modeling decision-making problems, and offers a new perspective on the use of Graph Theory in the field of decision-making.
KW - Graph Theory
KW - Markov Decision Process
KW - partially observable
UR - http://www.scopus.com/inward/record.url?scp=85190662548&partnerID=8YFLogxK
U2 - 10.1109/ACDP59959.2023.00026
DO - 10.1109/ACDP59959.2023.00026
M3 - Conference contribution
AN - SCOPUS:85190662548
T3 - Proceedings - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
SP - 121
EP - 125
BT - Proceedings - 2023 International Conference on Algorithms, Computing and Data Processing, ACDP 2023
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 23 June 2023 through 25 June 2023
ER -