TY - JOUR
T1 - Multi-Agent Learning for Precise and Collaborative Control of Anesthetics in TIVA
AU - Li, Huijie
AU - Liu, Kunpeng
AU - Yu, Yide
AU - Zhai, Yuejing
AU - Hu, Anmin
AU - Huo, Jian
AU - Luo, Wuman
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2025
Y1 - 2025
N2 - Precise and collaborative control of multiple anesthetics in Total Intravenous Anesthesia (TIVA) is essential for ensuring patient safety and maintaining the target depth of anesthesia (DoA). However, existing automated anesthesia control methods often fail to effectively capture the complex synergistic interactions between anesthetics and lack adaptability to patient-specific physiological variability, thereby limiting their clinical applicability. To address these issues, we propose AnesMADRL, a novel Multi-Agent Deep Reinforcement Learning (MADRL)-based framework that leverages the Counterfactual Multi-Agent algorithm for effective credit assignment between agents controlling propofol and remifentanil, and adopts a continuous action space to enable fine-grained dose adjustments. Furthermore, AnesMADRL integrates comprehensive patient-specific physiological indicators and employs a random forest-based simulator to generate dynamic and diverse training environments. Experimental results show that AnesMADRL significantly outperforms baseline methods and human expertise in terms of anesthetic efficiency and total drug consumption. Relative to human expertise, AnesMADRL achieves roughly twofold efficiency while reducing total dose to about one-half, highlighting its potential to enhance patient safety and optimize clinical outcomes.
AB - Precise and collaborative control of multiple anesthetics in Total Intravenous Anesthesia (TIVA) is essential for ensuring patient safety and maintaining the target depth of anesthesia (DoA). However, existing automated anesthesia control methods often fail to effectively capture the complex synergistic interactions between anesthetics and lack adaptability to patient-specific physiological variability, thereby limiting their clinical applicability. To address these issues, we propose AnesMADRL, a novel Multi-Agent Deep Reinforcement Learning (MADRL)-based framework that leverages the Counterfactual Multi-Agent algorithm for effective credit assignment between agents controlling propofol and remifentanil, and adopts a continuous action space to enable fine-grained dose adjustments. Furthermore, AnesMADRL integrates comprehensive patient-specific physiological indicators and employs a random forest-based simulator to generate dynamic and diverse training environments. Experimental results show that AnesMADRL significantly outperforms baseline methods and human expertise in terms of anesthetic efficiency and total drug consumption. Relative to human expertise, AnesMADRL achieves roughly twofold efficiency while reducing total dose to about one-half, highlighting its potential to enhance patient safety and optimize clinical outcomes.
KW - collaborative control
KW - environment simulator
KW - multi-agent deep reinforcement learning
KW - personalized anesthesia control
KW - total intravenous anesthesia
UR - https://www.scopus.com/pages/publications/105026032824
U2 - 10.1109/JBHI.2025.3647549
DO - 10.1109/JBHI.2025.3647549
M3 - Article
C2 - 41433183
AN - SCOPUS:105026032824
SN - 2168-2194
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
ER -