跳至主導覽 跳至搜尋 跳過主要內容

Value Decomposition-Based Multi-Agent Learning for Anesthetics Collaborative Control

  • Huijie Li
  • , Yide Yu
  • , Si Shi
  • , Anmin Hu
  • , Jian Huo
  • , Wei Lin
  • , Chaoran Wu
  • , Wuman Luo

研究成果: Article同行評審

摘要

Automated control of personalized multiple anesthetics in clinical Total Intravenous Anesthesia (TIVA) is crucial yet challenging. Current systems, including target-controlled infusion (TCI) and closed-loop systems, either rely on relatively static pharmacokinetic/pharmacodynamic (PK/PD) models or focus on single anesthetic control. So they limit both personalization and collaborative control. To address these issues, we propose a novel Value Decomposition Multi-Agent Deep Reinforcement Learning (VD-MADRL) framework based on Markov Game (MG) for Personalized Multiple Anesthetics Control in a Closed-Loop system (PMAC-CL). VD-MADRL optimizes the collaboration between two anesthetics propofol (Agent I) and remifentanil (Agent II) by leveraging a MG to identify optimal actions among heterogeneous agents. We employ various value function decomposition methods to resolve the credit allocation problem and enhance collaborative control. We also introduce a multivariate environment model based on random forest (RF) for anesthesia state simulation. To ensure data validity, we design a data resampling and alignment technique to synchronize trajectory data from different devices, avoiding gradient explosion and maintaining conformity to Markov property. Extensive experiments on general and thoracic surgery datasets demonstrate that VD-MADRL provides more refined dose adjustments and maintains multiple anesthesia state indicators more stably at target levels compared to human experience. Especially, the best-performing algorithm, VDN in general surgery with online training, achieved a 16.4% increase in cumulative reward (CR) and a 58.0% reduction in mean MDPE compared to human experience. This demonstrates its great clinical value.

原文English
頁(從 - 到)2167-2180
頁數14
期刊IEEE Journal of Biomedical and Health Informatics
30
發行號3
DOIs
出版狀態Published - 2026

指紋

深入研究「Value Decomposition-Based Multi-Agent Learning for Anesthetics Collaborative Control」主題。共同形成了獨特的指紋。

引用此