TY - GEN
T1 - Reinforcement Learning Approach to Sedation and Delirium Management in the Intensive Care Unit
AU - Eghbali, Niloufar
AU - Alhanai, Tuka
AU - Ghassemi, Mohammad M.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Common treatments in Intensive Care Units frequently involve prolonged sedation. Maintaining adequate sedation levels is challenging and prone to errors including: incorrect dosing, omission/delay in administration and, selecting a sub-optimal combination of sedatives. In this single-center retrospective study of 1,346 patients, we use a Deep Q Network approach to develop a multi-objective sedation management agent. The agent's objective was to achieve an adequate level of patient sedation without moving the patient's Mean Arterial Pressure (MAP) outside of a therapeutic range. To achieve this objective, the agent was allowed to periodically (every 4 hours) recommend how the dose of two commonly used sedatives (propofol, midazolam) and an opioid (fentanyl) should be adjusted: increased, decreased, or stay the same. To inform it's recommendations, the agent was provided with the patient's demographym and periodic measures including: vital signs, and depth of sedation. To mitigate the potential risk of delirium and the adverse effects of over sedation, a delirium control variable was integrated into the agent's reward function. We found that Physicians with dosing policies that agreed with our agent were 29% more likely to maintain the patient's sedation in a therapeutic range, compared to those that disagreed with our agent's policy.
AB - Common treatments in Intensive Care Units frequently involve prolonged sedation. Maintaining adequate sedation levels is challenging and prone to errors including: incorrect dosing, omission/delay in administration and, selecting a sub-optimal combination of sedatives. In this single-center retrospective study of 1,346 patients, we use a Deep Q Network approach to develop a multi-objective sedation management agent. The agent's objective was to achieve an adequate level of patient sedation without moving the patient's Mean Arterial Pressure (MAP) outside of a therapeutic range. To achieve this objective, the agent was allowed to periodically (every 4 hours) recommend how the dose of two commonly used sedatives (propofol, midazolam) and an opioid (fentanyl) should be adjusted: increased, decreased, or stay the same. To inform it's recommendations, the agent was provided with the patient's demographym and periodic measures including: vital signs, and depth of sedation. To mitigate the potential risk of delirium and the adverse effects of over sedation, a delirium control variable was integrated into the agent's reward function. We found that Physicians with dosing policies that agreed with our agent were 29% more likely to maintain the patient's sedation in a therapeutic range, compared to those that disagreed with our agent's policy.
UR - http://www.scopus.com/inward/record.url?scp=85179512772&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85179512772&partnerID=8YFLogxK
U2 - 10.1109/BHI58575.2023.10313431
DO - 10.1109/BHI58575.2023.10313431
M3 - Conference contribution
AN - SCOPUS:85179512772
T3 - BHI 2023 - IEEE-EMBS International Conference on Biomedical and Health Informatics, Proceedings
BT - BHI 2023 - IEEE-EMBS International Conference on Biomedical and Health Informatics, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE-EMBS International Conference on Biomedical and Health Informatics, BHI 2023
Y2 - 15 October 2023 through 18 October 2023
ER -