Benchmarking Predictive Risk Models for Emergency Departments with Large Public Electronic Health Records
-
Alon Dagan
|
An-Kwok Ian Wong
|
Bibhas Chakraborty
|
Fei Gao
|
Feng Xie
|
Jin Wee Lee
|
Jun Zhou
|
Logasan S/O Rajnthern
|
Marcel Lucas Chee
|
Marcus Eng Hock Ong
|
Mingrui Tan
|
Nan Liu
|
Siqi Li
-
Centre for Quantitative Medicine and Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore, Singapore
|
School of Electrical and Electronic Engineering, Nanyang Technological University,Singapore, Singapore
|
Xie, F., Zhou, J., Lee, J.W., Tan, M., Li, S., Rajnthern, L.S., Chee, M.L., Chakraborty, B., Wong, A.I., Dagan, A., Ong, M.E., Gao, F., & Liu, N. (2021)
There is a continuously growing demand for emergency department (ED) services across the world, especially under the COVID-19 pandemic. Risk triaging plays a crucial role in prioritizing limited medical resources for patients who need them most. Recently the pervasive use of Electronic Health Records (EHR) has generated a large volume of stored data, accompanied by vast opportunities for the development of predictive models which could improve emergency care. However, there is an absence of widely accepted ED benchmarks based on large-scale public EHR, which new researchers could easily access. Success in filling in this gap could enable researchers to start studies on ED more quickly and conveniently without verbose data preprocessing and facilitate comparisons among different studies and methodologies. In this paper, based on the Medical Information Mart for Intensive Care IV Emergency Department (MIMIC-IV-ED) database, we proposed a public ED benchmark suite and obtained a benchmark dataset containing over 500,000 ED visits episodes from 2011 to 2019. Three ED-based prediction tasks (hospitalization, critical outcomes, and 72-hour ED revisit) were introduced, where various popular methodologies, from machine learning methods to clinical scoring systems, were implemented. The results of their performance were evaluated and compared. Model codes are open-source so that anyone with access to MIMIC-IV-ED could follow the
same steps of data processing, build the benchmarks, and reproduce the experiments. This study provided insights, suggestions, as well as protocols for future researchers to process the raw data and quickly build up models for emergency care.
Input variables : MIMIC-IV ED Dataset
Output Variables : Prediction of Hospitalization, Critical Outcomes, 72 hr ED Revisits
Metrics to Monitor
Statistical
|
:
|
Somers D |
Accuracy |
Precision and Recall |
Confusion Matrix |
F1 Score |
Roc and Auc |
Prevalence |
Detection Rate |
Balanced Accuracy |
Cohen's Kappa |
Concordance |
Gini Coefficent |
KS Statistic |
Youden's J Index
|
Business
|
:
|
Population at High Risk of Disease |
Risk by Geography |
Risk by Demographics |
Risk by Clinical Parameters |
Optimized Hospital Resource Utilization |
Decreased Cost of Care |
Decreased Patient Visits
|
Infrastructure
|
:
|
Log Bytes |
Logging/User/IAMPolicy |
Logging/User/VPN |
CPU Utilization |
Memory Usage |
Error Count |
Prediction Count |
Prediction Latencies |
Private Endpoint Prediction Latencies |
Private Endpoint Response Count
|
Visit Model :
arxiv.org