TY - JOUR
T1 - Leveraging electronic health records for predictive modeling of post-surgical complications
AU - Weller, Grant B.
AU - Lovely, Jenna
AU - Larson, David
AU - Earnshaw, Berton A.
AU - Huebner, Marianne
PY - 2017/1/1
Y1 - 2017/1/1
N2 - Hospital-specific electronic health record systems are used to inform clinical practice about best practices and quality improvements. Many surgical centers have developed deterministic clinical decision rules to discover adverse events (e.g. postoperative complications) using electronic health record data. However, these data provide opportunities to use probabilistic methods for early prediction of adverse health events, which may be more informative than deterministic algorithms. Electronic health record data from a set of 9598 colorectal surgery cases from 2010 to 2014 were used to predict the occurrence of selected complications including surgical site infection, ileus, and bleeding. Consistent with previous studies, we find a high rate of missing values for both covariates and complication information (4–90%). Several machine learning classification methods are trained on an 80% random sample of cases and tested on a remaining holdout set. Predictive performance varies by complication, although an area under the receiver operating characteristic curve as high as 0.86 on testing data was achieved for bleeding complications, and accuracy for all complications compares favorably to existing clinical decision rules. Our results confirm that electronic health records provide opportunities for improved risk prediction of surgical complications; however, consideration of data quality and consistency standards is an important step in predictive modeling with such data.
AB - Hospital-specific electronic health record systems are used to inform clinical practice about best practices and quality improvements. Many surgical centers have developed deterministic clinical decision rules to discover adverse events (e.g. postoperative complications) using electronic health record data. However, these data provide opportunities to use probabilistic methods for early prediction of adverse health events, which may be more informative than deterministic algorithms. Electronic health record data from a set of 9598 colorectal surgery cases from 2010 to 2014 were used to predict the occurrence of selected complications including surgical site infection, ileus, and bleeding. Consistent with previous studies, we find a high rate of missing values for both covariates and complication information (4–90%). Several machine learning classification methods are trained on an 80% random sample of cases and tested on a remaining holdout set. Predictive performance varies by complication, although an area under the receiver operating characteristic curve as high as 0.86 on testing data was achieved for bleeding complications, and accuracy for all complications compares favorably to existing clinical decision rules. Our results confirm that electronic health records provide opportunities for improved risk prediction of surgical complications; however, consideration of data quality and consistency standards is an important step in predictive modeling with such data.
KW - clinical decision rules
KW - data mining
KW - electronic health data
KW - predictive modeling
KW - regularization
KW - statistical machine learning
UR - http://www.scopus.com/inward/record.url?scp=85044096079&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85044096079&partnerID=8YFLogxK
U2 - 10.1177/0962280217696115
DO - 10.1177/0962280217696115
M3 - Article
C2 - 29298612
AN - SCOPUS:85044096079
SN - 0962-2802
JO - Statistical Methods in Medical Research
JF - Statistical Methods in Medical Research
ER -