TY - JOUR
T1 - Subphenotyping depression using machine learning and electronic health records
AU - Xu, Zhenxing
AU - Wang, Fei
AU - Adekkanattu, Prakash
AU - Bose, Budhaditya
AU - Vekaria, Veer
AU - Brandt, Pascal
AU - Jiang, Guoqian
AU - Kiefer, Richard C.
AU - Luo, Yuan
AU - Pacheco, Jennifer A.
AU - Rasmussen, Luke V.
AU - Xu, Jie
AU - Alexopoulos, George
AU - Pathak, Jyotishman
N1 - Funding Information:
This work was supported in part by funding from NIH R01GM105688, R01MH105384, R01MH119177 and P50MH113838.
Publisher Copyright:
© 2020 The Authors. Learning Health Systems published by Wiley Periodicals LLC on behalf of University of Michigan.
PY - 2020/10/1
Y1 - 2020/10/1
N2 - Objective: To identify depression subphenotypes from Electronic Health Records (EHRs) using machine learning methods, and analyze their characteristics with respect to patient demographics, comorbidities, and medications. Materials and Methods: Using EHRs from the INSIGHT Clinical Research Network (CRN) database, multiple machine learning (ML) algorithms were applied to analyze 11 275 patients with depression to discern depression subphenotypes with distinct characteristics. Results: Using the computational approaches, we derived three depression subphenotypes: Phenotype_A (n = 2791; 31.35%) included patients who were the oldest (mean (SD) age, 72.55 (14.93) years), had the most comorbidities, and took the most medications. The most common comorbidities in this cluster of patients were hyperlipidemia, hypertension, and diabetes. Phenotype_B (mean (SD) age, 68.44 (19.09) years) was the largest cluster (n = 4687; 52.65%), and included patients suffering from moderate loss of body function. Asthma, fibromyalgia, and Chronic Pain and Fatigue (CPF) were common comorbidities in this subphenotype. Phenotype_C (n = 1452; 16.31%) included patients who were younger (mean (SD) age, 63.47 (18.81) years), had the fewest comorbidities, and took fewer medications. Anxiety and tobacco use were common comorbidities in this subphenotype. Conclusion: Computationally deriving depression subtypes can provide meaningful insights and improve understanding of depression as a heterogeneous disorder. Further investigation is needed to assess the utility of these derived phenotypes to inform clinical trial design and interpretation in routine patient care.
AB - Objective: To identify depression subphenotypes from Electronic Health Records (EHRs) using machine learning methods, and analyze their characteristics with respect to patient demographics, comorbidities, and medications. Materials and Methods: Using EHRs from the INSIGHT Clinical Research Network (CRN) database, multiple machine learning (ML) algorithms were applied to analyze 11 275 patients with depression to discern depression subphenotypes with distinct characteristics. Results: Using the computational approaches, we derived three depression subphenotypes: Phenotype_A (n = 2791; 31.35%) included patients who were the oldest (mean (SD) age, 72.55 (14.93) years), had the most comorbidities, and took the most medications. The most common comorbidities in this cluster of patients were hyperlipidemia, hypertension, and diabetes. Phenotype_B (mean (SD) age, 68.44 (19.09) years) was the largest cluster (n = 4687; 52.65%), and included patients suffering from moderate loss of body function. Asthma, fibromyalgia, and Chronic Pain and Fatigue (CPF) were common comorbidities in this subphenotype. Phenotype_C (n = 1452; 16.31%) included patients who were younger (mean (SD) age, 63.47 (18.81) years), had the fewest comorbidities, and took fewer medications. Anxiety and tobacco use were common comorbidities in this subphenotype. Conclusion: Computationally deriving depression subtypes can provide meaningful insights and improve understanding of depression as a heterogeneous disorder. Further investigation is needed to assess the utility of these derived phenotypes to inform clinical trial design and interpretation in routine patient care.
KW - depression
KW - electronic health records
KW - machine learning
KW - phenotyping
UR - http://www.scopus.com/inward/record.url?scp=85088824128&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088824128&partnerID=8YFLogxK
U2 - 10.1002/lrh2.10241
DO - 10.1002/lrh2.10241
M3 - Article
AN - SCOPUS:85088824128
SN - 2379-6146
VL - 4
JO - Learning Health Systems
JF - Learning Health Systems
IS - 4
M1 - e10241
ER -