A multicenter evaluation of computable phenotyping approaches for SARS-CoV-2 infection and COVID-19 hospitalizations

Rohan Khera; Bobak J. Mortazavi; Veer Sangha; Frederick Warner; H. Patrick Young; Joseph S. Ross; Nilay D. Shah; Elitza S. Theel; William G. Jenkinson; Camille Knepper; Karen Wang; David Peaper; Richard A. Martinello; Cynthia A. Brandt; Zhenqiu Lin; Albert I. Ko; Harlan M. Krumholz; Benjamin D. Pollock; Wade L. Schulz

doi:10.1038/s41746-022-00570-4

Research output: Contribution to journal › Article › peer-review

Abstract

Diagnosis codes are used to study SARS-CoV-2 infections and COVID-19 hospitalizations in administrative and electronic health record (EHR) data. Using EHR data (April 2020–March 2021) at the Yale-New Haven Health System and the three hospital systems of the Mayo Clinic, computable phenotype definitions based on the ICD-10 diagnosis code for COVID-19 (U07.1) were evaluated against positive SARS-CoV-2 PCR or antigen tests. We included 69,423 patients at Yale and 75,748 at Mayo Clinic with either a diagnosis code or a positive SARS-CoV-2 test. The precision and recall of a COVID-19 diagnosis for a positive test were 68.8% and 83.3%, respectively, at Yale, with higher precision (95%) and lower recall (63.5%) at Mayo Clinic, ranging from 59.2% in Rochester to 97.3% in Arizona. For hospitalizations with a principal COVID-19 diagnosis, 94.8% at Yale and 80.5% at Mayo Clinic had an associated positive laboratory test, with a secondary diagnosis of COVID-19 identifying additional patients. These patients had twofold higher in-hospital mortality than those identified by a principal diagnosis. Standardization of coding practices is needed before the use of diagnosis codes in clinical research and epidemiological surveillance of COVID-19.

Original language	English (US)
Article number	27
Journal	npj Digital Medicine
Volume	5
Issue number	1
DOIs	https://doi.org/10.1038/s41746-022-00570-4
State	Published - Dec 2022

ASJC Scopus subject areas

Medicine (miscellaneous)
Health Informatics
Computer Science Applications
Health Information Management

Cite this

Khera, R., Mortazavi, B. J., Sangha, V., Warner, F., Young, H. P., Ross, J. S., Shah, N. D., Theel, E. S., Jenkinson, W. G., Knepper, C., Wang, K., Peaper, D., Martinello, R. A., Brandt, C. A., Lin, Z., Ko, A. I., Krumholz, H. M., Pollock, B. D., & Schulz, W. L. (2022). A multicenter evaluation of computable phenotyping approaches for SARS-CoV-2 infection and COVID-19 hospitalizations. npj Digital Medicine, 5(1), Article 27. https://doi.org/10.1038/s41746-022-00570-4

Khera, R, Mortazavi, BJ, Sangha, V, Warner, F, Young, HP, Ross, JS, Shah, ND, Theel, ES, Jenkinson, WG, Knepper, C, Wang, K, Peaper, D, Martinello, RA, Brandt, CA, Lin, Z, Ko, AI, Krumholz, HM, Pollock, BD & Schulz, WL 2022, 'A multicenter evaluation of computable phenotyping approaches for SARS-CoV-2 infection and COVID-19 hospitalizations', npj Digital Medicine, vol. 5, no. 1, 27. https://doi.org/10.1038/s41746-022-00570-4

@article{1bab097003ec402786cb67e9c2af736f,
title = "A multicenter evaluation of computable phenotyping approaches for SARS-CoV-2 infection and COVID-19 hospitalizations",
abstract = "Diagnosis codes are used to study SARS-CoV-2 infections and COVID-19 hospitalizations in administrative and electronic health record (EHR) data. Using EHR data (April 2020–March 2021) at the Yale-New Haven Health System and the three hospital systems of the Mayo Clinic, computable phenotype definitions based on the ICD-10 diagnosis code for COVID-19 (U07.1) were evaluated against positive SARS-CoV-2 PCR or antigen tests. We included 69,423 patients at Yale and 75,748 at Mayo Clinic with either a diagnosis code or a positive SARS-CoV-2 test. The precision and recall of a COVID-19 diagnosis for a positive test were 68.8% and 83.3%, respectively, at Yale, with higher precision (95%) and lower recall (63.5%) at Mayo Clinic, ranging from 59.2% in Rochester to 97.3% in Arizona. For hospitalizations with a principal COVID-19 diagnosis, 94.8% at Yale and 80.5% at Mayo Clinic had an associated positive laboratory test, with a secondary diagnosis of COVID-19 identifying additional patients. These patients had twofold higher in-hospital mortality than those identified by a principal diagnosis. Standardization of coding practices is needed before the use of diagnosis codes in clinical research and epidemiological surveillance of COVID-19.",
author = "Rohan Khera and Mortazavi, {Bobak J.} and Veer Sangha and Frederick Warner and Young, {H. Patrick} and Ross, {Joseph S.} and Shah, {Nilay D.} and Theel, {Elitza S.} and Jenkinson, {William G.} and Camille Knepper and Karen Wang and David Peaper and Martinello, {Richard A.} and Brandt, {Cynthia A.} and Zhenqiu Lin and Ko, {Albert I.} and Krumholz, {Harlan M.} and Pollock, {Benjamin D.} and Schulz, {Wade L.}",
note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
year = "2022",
month = dec,
doi = "10.1038/s41746-022-00570-4",
language = "English (US)",
volume = "5",
journal = "npj Digital Medicine",
issn = "2398-6352",
publisher = "Nature Publishing Group",
number = "1",
}

TY  - JOUR
T1  - A multicenter evaluation of computable phenotyping approaches for SARS-CoV-2 infection and COVID-19 hospitalizations
AU  - Khera, Rohan
AU  - Mortazavi, Bobak J.
AU  - Sangha, Veer
AU  - Warner, Frederick
AU  - Young, H. Patrick
AU  - Ross, Joseph S.
AU  - Shah, Nilay D.
AU  - Theel, Elitza S.
AU  - Jenkinson, William G.
AU  - Knepper, Camille
AU  - Wang, Karen
AU  - Peaper, David
AU  - Martinello, Richard A.
AU  - Brandt, Cynthia A.
AU  - Lin, Zhenqiu
AU  - Ko, Albert I.
AU  - Krumholz, Harlan M.
AU  - Pollock, Benjamin D.
AU  - Schulz, Wade L.
PY  - 2022/12
Y1  - 2022/12
N2  - Diagnosis codes are used to study SARS-CoV-2 infections and COVID-19 hospitalizations in administrative and electronic health record (EHR) data. Using EHR data (April 2020–March 2021) at the Yale-New Haven Health System and the three hospital systems of the Mayo Clinic, computable phenotype definitions based on the ICD-10 diagnosis code for COVID-19 (U07.1) were evaluated against positive SARS-CoV-2 PCR or antigen tests. We included 69,423 patients at Yale and 75,748 at Mayo Clinic with either a diagnosis code or a positive SARS-CoV-2 test. The precision and recall of a COVID-19 diagnosis for a positive test were 68.8% and 83.3%, respectively, at Yale, with higher precision (95%) and lower recall (63.5%) at Mayo Clinic, ranging from 59.2% in Rochester to 97.3% in Arizona. For hospitalizations with a principal COVID-19 diagnosis, 94.8% at Yale and 80.5% at Mayo Clinic had an associated positive laboratory test, with a secondary diagnosis of COVID-19 identifying additional patients. These patients had twofold higher in-hospital mortality than those identified by a principal diagnosis. Standardization of coding practices is needed before the use of diagnosis codes in clinical research and epidemiological surveillance of COVID-19.
AB  - Diagnosis codes are used to study SARS-CoV-2 infections and COVID-19 hospitalizations in administrative and electronic health record (EHR) data. Using EHR data (April 2020–March 2021) at the Yale-New Haven Health System and the three hospital systems of the Mayo Clinic, computable phenotype definitions based on the ICD-10 diagnosis code for COVID-19 (U07.1) were evaluated against positive SARS-CoV-2 PCR or antigen tests. We included 69,423 patients at Yale and 75,748 at Mayo Clinic with either a diagnosis code or a positive SARS-CoV-2 test. The precision and recall of a COVID-19 diagnosis for a positive test were 68.8% and 83.3%, respectively, at Yale, with higher precision (95%) and lower recall (63.5%) at Mayo Clinic, ranging from 59.2% in Rochester to 97.3% in Arizona. For hospitalizations with a principal COVID-19 diagnosis, 94.8% at Yale and 80.5% at Mayo Clinic had an associated positive laboratory test, with a secondary diagnosis of COVID-19 identifying additional patients. These patients had twofold higher in-hospital mortality than those identified by a principal diagnosis. Standardization of coding practices is needed before the use of diagnosis codes in clinical research and epidemiological surveillance of COVID-19.
UR  - http://www.scopus.com/inward/record.url?scp=85126232048&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=85126232048&partnerID=8YFLogxK
U2  - 10.1038/s41746-022-00570-4
DO  - 10.1038/s41746-022-00570-4
M3  - Article
AN  - SCOPUS:85126232048
SN  - 2398-6352
VL  - 5
JO  - npj Digital Medicine
JF  - npj Digital Medicine
IS  - 1
M1  - 27
ER  -
