Application of a natural language processing algorithm to asthma ascertainment: An automated chart review

Chung Il Wi; Sunghwan Sohn; Mary C. Rolfes; Alicia Seabright; Euijung Ryu; Gretchen Voge; Kay A. Bachman; Miguel A. Park; Hirohito Kita; Ivana T. Croghan; Hongfang Liu; Young J. Juhn

doi:10.1164/rccm.201610-2006OC

Research output: Contribution to journal › Article › peer-review

29 Scopus citations

Abstract

Rationale: Difficulty of asthma ascertainment and its associated methodologic heterogeneity have created significant barriers to asthma care and research. Objectives: We evaluated the validity of an existing natural language processing (NLP) algorithm for asthma criteria to enable an automated chart review using electronic medical records (EMRs). Methods: The study was designed as a retrospective birth cohort study using a random sample of 500 subjects from the 1997-2007 Mayo Birth Cohort who were born at Mayo Clinic and enrolled in primary pediatric care at Mayo Clinic Rochester. Performance of NLP-based asthma ascertainment using predetermined asthma criteria was assessed by determining both criterion validity (chart review of EMRs by an abstractor as the gold standard) and construct validity (association with known risk factors for asthma, such as allergic rhinitis). Measurements and Main Results: After excluding three subjects whose respiratory symptoms could be attributed to other conditions (e.g., tracheomalacia), among the remaining 497 eligible subjects, 51% were male, 77% were white, and the median age at the last follow-up date was 11.5 years. The asthma prevalence was 31% in the study cohort. Sensitivity, specificity, positive predictive value, and negative predictive value for the NLP algorithm in predicting asthma status were 97%, 95%, 90%, and 98%, respectively. The risk factors for asthma (e.g., allergic rhinitis) identified by NLP and by the abstractor were the same. Conclusions: Asthma ascertainment through NLP should be considered in the era of EMRs because it can enable large-scale clinical studies in a more time-efficient manner and improve the recognition and care of childhood asthma in practice.
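
The four accuracy figures in the abstract are linked through the cohort's asthma prevalence. As a rough illustration only (not the authors' analysis code), the sketch below derives positive and negative predictive values from the reported sensitivity (97%), specificity (95%), and prevalence (31%) using Bayes' rule. The function name `predictive_values` is hypothetical, and the inputs are the rounded percentages from the abstract rather than the underlying counts, which are not given here.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a binary classifier at a given prevalence.

    All arguments are proportions in [0, 1]. The four terms below are the
    expected cell proportions of a 2x2 confusion table.
    """
    true_pos = sensitivity * prevalence
    false_neg = (1.0 - sensitivity) * prevalence
    true_neg = specificity * (1.0 - prevalence)
    false_pos = (1.0 - specificity) * (1.0 - prevalence)

    ppv = true_pos / (true_pos + false_pos)   # P(asthma | NLP positive)
    npv = true_neg / (true_neg + false_neg)   # P(no asthma | NLP negative)
    return ppv, npv


# Rounded values taken from the abstract (assumption: used as exact inputs).
ppv, npv = predictive_values(sensitivity=0.97, specificity=0.95, prevalence=0.31)
print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
```

With these rounded inputs the derived values fall within about one percentage point of the reported 90% and 98%. Any small gap is plausibly due to rounding in the published figures, since the exact counts are not available in this record.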

Original language	English (US)
Pages (from-to)	430-437
Number of pages	8
Journal	American Journal of Respiratory and Critical Care Medicine
Volume	196
Issue number	4
DOIs	https://doi.org/10.1164/rccm.201610-2006OC
State	Published - Aug 15 2017

Keywords

Electronic medical records
Informatics
Retrospective study

ASJC Scopus subject areas

Pulmonary and Respiratory Medicine
Critical Care and Intensive Care Medicine

Cite this

Wi, C. I., Sohn, S., Rolfes, M. C., Seabright, A., Ryu, E., Voge, G., Bachman, K. A., Park, M. A., Kita, H., Croghan, I. T., Liu, H., & Juhn, Y. J. (2017). Application of a natural language processing algorithm to asthma ascertainment: An automated chart review. American Journal of Respiratory and Critical Care Medicine, 196(4), 430-437. https://doi.org/10.1164/rccm.201610-2006OC

Wi, CI, Sohn, S, Rolfes, MC, Seabright, A, Ryu, E, Voge, G, Bachman, KA, Park, MA, Kita, H, Croghan, IT, Liu, H & Juhn, YJ 2017, 'Application of a natural language processing algorithm to asthma ascertainment: An automated chart review', American Journal of Respiratory and Critical Care Medicine, vol. 196, no. 4, pp. 430-437. https://doi.org/10.1164/rccm.201610-2006OC

@article{77b242a79f62491b887c487f01b658ea,

title = "Application of a natural language processing algorithm to asthma ascertainment: An automated chart review",

abstract = "Rationale: Difficulty of asthma ascertainment and its associated methodologic heterogeneity have created significant barriers to asthma care and research. Objectives: We evaluated the validity of an existing natural language processing (NLP) algorithm for asthma criteria to enable an automated chart review using electronic medical records (EMRs). Methods: The study was designed as a retrospective birth cohort study using a random sample of 500 subjects from the 1997-2007 Mayo Birth Cohort who were born at Mayo Clinic and enrolled in primary pediatric care at Mayo Clinic Rochester. Performance of NLP-based asthma ascertainment using predetermined asthma criteria was assessed by determining both criterion validity (chart review of EMRs by abstractor as a gold standard) and construct validity (association with known risk factors for asthma, such as allergic rhinitis). Measurements and Main Results: After excluding three subjects whose respiratory symptoms could be attributed to other conditions (e.g., tracheomalacia), among the remaining eligible 497 subjects, 51% were male, 77% white persons, and the median age at last follow-up date was 11.5 years. The asthma prevalence was 31% in the study cohort. Sensitivity, specificity, positive predictive value, and negative predictive value for NLP algorithm in predicting asthma status were 97%, 95%, 90%, and 98%, respectively. The risk factors for asthma (e.g., allergic rhinitis) that were identified either by NLP or the abstractor were the same. Conclusions: Asthma ascertainment through NLP should be considered in the era of EMRs because it can enable large-scale clinical studies in a more time-efficient manner and improve the recognition and care of childhood asthma in practice.",

keywords = "Electronic medical records, Informatics, Retrospective study",

author = "Wi, {Chung Il} and Sunghwan Sohn and Rolfes, {Mary C.} and Alicia Seabright and Euijung Ryu and Gretchen Voge and Bachman, {Kay A.} and Park, {Miguel A.} and Hirohito Kita and Croghan, {Ivana T.} and Hongfang Liu and Juhn, {Young J.}",

note = "Publisher Copyright: Copyright {\textcopyright} 2017 by the American Thoracic Society.",

year = "2017",

month = aug,

day = "15",

doi = "10.1164/rccm.201610-2006OC",

language = "English (US)",

volume = "196",

pages = "430--437",

journal = "American Journal of Respiratory and Critical Care Medicine",

issn = "1073-449X",

publisher = "American Thoracic Society",

number = "4",

}

TY - JOUR

T1 - Application of a natural language processing algorithm to asthma ascertainment

T2 - An automated chart review

AU - Wi, Chung Il

AU - Sohn, Sunghwan

AU - Rolfes, Mary C.

AU - Seabright, Alicia

AU - Ryu, Euijung

AU - Voge, Gretchen

AU - Bachman, Kay A.

AU - Park, Miguel A.

AU - Kita, Hirohito

AU - Croghan, Ivana T.

AU - Liu, Hongfang

AU - Juhn, Young J.

PY - 2017/8/15

Y1 - 2017/8/15

N2 - Rationale: Difficulty of asthma ascertainment and its associated methodologic heterogeneity have created significant barriers to asthma care and research. Objectives: We evaluated the validity of an existing natural language processing (NLP) algorithm for asthma criteria to enable an automated chart review using electronic medical records (EMRs). Methods: The study was designed as a retrospective birth cohort study using a random sample of 500 subjects from the 1997-2007 Mayo Birth Cohort who were born at Mayo Clinic and enrolled in primary pediatric care at Mayo Clinic Rochester. Performance of NLP-based asthma ascertainment using predetermined asthma criteria was assessed by determining both criterion validity (chart review of EMRs by abstractor as a gold standard) and construct validity (association with known risk factors for asthma, such as allergic rhinitis). Measurements and Main Results: After excluding three subjects whose respiratory symptoms could be attributed to other conditions (e.g., tracheomalacia), among the remaining eligible 497 subjects, 51% were male, 77% white persons, and the median age at last follow-up date was 11.5 years. The asthma prevalence was 31% in the study cohort. Sensitivity, specificity, positive predictive value, and negative predictive value for NLP algorithm in predicting asthma status were 97%, 95%, 90%, and 98%, respectively. The risk factors for asthma (e.g., allergic rhinitis) that were identified either by NLP or the abstractor were the same. Conclusions: Asthma ascertainment through NLP should be considered in the era of EMRs because it can enable large-scale clinical studies in a more time-efficient manner and improve the recognition and care of childhood asthma in practice.

KW - Electronic medical records

KW - Informatics

KW - Retrospective study

UR - http://www.scopus.com/inward/record.url?scp=85028622979&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85028622979&partnerID=8YFLogxK

U2 - 10.1164/rccm.201610-2006OC

DO - 10.1164/rccm.201610-2006OC

M3 - Article

C2 - 28375665

AN - SCOPUS:85028622979

SN - 1073-449X

VL - 196

SP - 430

EP - 437

JO - American Journal of Respiratory and Critical Care Medicine

JF - American Journal of Respiratory and Critical Care Medicine

IS - 4

ER -
