Information extraction for populating lung cancer clinical research data

Liwei Wang, Lei Luo, Yanshan Wang, Jason A. Wampfler, Ping Yang, Hongfang Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Lung cancer is the second most common cancer and the wide adoption of electronic health records (EHRs) offers a potential of accelerating cohort-related epidemiological studies using informatics approaches. In this study, we developed and evaluated a natural language processing (NLP) system to extract information on stage, histology, grade and therapies (chemotherapy, radiotherapy and surgery) automatically for lung cancer patients from clinical narratives including clinical notes, pathology reports and surgery reports. Evaluation showed promising results with the recalls for stage, histology, grade, and therapies achieving 89%, 98%, 80%, and 100% respectively and the precisions were 71%, 89%, 90%, and 100% respectively. This study demonstrated the feasibility and accuracy of extracting related information from clinical narratives for lung cancer research.

Original languageEnglish (US)
Title of host publication2019 IEEE International Conference on Healthcare Informatics, ICHI 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781538691380
DOIs
StatePublished - Jun 2019
Event7th IEEE International Conference on Healthcare Informatics, ICHI 2019 - Xi'an, China
Duration: Jun 10 2019Jun 13 2019

Publication series

Name2019 IEEE International Conference on Healthcare Informatics, ICHI 2019

Conference

Conference7th IEEE International Conference on Healthcare Informatics, ICHI 2019
CountryChina
CityXi'an
Period6/10/196/13/19

Fingerprint

Histology
Information Storage and Retrieval
Surgery
Lung Neoplasms
Natural language processing systems
Chemotherapy
Radiotherapy
Pathology
Research
Natural Language Processing
Informatics
Clinical Pathology
Electronic Health Records
Health
Feasibility Studies
Epidemiologic Studies
Drug Therapy
Therapeutics
Neoplasms

Keywords

  • Grade
  • Histology
  • Lung cancer
  • Natural language processing
  • Stage
  • Treatments

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Health Informatics
  • Biomedical Engineering

Cite this

Wang, L., Luo, L., Wang, Y., Wampfler, J. A., Yang, P., & Liu, H. (2019). Information extraction for populating lung cancer clinical research data. In 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019 [8904601] (2019 IEEE International Conference on Healthcare Informatics, ICHI 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICHI.2019.8904601

Information extraction for populating lung cancer clinical research data. / Wang, Liwei; Luo, Lei; Wang, Yanshan; Wampfler, Jason A.; Yang, Ping; Liu, Hongfang.

2019 IEEE International Conference on Healthcare Informatics, ICHI 2019. Institute of Electrical and Electronics Engineers Inc., 2019. 8904601 (2019 IEEE International Conference on Healthcare Informatics, ICHI 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wang, L, Luo, L, Wang, Y, Wampfler, JA, Yang, P & Liu, H 2019, Information extraction for populating lung cancer clinical research data. in 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019., 8904601, 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019, Institute of Electrical and Electronics Engineers Inc., 7th IEEE International Conference on Healthcare Informatics, ICHI 2019, Xi'an, China, 6/10/19. https://doi.org/10.1109/ICHI.2019.8904601
Wang L, Luo L, Wang Y, Wampfler JA, Yang P, Liu H. Information extraction for populating lung cancer clinical research data. In 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019. Institute of Electrical and Electronics Engineers Inc. 2019. 8904601. (2019 IEEE International Conference on Healthcare Informatics, ICHI 2019). https://doi.org/10.1109/ICHI.2019.8904601
Wang, Liwei ; Luo, Lei ; Wang, Yanshan ; Wampfler, Jason A. ; Yang, Ping ; Liu, Hongfang. / Information extraction for populating lung cancer clinical research data. 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019. Institute of Electrical and Electronics Engineers Inc., 2019. (2019 IEEE International Conference on Healthcare Informatics, ICHI 2019).
@inproceedings{428e34d3ee1a430996963891bca64317,
title = "Information extraction for populating lung cancer clinical research data",
abstract = "Lung cancer is the second most common cancer and the wide adoption of electronic health records (EHRs) offers a potential of accelerating cohort-related epidemiological studies using informatics approaches. In this study, we developed and evaluated a natural language processing (NLP) system to extract information on stage, histology, grade and therapies (chemotherapy, radiotherapy and surgery) automatically for lung cancer patients from clinical narratives including clinical notes, pathology reports and surgery reports. Evaluation showed promising results with the recalls for stage, histology, grade, and therapies achieving 89{\%}, 98{\%}, 80{\%}, and 100{\%} respectively and the precisions were 71{\%}, 89{\%}, 90{\%}, and 100{\%} respectively. This study demonstrated the feasibility and accuracy of extracting related information from clinical narratives for lung cancer research.",
keywords = "Grade, Histology, Lung cancer, Natural language processing, Stage, Treatments",
author = "Liwei Wang and Lei Luo and Yanshan Wang and Wampfler, {Jason A.} and Ping Yang and Hongfang Liu",
year = "2019",
month = "6",
doi = "10.1109/ICHI.2019.8904601",
language = "English (US)",
series = "2019 IEEE International Conference on Healthcare Informatics, ICHI 2019",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "2019 IEEE International Conference on Healthcare Informatics, ICHI 2019",

}

TY - GEN

T1 - Information extraction for populating lung cancer clinical research data

AU - Wang, Liwei

AU - Luo, Lei

AU - Wang, Yanshan

AU - Wampfler, Jason A.

AU - Yang, Ping

AU - Liu, Hongfang

PY - 2019/6

Y1 - 2019/6

N2 - Lung cancer is the second most common cancer and the wide adoption of electronic health records (EHRs) offers a potential of accelerating cohort-related epidemiological studies using informatics approaches. In this study, we developed and evaluated a natural language processing (NLP) system to extract information on stage, histology, grade and therapies (chemotherapy, radiotherapy and surgery) automatically for lung cancer patients from clinical narratives including clinical notes, pathology reports and surgery reports. Evaluation showed promising results with the recalls for stage, histology, grade, and therapies achieving 89%, 98%, 80%, and 100% respectively and the precisions were 71%, 89%, 90%, and 100% respectively. This study demonstrated the feasibility and accuracy of extracting related information from clinical narratives for lung cancer research.

AB - Lung cancer is the second most common cancer and the wide adoption of electronic health records (EHRs) offers a potential of accelerating cohort-related epidemiological studies using informatics approaches. In this study, we developed and evaluated a natural language processing (NLP) system to extract information on stage, histology, grade and therapies (chemotherapy, radiotherapy and surgery) automatically for lung cancer patients from clinical narratives including clinical notes, pathology reports and surgery reports. Evaluation showed promising results with the recalls for stage, histology, grade, and therapies achieving 89%, 98%, 80%, and 100% respectively and the precisions were 71%, 89%, 90%, and 100% respectively. This study demonstrated the feasibility and accuracy of extracting related information from clinical narratives for lung cancer research.

KW - Grade

KW - Histology

KW - Lung cancer

KW - Natural language processing

KW - Stage

KW - Treatments

UR - http://www.scopus.com/inward/record.url?scp=85075930748&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85075930748&partnerID=8YFLogxK

U2 - 10.1109/ICHI.2019.8904601

DO - 10.1109/ICHI.2019.8904601

M3 - Conference contribution

AN - SCOPUS:85075930748

T3 - 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019

BT - 2019 IEEE International Conference on Healthcare Informatics, ICHI 2019

PB - Institute of Electrical and Electronics Engineers Inc.

ER -