Information extraction for populating lung cancer clinical research data

Liwei Wang, Lei Luo, Yanshan Wang, Jason A. Wampfler, Ping Yang, Hongfang Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Lung cancer is the second most common cancer and the wide adoption of electronic health records (EHRs) offers a potential of accelerating cohort-related epidemiological studies using informatics approaches. In this study, we developed and evaluated a natural language processing (NLP) system to extract information on stage, histology, grade and therapies (chemotherapy, radiotherapy and surgery) automatically for lung cancer patients from clinical narratives including clinical notes, pathology reports and surgery reports. Evaluation showed promising results with the recalls for stage, histology, grade, and therapies achieving 89%, 98%, 80%, and 100% respectively and the precisions were 71%, 89%, 90%, and 100% respectively. This study demonstrated the feasibility and accuracy of extracting related information from clinical narratives for lung cancer research.

Original languageEnglish (US)
Title of host publication2019 IEEE International Conference on Healthcare Informatics, ICHI 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781538691380
DOIs
StatePublished - Jun 2019
Event7th IEEE International Conference on Healthcare Informatics, ICHI 2019 - Xi'an, China
Duration: Jun 10 2019Jun 13 2019

Publication series

Name2019 IEEE International Conference on Healthcare Informatics, ICHI 2019

Conference

Conference7th IEEE International Conference on Healthcare Informatics, ICHI 2019
Country/TerritoryChina
CityXi'an
Period6/10/196/13/19

Keywords

  • Grade
  • Histology
  • Lung cancer
  • Natural language processing
  • Stage
  • Treatments

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Health Informatics
  • Biomedical Engineering

Fingerprint

Dive into the research topics of 'Information extraction for populating lung cancer clinical research data'. Together they form a unique fingerprint.

Cite this