MedXN: An open source medication extraction and normalization tool for clinical text

Sunghwan Sohn; Cheryl Clark; Scott R. Halgrim; Sean P. Murphy; Christopher G. Chute; Hongfang Liu

doi:10.1136/amiajnl-2013-002190

MedXN: An open source medication extraction and normalization tool for clinical text

Sunghwan Sohn, Cheryl Clark, Scott R. Halgrim, Sean P. Murphy, Christopher G. Chute, Hongfang Liu

Research output: Contribution to journal › Article › peer-review

42 Scopus citations

Abstract

Objective: We developed the Medication Extraction and Normalization (MedXN) system to extract comprehensive medication information and normalize it to the most appropriate RxNorm concept unique identifier (RxCUI) as specifically as possible. Methods Medication: descriptions in clinical notes were decomposed into medication name and attributes, which were separately extracted using RxNorm dictionary lookup and regular expression. Then, each medication name and its attributes were combined together according to RxNorm convention to find the most appropriate RxNorm representation. To do this, we employed serialized hierarchical steps implemented in Apache's Unstructured Information Management Architecture. We also performed synonym expansion, removed false medications, and employed inference rules to improve the medication extraction and normalization performance. Results: An evaluation on test data of 397 medication mentions showed F-measures of 0.975 for medication name and over 0.90 for most attributes. The RxCUI assignment produced F-measures of 0.932 for medication name and 0.864 for full medication information. Most false negative RxCUI assignments in full medication information are due to human assumption of missing attributes and medication names in the gold standard. Conclusions: The MedXN system (http://sourceforge. net/projects/ohnlp/files/MedXN/) was able to extract comprehensive medication information with high accuracy and demonstrated good normalization capability to RxCUI as long as explicit evidence existed. More sophisticated inference rules might result in further improvements to specific RxCUI assignments for incomplete medication descriptions.

Original language	English (US)
Pages (from-to)	858-865
Number of pages	8
Journal	Journal of the American Medical Informatics Association
Volume	21
Issue number	5
DOIs	https://doi.org/10.1136/amiajnl-2013-002190
State	Published - 2014

ASJC Scopus subject areas

Health Informatics

Access to Document

10.1136/amiajnl-2013-002190

Cite this

@article{8e8be0144ff644c0b04b2138be5ee6e6,

title = "MedXN: An open source medication extraction and normalization tool for clinical text",

abstract = "Objective: We developed the Medication Extraction and Normalization (MedXN) system to extract comprehensive medication information and normalize it to the most appropriate RxNorm concept unique identifier (RxCUI) as specifically as possible. Methods Medication: descriptions in clinical notes were decomposed into medication name and attributes, which were separately extracted using RxNorm dictionary lookup and regular expression. Then, each medication name and its attributes were combined together according to RxNorm convention to find the most appropriate RxNorm representation. To do this, we employed serialized hierarchical steps implemented in Apache's Unstructured Information Management Architecture. We also performed synonym expansion, removed false medications, and employed inference rules to improve the medication extraction and normalization performance. Results: An evaluation on test data of 397 medication mentions showed F-measures of 0.975 for medication name and over 0.90 for most attributes. The RxCUI assignment produced F-measures of 0.932 for medication name and 0.864 for full medication information. Most false negative RxCUI assignments in full medication information are due to human assumption of missing attributes and medication names in the gold standard. Conclusions: The MedXN system (http://sourceforge. net/projects/ohnlp/files/MedXN/) was able to extract comprehensive medication information with high accuracy and demonstrated good normalization capability to RxCUI as long as explicit evidence existed. More sophisticated inference rules might result in further improvements to specific RxCUI assignments for incomplete medication descriptions.",

author = "Sunghwan Sohn and Cheryl Clark and Halgrim, {Scott R.} and Murphy, {Sean P.} and Chute, {Christopher G.} and Hongfang Liu",

year = "2014",

doi = "10.1136/amiajnl-2013-002190",

language = "English (US)",

volume = "21",

pages = "858--865",

journal = "Journal of the American Medical Informatics Association",

issn = "1067-5027",

publisher = "Oxford University Press",

number = "5",

}

TY - JOUR

T1 - MedXN

T2 - An open source medication extraction and normalization tool for clinical text

AU - Sohn, Sunghwan

AU - Clark, Cheryl

AU - Halgrim, Scott R.

AU - Murphy, Sean P.

AU - Chute, Christopher G.

AU - Liu, Hongfang

PY - 2014

Y1 - 2014

N2 - Objective: We developed the Medication Extraction and Normalization (MedXN) system to extract comprehensive medication information and normalize it to the most appropriate RxNorm concept unique identifier (RxCUI) as specifically as possible. Methods Medication: descriptions in clinical notes were decomposed into medication name and attributes, which were separately extracted using RxNorm dictionary lookup and regular expression. Then, each medication name and its attributes were combined together according to RxNorm convention to find the most appropriate RxNorm representation. To do this, we employed serialized hierarchical steps implemented in Apache's Unstructured Information Management Architecture. We also performed synonym expansion, removed false medications, and employed inference rules to improve the medication extraction and normalization performance. Results: An evaluation on test data of 397 medication mentions showed F-measures of 0.975 for medication name and over 0.90 for most attributes. The RxCUI assignment produced F-measures of 0.932 for medication name and 0.864 for full medication information. Most false negative RxCUI assignments in full medication information are due to human assumption of missing attributes and medication names in the gold standard. Conclusions: The MedXN system (http://sourceforge. net/projects/ohnlp/files/MedXN/) was able to extract comprehensive medication information with high accuracy and demonstrated good normalization capability to RxCUI as long as explicit evidence existed. More sophisticated inference rules might result in further improvements to specific RxCUI assignments for incomplete medication descriptions.

AB - Objective: We developed the Medication Extraction and Normalization (MedXN) system to extract comprehensive medication information and normalize it to the most appropriate RxNorm concept unique identifier (RxCUI) as specifically as possible. Methods Medication: descriptions in clinical notes were decomposed into medication name and attributes, which were separately extracted using RxNorm dictionary lookup and regular expression. Then, each medication name and its attributes were combined together according to RxNorm convention to find the most appropriate RxNorm representation. To do this, we employed serialized hierarchical steps implemented in Apache's Unstructured Information Management Architecture. We also performed synonym expansion, removed false medications, and employed inference rules to improve the medication extraction and normalization performance. Results: An evaluation on test data of 397 medication mentions showed F-measures of 0.975 for medication name and over 0.90 for most attributes. The RxCUI assignment produced F-measures of 0.932 for medication name and 0.864 for full medication information. Most false negative RxCUI assignments in full medication information are due to human assumption of missing attributes and medication names in the gold standard. Conclusions: The MedXN system (http://sourceforge. net/projects/ohnlp/files/MedXN/) was able to extract comprehensive medication information with high accuracy and demonstrated good normalization capability to RxCUI as long as explicit evidence existed. More sophisticated inference rules might result in further improvements to specific RxCUI assignments for incomplete medication descriptions.

UR - http://www.scopus.com/inward/record.url?scp=84906323732&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84906323732&partnerID=8YFLogxK

U2 - 10.1136/amiajnl-2013-002190

DO - 10.1136/amiajnl-2013-002190

M3 - Article

C2 - 24637954

AN - SCOPUS:84906323732

SN - 1067-5027

VL - 21

SP - 858

EP - 865

JO - Journal of the American Medical Informatics Association

JF - Journal of the American Medical Informatics Association

IS - 5

ER -

MedXN: An open source medication extraction and normalization tool for clinical text

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this