Use of Natural Language Processing Algorithms to Identify Common Data Elements in Operative Notes for Total Hip Arthroplasty

Cody C. Wyles; Meagan E. Tibbo; Sunyang Fu; Yanshan Wang; Sunghwan Sohn; Walter K. Kremers; Daniel J. Berry; David G. Lewallen; Hilal Maradit-Kremers

doi:10.2106/JBJS.19.00071

Research output: Contribution to journal › Article › peer-review

11 Scopus citations

Abstract

Background: Manual chart review is labor-intensive and requires specialized knowledge possessed by highly trained medical professionals. Natural language processing (NLP) tools are distinctive in their ability to extract critical information from raw text in electronic health records (EHRs). As a proof of concept for the potential application of this technology, we examined the ability of NLP to correctly identify common elements described by surgeons in operative notes for total hip arthroplasty (THA).

Methods: We evaluated primary THAs that had been performed at a single academic institution from 2000 to 2015. A training sample of operative reports was randomly selected to develop prototype NLP algorithms, and additional operative reports were randomly selected as the test sample. Three separate algorithms were created with rules aimed at capturing (1) the operative approach, (2) the fixation method, and (3) the bearing surface category. The algorithms were applied to operative notes to evaluate the language used by 29 different surgeons at our center and were applied to EHR data from outside facilities to determine external validity. Accuracy statistics were calculated with use of manual chart review as the gold standard.

Results: The operative approach algorithm demonstrated an accuracy of 99.2% (95% confidence interval [CI], 97.1% to 99.9%). The fixation technique algorithm demonstrated an accuracy of 90.7% (95% CI, 86.8% to 93.8%). The bearing surface algorithm demonstrated an accuracy of 95.8% (95% CI, 92.7% to 97.8%). Additionally, the NLP algorithms applied to operative reports from other institutions yielded comparable performance, demonstrating external validity.

Conclusions: NLP-enabled algorithms are a promising alternative to the current gold standard of manual chart review for identifying common data elements from orthopaedic operative notes. The present study provides a proof of concept for use of NLP techniques in clinical research studies and registry-development endeavors to reliably extract data of interest in an expeditious and cost-effective manner.
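
The abstract describes rule-based algorithms scored against manual chart review but does not list the rules themselves. The following is a minimal Python sketch of that general idea, not the authors' implementation: the keyword patterns, the label names, and the helper functions `classify` and `accuracy` are all hypothetical, chosen only to illustrate first-match keyword rules for the three data elements and a simple accuracy calculation against gold-standard labels.

```python
import re

# Hypothetical rules for illustration only; the abstract does not list the
# study's actual keywords. For each element, the first matching pattern wins.
RULES = {
    "approach": [
        ("posterior", r"\bposterior approach\b|\bpostero-?lateral\b"),
        ("anterior", r"\b(?:direct )?anterior approach\b"),
        ("lateral", r"\b(?:direct )?lateral approach\b|\bantero-?lateral\b"),
    ],
    "fixation": [
        ("hybrid", r"\bhybrid\b"),
        ("uncemented", r"\buncemented\b|\bcementless\b|\bpress-?fit\b"),
        ("cemented", r"\bcemented\b"),
    ],
    "bearing": [
        ("metal-on-polyethylene", r"\bmetal[- ]on[- ]poly(?:ethylene)?\b"),
        ("ceramic-on-polyethylene", r"\bceramic[- ]on[- ]poly(?:ethylene)?\b"),
        ("ceramic-on-ceramic", r"\bceramic[- ]on[- ]ceramic\b"),
        ("metal-on-metal", r"\bmetal[- ]on[- ]metal\b"),
    ],
}


def classify(note: str) -> dict:
    """Return one label per data element, or "unknown" if no rule matches."""
    result = {}
    for element, rules in RULES.items():
        result[element] = next(
            (label for label, pattern in rules
             if re.search(pattern, note, re.IGNORECASE)),
            "unknown",
        )
    return result


def accuracy(predicted: list, gold: list) -> float:
    """Fraction of predictions that agree with the manual-review labels."""
    if not gold or len(predicted) != len(gold):
        raise ValueError("predicted and gold must be non-empty and equal length")
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


if __name__ == "__main__":
    # Invented example note, not taken from the study data.
    note = ("Total hip arthroplasty via a posterior approach. "
            "Uncemented cup and stem. Ceramic-on-polyethylene bearing.")
    print(classify(note))
```

A real system would also need to handle negation, section boundaries, and surgeon-specific phrasing, which is presumably where much of the rule-development effort described in the Methods goes.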

Original language	English (US)
Pages (from-to)	1931-1938
Number of pages	8
Journal	Journal of Bone and Joint Surgery - American Volume
Volume	101
Issue number	21
DOIs	https://doi.org/10.2106/JBJS.19.00071
State	Published - Nov 6 2019

ASJC Scopus subject areas

Surgery
Orthopedics and Sports Medicine

Cite this

@article{a9cc761e5bd8454f90cdbfeded1950c2,
title = "Use of Natural Language Processing Algorithms to Identify Common Data Elements in Operative Notes for Total Hip Arthroplasty",
abstract = "Background: Manual chart review is labor-intensive and requires specialized knowledge possessed by highly trained medical professionals. Natural language processing (NLP) tools are distinctive in their ability to extract critical information from raw text in electronic health records (EHRs). As a proof of concept for the potential application of this technology, we examined the ability of NLP to correctly identify common elements described by surgeons in operative notes for total hip arthroplasty (THA). Methods: We evaluated primary THAs that had been performed at a single academic institution from 2000 to 2015. A training sample of operative reports was randomly selected to develop prototype NLP algorithms, and additional operative reports were randomly selected as the test sample. Three separate algorithms were created with rules aimed at capturing (1) the operative approach, (2) the fixation method, and (3) the bearing surface category. The algorithms were applied to operative notes to evaluate the language used by 29 different surgeons at our center and were applied to EHR data from outside facilities to determine external validity. Accuracy statistics were calculated with use of manual chart review as the gold standard. Results: The operative approach algorithm demonstrated an accuracy of 99.2% (95% confidence interval [CI], 97.1% to 99.9%). The fixation technique algorithm demonstrated an accuracy of 90.7% (95% CI, 86.8% to 93.8%). The bearing surface algorithm demonstrated an accuracy of 95.8% (95% CI, 92.7% to 97.8%). Additionally, the NLP algorithms applied to operative reports from other institutions yielded comparable performance, demonstrating external validity. Conclusions: NLP-enabled algorithms are a promising alternative to the current gold standard of manual chart review for identifying common data elements from orthopaedic operative notes. The present study provides a proof of concept for use of NLP techniques in clinical research studies and registry-development endeavors to reliably extract data of interest in an expeditious and cost-effective manner.",
author = "Wyles, {Cody C.} and Tibbo, {Meagan E.} and Fu, Sunyang and Wang, Yanshan and Sohn, Sunghwan and Kremers, {Walter K.} and Berry, {Daniel J.} and Lewallen, {David G.} and Maradit-Kremers, Hilal",
note = "Publisher Copyright: Copyright {\textcopyright} 2019 by the Journal of Bone and Joint Surgery, Incorporated.",
year = "2019",
month = nov,
day = "6",
doi = "10.2106/JBJS.19.00071",
language = "English (US)",
volume = "101",
pages = "1931--1938",
journal = "Journal of Bone and Joint Surgery - American Volume",
issn = "0021-9355",
publisher = "Journal of Bone and Joint Surgery Inc.",
number = "21",
}

TY  - JOUR
T1  - Use of Natural Language Processing Algorithms to Identify Common Data Elements in Operative Notes for Total Hip Arthroplasty
AU  - Wyles, Cody C.
AU  - Tibbo, Meagan E.
AU  - Fu, Sunyang
AU  - Wang, Yanshan
AU  - Sohn, Sunghwan
AU  - Kremers, Walter K.
AU  - Berry, Daniel J.
AU  - Lewallen, David G.
AU  - Maradit-Kremers, Hilal
PY  - 2019/11/6
Y1  - 2019/11/6
AB  - Background: Manual chart review is labor-intensive and requires specialized knowledge possessed by highly trained medical professionals. Natural language processing (NLP) tools are distinctive in their ability to extract critical information from raw text in electronic health records (EHRs). As a proof of concept for the potential application of this technology, we examined the ability of NLP to correctly identify common elements described by surgeons in operative notes for total hip arthroplasty (THA). Methods: We evaluated primary THAs that had been performed at a single academic institution from 2000 to 2015. A training sample of operative reports was randomly selected to develop prototype NLP algorithms, and additional operative reports were randomly selected as the test sample. Three separate algorithms were created with rules aimed at capturing (1) the operative approach, (2) the fixation method, and (3) the bearing surface category. The algorithms were applied to operative notes to evaluate the language used by 29 different surgeons at our center and were applied to EHR data from outside facilities to determine external validity. Accuracy statistics were calculated with use of manual chart review as the gold standard. Results: The operative approach algorithm demonstrated an accuracy of 99.2% (95% confidence interval [CI], 97.1% to 99.9%). The fixation technique algorithm demonstrated an accuracy of 90.7% (95% CI, 86.8% to 93.8%). The bearing surface algorithm demonstrated an accuracy of 95.8% (95% CI, 92.7% to 97.8%). Additionally, the NLP algorithms applied to operative reports from other institutions yielded comparable performance, demonstrating external validity. Conclusions: NLP-enabled algorithms are a promising alternative to the current gold standard of manual chart review for identifying common data elements from orthopaedic operative notes. The present study provides a proof of concept for use of NLP techniques in clinical research studies and registry-development endeavors to reliably extract data of interest in an expeditious and cost-effective manner.
UR  - http://www.scopus.com/inward/record.url?scp=85074674692&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=85074674692&partnerID=8YFLogxK
U2  - 10.2106/JBJS.19.00071
DO  - 10.2106/JBJS.19.00071
M3  - Article
C2  - 31567670
AN  - SCOPUS:85074674692
SN  - 0021-9355
VL  - 101
SP  - 1931
EP  - 1938
JO  - Journal of Bone and Joint Surgery - American Volume
JF  - Journal of Bone and Joint Surgery - American Volume
IS  - 21
ER  -
