A Fusion NLP Model for the Inference of Standardized Thyroid Nodule Malignancy Scores from Radiology Report Text

Thiago Santos; Omar N. Kallas; Janice Newsome; Daniel Rubin; Judy Wawira Gichoya; Imon Banerjee

A Fusion NLP Model for the Inference of Standardized Thyroid Nodule Malignancy Scores from Radiology Report Text

Thiago Santos, Omar N. Kallas, Janice Newsome, Daniel Rubin, Judy Wawira Gichoya, Imon Banerjee

Diagnostic Radiology

Research output: Contribution to journal › Article › peer-review

Abstract

Radiology reports are a rich resource for advancing deep learning applications for medical images, facilitating the generation of large-scale annotated image databases. Although the ambiguity and subtlety of natural language poses a significant challenge to information extraction from radiology reports. Thyroid Imaging Reporting and Data Systems (TI-RADS) has been proposed as a system to standardize ultrasound imaging reports for thyroid cancer screening and diagnosis, through the implementation of structured templates and a standardized thyroid nodule malignancy risk scoring system; however there remains significant variation in radiologist practice when it comes to diagnostic thyroid ultrasound interpretation and reporting. In this work, we propose a computerized approach using a contextual embedding and fusion strategy for the large-scale inference of TI-RADS final assessment categories from narrative ultrasound (US) reports. The proposed model has achieved high accuracy on an internal data set, and high performance scores on an external validation dataset.

Original language	English (US)
Pages (from-to)	1079-1088
Number of pages	10
Journal	AMIA ... Annual Symposium proceedings. AMIA Symposium
Volume	2021
State	Published - 2021

ASJC Scopus subject areas

General Medicine

Cite this

@article{2d0b22e7086b42368fe6ddeba3907aee,

title = "A Fusion NLP Model for the Inference of Standardized Thyroid Nodule Malignancy Scores from Radiology Report Text",

abstract = "Radiology reports are a rich resource for advancing deep learning applications for medical images, facilitating the generation of large-scale annotated image databases. Although the ambiguity and subtlety of natural language poses a significant challenge to information extraction from radiology reports. Thyroid Imaging Reporting and Data Systems (TI-RADS) has been proposed as a system to standardize ultrasound imaging reports for thyroid cancer screening and diagnosis, through the implementation of structured templates and a standardized thyroid nodule malignancy risk scoring system; however there remains significant variation in radiologist practice when it comes to diagnostic thyroid ultrasound interpretation and reporting. In this work, we propose a computerized approach using a contextual embedding and fusion strategy for the large-scale inference of TI-RADS final assessment categories from narrative ultrasound (US) reports. The proposed model has achieved high accuracy on an internal data set, and high performance scores on an external validation dataset.",

author = "Thiago Santos and Kallas, {Omar N.} and Janice Newsome and Daniel Rubin and Gichoya, {Judy Wawira} and Imon Banerjee",

year = "2021",

language = "English (US)",

volume = "2021",

pages = "1079--1088",

journal = "AMIA ... Annual Symposium proceedings. AMIA Symposium",

issn = "1559-4076",

publisher = "American Medical Informatics Association",

}

TY - JOUR

T1 - A Fusion NLP Model for the Inference of Standardized Thyroid Nodule Malignancy Scores from Radiology Report Text

AU - Santos, Thiago

AU - Kallas, Omar N.

AU - Newsome, Janice

AU - Rubin, Daniel

AU - Gichoya, Judy Wawira

AU - Banerjee, Imon

PY - 2021

Y1 - 2021

N2 - Radiology reports are a rich resource for advancing deep learning applications for medical images, facilitating the generation of large-scale annotated image databases. Although the ambiguity and subtlety of natural language poses a significant challenge to information extraction from radiology reports. Thyroid Imaging Reporting and Data Systems (TI-RADS) has been proposed as a system to standardize ultrasound imaging reports for thyroid cancer screening and diagnosis, through the implementation of structured templates and a standardized thyroid nodule malignancy risk scoring system; however there remains significant variation in radiologist practice when it comes to diagnostic thyroid ultrasound interpretation and reporting. In this work, we propose a computerized approach using a contextual embedding and fusion strategy for the large-scale inference of TI-RADS final assessment categories from narrative ultrasound (US) reports. The proposed model has achieved high accuracy on an internal data set, and high performance scores on an external validation dataset.

AB - Radiology reports are a rich resource for advancing deep learning applications for medical images, facilitating the generation of large-scale annotated image databases. Although the ambiguity and subtlety of natural language poses a significant challenge to information extraction from radiology reports. Thyroid Imaging Reporting and Data Systems (TI-RADS) has been proposed as a system to standardize ultrasound imaging reports for thyroid cancer screening and diagnosis, through the implementation of structured templates and a standardized thyroid nodule malignancy risk scoring system; however there remains significant variation in radiologist practice when it comes to diagnostic thyroid ultrasound interpretation and reporting. In this work, we propose a computerized approach using a contextual embedding and fusion strategy for the large-scale inference of TI-RADS final assessment categories from narrative ultrasound (US) reports. The proposed model has achieved high accuracy on an internal data set, and high performance scores on an external validation dataset.

UR - http://www.scopus.com/inward/record.url?scp=85126841133&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85126841133&partnerID=8YFLogxK

M3 - Article

C2 - 35308953

AN - SCOPUS:85126841133

SN - 1559-4076

VL - 2021

SP - 1079

EP - 1088

JO - AMIA ... Annual Symposium proceedings. AMIA Symposium

JF - AMIA ... Annual Symposium proceedings. AMIA Symposium

ER -

A Fusion NLP Model for the Inference of Standardized Thyroid Nodule Malignancy Scores from Radiology Report Text

Abstract

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this