Initial evaluation of a continuous speech recognition program for radiology

K. M. Kanal; N. J. Hangiandreou; A. M.G. Sykes; H. E. Eklund; P. A. Araoz; J. A. Leon; B. J. Erickson

doi:10.1007/s10278-001-0022-z

Initial evaluation of a continuous speech recognition program for radiology

K. M. Kanal, N. J. Hangiandreou, A. M.G. Sykes, H. E. Eklund, P. A. Araoz, J. A. Leon, B. J. Erickson

Radiology

Research output: Contribution to journal › Article › peer-review

24 Scopus citations

Abstract

The aims of this work were to measure the accuracy of one continuous speech recognition product and dependence on the speaker's gender and status as a native or nonnative English speaker, and evaluate the product's potential for routine use in transcribing radiology reports. IBM MedSpeak/Radiology software, version 1.1 was evaluated by 6 speakers. Two were nonnative English speakers, and 3 were men. Each speaker dictated a set of 12 reports. The reports included neurologic and body imaging examinations performed with 6 different modalities. The dictated and original report texts were compared, and error rates for overall, significant, and subtle significant errors were computed. Error rate dependence on modality, native English speaker status, and gender were evaluated by performing t tests. The overall error rate was 10.3 ± 3.3%. No difference in accuracy between men and women was found; however, significant differences were seen for overall and significant errors when comparing native and nonnative English speakers (P = .009 and P = .008, respectively). The speech recognition software is approximately 90% accurate, and while practical implementation issues (rather than accuracy) currently limit routine use of this product throughout a radiology practice, application in niche areas such as the emergency room currently is being pursued. This methodology provides a convennient way to compare the initial accuracy of different speech recognition products, and changes in accuracy over time, in a detailed and sensitive manner.

Original language	English (US)
Pages (from-to)	30-37
Number of pages	8
Journal	Journal of Digital Imaging
Volume	14
Issue number	1
DOIs	https://doi.org/10.1007/s10278-001-0022-z
State	Published - Mar 2001

Keywords

Computers
Continuous speech recognition
Radiology transcription

ASJC Scopus subject areas

Radiological and Ultrasound Technology
Radiology Nuclear Medicine and imaging
Computer Science Applications

Access to Document

10.1007/s10278-001-0022-z

Cite this

@article{65712fcdabfa4ee38ec06198c8889b17,

title = "Initial evaluation of a continuous speech recognition program for radiology",

abstract = "The aims of this work were to measure the accuracy of one continuous speech recognition product and dependence on the speaker's gender and status as a native or nonnative English speaker, and evaluate the product's potential for routine use in transcribing radiology reports. IBM MedSpeak/Radiology software, version 1.1 was evaluated by 6 speakers. Two were nonnative English speakers, and 3 were men. Each speaker dictated a set of 12 reports. The reports included neurologic and body imaging examinations performed with 6 different modalities. The dictated and original report texts were compared, and error rates for overall, significant, and subtle significant errors were computed. Error rate dependence on modality, native English speaker status, and gender were evaluated by performing t tests. The overall error rate was 10.3 ± 3.3%. No difference in accuracy between men and women was found; however, significant differences were seen for overall and significant errors when comparing native and nonnative English speakers (P = .009 and P = .008, respectively). The speech recognition software is approximately 90% accurate, and while practical implementation issues (rather than accuracy) currently limit routine use of this product throughout a radiology practice, application in niche areas such as the emergency room currently is being pursued. This methodology provides a convennient way to compare the initial accuracy of different speech recognition products, and changes in accuracy over time, in a detailed and sensitive manner.",

keywords = "Computers, Continuous speech recognition, Radiology transcription",

author = "Kanal, {K. M.} and Hangiandreou, {N. J.} and Sykes, {A. M.G.} and Eklund, {H. E.} and Araoz, {P. A.} and Leon, {J. A.} and Erickson, {B. J.}",

year = "2001",

month = mar,

doi = "10.1007/s10278-001-0022-z",

language = "English (US)",

volume = "14",

pages = "30--37",

journal = "Journal of Digital Imaging",

issn = "0897-1889",

publisher = "Springer New York",

number = "1",

}

TY - JOUR

T1 - Initial evaluation of a continuous speech recognition program for radiology

AU - Kanal, K. M.

AU - Hangiandreou, N. J.

AU - Sykes, A. M.G.

AU - Eklund, H. E.

AU - Araoz, P. A.

AU - Leon, J. A.

AU - Erickson, B. J.

PY - 2001/3

Y1 - 2001/3

N2 - The aims of this work were to measure the accuracy of one continuous speech recognition product and dependence on the speaker's gender and status as a native or nonnative English speaker, and evaluate the product's potential for routine use in transcribing radiology reports. IBM MedSpeak/Radiology software, version 1.1 was evaluated by 6 speakers. Two were nonnative English speakers, and 3 were men. Each speaker dictated a set of 12 reports. The reports included neurologic and body imaging examinations performed with 6 different modalities. The dictated and original report texts were compared, and error rates for overall, significant, and subtle significant errors were computed. Error rate dependence on modality, native English speaker status, and gender were evaluated by performing t tests. The overall error rate was 10.3 ± 3.3%. No difference in accuracy between men and women was found; however, significant differences were seen for overall and significant errors when comparing native and nonnative English speakers (P = .009 and P = .008, respectively). The speech recognition software is approximately 90% accurate, and while practical implementation issues (rather than accuracy) currently limit routine use of this product throughout a radiology practice, application in niche areas such as the emergency room currently is being pursued. This methodology provides a convennient way to compare the initial accuracy of different speech recognition products, and changes in accuracy over time, in a detailed and sensitive manner.

AB - The aims of this work were to measure the accuracy of one continuous speech recognition product and dependence on the speaker's gender and status as a native or nonnative English speaker, and evaluate the product's potential for routine use in transcribing radiology reports. IBM MedSpeak/Radiology software, version 1.1 was evaluated by 6 speakers. Two were nonnative English speakers, and 3 were men. Each speaker dictated a set of 12 reports. The reports included neurologic and body imaging examinations performed with 6 different modalities. The dictated and original report texts were compared, and error rates for overall, significant, and subtle significant errors were computed. Error rate dependence on modality, native English speaker status, and gender were evaluated by performing t tests. The overall error rate was 10.3 ± 3.3%. No difference in accuracy between men and women was found; however, significant differences were seen for overall and significant errors when comparing native and nonnative English speakers (P = .009 and P = .008, respectively). The speech recognition software is approximately 90% accurate, and while practical implementation issues (rather than accuracy) currently limit routine use of this product throughout a radiology practice, application in niche areas such as the emergency room currently is being pursued. This methodology provides a convennient way to compare the initial accuracy of different speech recognition products, and changes in accuracy over time, in a detailed and sensitive manner.

KW - Computers

KW - Continuous speech recognition

KW - Radiology transcription

UR - http://www.scopus.com/inward/record.url?scp=0035263404&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035263404&partnerID=8YFLogxK

U2 - 10.1007/s10278-001-0022-z

DO - 10.1007/s10278-001-0022-z

M3 - Article

C2 - 11310913

AN - SCOPUS:0035263404

SN - 0897-1889

VL - 14

SP - 30

EP - 37

JO - Journal of Digital Imaging

JF - Journal of Digital Imaging

IS - 1

ER -

Initial evaluation of a continuous speech recognition program for radiology

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this