TY - GEN
T1 - Towards a clinical tool for automatic intelligibility assessment
AU - Berisha, Visar
AU - Utianski, Rene
AU - Liss, Julie
PY - 2013/10/18
Y1 - 2013/10/18
N2 - An important, yet under-explored, problem in speech processing is the automatic assessment of intelligibility for pathological speech. In practice, intelligibility assessment is often done through subjective tests administered by speech pathologists; however research has shown that these tests are inconsistent, costly, and exhibit poor reliability. Although some automatic methods for intelligibility assessment for telecommunications exist, research specific to pathological speech has been limited. Here, we propose an algorithm that captures important multi-scale perceptual cues shown to correlate well with intelligibility. Nonlinear classifiers are trained at each time scale and a final intelligibility decision is made using ensemble learning methods from machine learning. Preliminary results indicate a marked improvement in intelligibility assessment over published baseline results.
AB - An important, yet under-explored, problem in speech processing is the automatic assessment of intelligibility for pathological speech. In practice, intelligibility assessment is often done through subjective tests administered by speech pathologists; however research has shown that these tests are inconsistent, costly, and exhibit poor reliability. Although some automatic methods for intelligibility assessment for telecommunications exist, research specific to pathological speech has been limited. Here, we propose an algorithm that captures important multi-scale perceptual cues shown to correlate well with intelligibility. Nonlinear classifiers are trained at each time scale and a final intelligibility decision is made using ensemble learning methods from machine learning. Preliminary results indicate a marked improvement in intelligibility assessment over published baseline results.
KW - intelligibility assessment
KW - machine learning
KW - multi-scale analysis
KW - speech pathology
UR - http://www.scopus.com/inward/record.url?scp=84890499378&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84890499378&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2013.6638172
DO - 10.1109/ICASSP.2013.6638172
M3 - Conference contribution
AN - SCOPUS:84890499378
SN - 9781479903566
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 2825
EP - 2828
BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Y2 - 26 May 2013 through 31 May 2013
ER -