Quantitative and graphic acoustic analysis of phonatory modulations: The modulogram

Eugene H. Buder, Edythe A. Strand

Research output: Contribution to journalArticle

15 Citations (Scopus)

Abstract

A method is presented for analyzing phonatory instabilities that occur as modulations of fundamental frequency (f0) and sound pressure level (SPL) on the order of 0.2 to 20 cycles per second. Such long-term phonatory instabilities, including but not limited to traditional notions of tremor, are distinct from cycle-to-cycle perturbation such as jitter or shimmer. For each of the 2 parameters (f0, in Hz, and SPL, in dB), 3 frequency domains are proposed: (a) flutter (10-20 Hz), (b) tremor (2-10 Hz), and (c) wow (0.2-2.0 Hz), yielding 6 types of instability. Analyses were implemented using fast Fourier transforms (FFTs) with domain-specific analysis parameters. Outputs include a graphic display in the form of a set of low-frequency spectrograms (the "modulogram") and quantitative measures of the frequencies, magnitudes, durations, and sinusoidal form of the instabilities. An index of a given instability is developed by combining its duration and average modulation magnitude into a single quantity. Performance of the algorithms was assessed by analyzing test signals with known degrees of modulation, and a range of applications was reviewed to provide a rationale for use of modulograms in phonatory assessment.

Original languageEnglish (US)
Pages (from-to)475-490
Number of pages16
JournalJournal of Speech, Language, and Hearing Research
Volume46
Issue number2
DOIs
StatePublished - Apr 2003

Fingerprint

Tremor
Acoustics
acoustics
Pressure
Fourier Analysis
Modulation
Acoustic Analysis
performance
Sound

Keywords

  • Amyotrophic lateral sclerosis
  • Dysarthria
  • Phonatory instability
  • Spasmodic dysphonia
  • Voice assessment

ASJC Scopus subject areas

  • Rehabilitation
  • Linguistics and Language
  • Health Professions(all)

Cite this

Quantitative and graphic acoustic analysis of phonatory modulations : The modulogram. / Buder, Eugene H.; Strand, Edythe A.

In: Journal of Speech, Language, and Hearing Research, Vol. 46, No. 2, 04.2003, p. 475-490.

Research output: Contribution to journalArticle

@article{787b39b78ec143038e72c25bb58bf6b6,
title = "Quantitative and graphic acoustic analysis of phonatory modulations: The modulogram",
abstract = "A method is presented for analyzing phonatory instabilities that occur as modulations of fundamental frequency (f0) and sound pressure level (SPL) on the order of 0.2 to 20 cycles per second. Such long-term phonatory instabilities, including but not limited to traditional notions of tremor, are distinct from cycle-to-cycle perturbation such as jitter or shimmer. For each of the 2 parameters (f0, in Hz, and SPL, in dB), 3 frequency domains are proposed: (a) flutter (10-20 Hz), (b) tremor (2-10 Hz), and (c) wow (0.2-2.0 Hz), yielding 6 types of instability. Analyses were implemented using fast Fourier transforms (FFTs) with domain-specific analysis parameters. Outputs include a graphic display in the form of a set of low-frequency spectrograms (the {"}modulogram{"}) and quantitative measures of the frequencies, magnitudes, durations, and sinusoidal form of the instabilities. An index of a given instability is developed by combining its duration and average modulation magnitude into a single quantity. Performance of the algorithms was assessed by analyzing test signals with known degrees of modulation, and a range of applications was reviewed to provide a rationale for use of modulograms in phonatory assessment.",
keywords = "Amyotrophic lateral sclerosis, Dysarthria, Phonatory instability, Spasmodic dysphonia, Voice assessment",
author = "Buder, {Eugene H.} and Strand, {Edythe A.}",
year = "2003",
month = "4",
doi = "10.1044/1092-4388(2003/039)",
language = "English (US)",
volume = "46",
pages = "475--490",
journal = "Journal of Speech, Language, and Hearing Research",
issn = "1092-4388",
publisher = "American Speech-Language-Hearing Association (ASHA)",
number = "2",

}

TY - JOUR

T1 - Quantitative and graphic acoustic analysis of phonatory modulations

T2 - The modulogram

AU - Buder, Eugene H.

AU - Strand, Edythe A.

PY - 2003/4

Y1 - 2003/4

N2 - A method is presented for analyzing phonatory instabilities that occur as modulations of fundamental frequency (f0) and sound pressure level (SPL) on the order of 0.2 to 20 cycles per second. Such long-term phonatory instabilities, including but not limited to traditional notions of tremor, are distinct from cycle-to-cycle perturbation such as jitter or shimmer. For each of the 2 parameters (f0, in Hz, and SPL, in dB), 3 frequency domains are proposed: (a) flutter (10-20 Hz), (b) tremor (2-10 Hz), and (c) wow (0.2-2.0 Hz), yielding 6 types of instability. Analyses were implemented using fast Fourier transforms (FFTs) with domain-specific analysis parameters. Outputs include a graphic display in the form of a set of low-frequency spectrograms (the "modulogram") and quantitative measures of the frequencies, magnitudes, durations, and sinusoidal form of the instabilities. An index of a given instability is developed by combining its duration and average modulation magnitude into a single quantity. Performance of the algorithms was assessed by analyzing test signals with known degrees of modulation, and a range of applications was reviewed to provide a rationale for use of modulograms in phonatory assessment.

AB - A method is presented for analyzing phonatory instabilities that occur as modulations of fundamental frequency (f0) and sound pressure level (SPL) on the order of 0.2 to 20 cycles per second. Such long-term phonatory instabilities, including but not limited to traditional notions of tremor, are distinct from cycle-to-cycle perturbation such as jitter or shimmer. For each of the 2 parameters (f0, in Hz, and SPL, in dB), 3 frequency domains are proposed: (a) flutter (10-20 Hz), (b) tremor (2-10 Hz), and (c) wow (0.2-2.0 Hz), yielding 6 types of instability. Analyses were implemented using fast Fourier transforms (FFTs) with domain-specific analysis parameters. Outputs include a graphic display in the form of a set of low-frequency spectrograms (the "modulogram") and quantitative measures of the frequencies, magnitudes, durations, and sinusoidal form of the instabilities. An index of a given instability is developed by combining its duration and average modulation magnitude into a single quantity. Performance of the algorithms was assessed by analyzing test signals with known degrees of modulation, and a range of applications was reviewed to provide a rationale for use of modulograms in phonatory assessment.

KW - Amyotrophic lateral sclerosis

KW - Dysarthria

KW - Phonatory instability

KW - Spasmodic dysphonia

KW - Voice assessment

UR - http://www.scopus.com/inward/record.url?scp=0037392727&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0037392727&partnerID=8YFLogxK

U2 - 10.1044/1092-4388(2003/039)

DO - 10.1044/1092-4388(2003/039)

M3 - Article

C2 - 14700387

AN - SCOPUS:0037392727

VL - 46

SP - 475

EP - 490

JO - Journal of Speech, Language, and Hearing Research

JF - Journal of Speech, Language, and Hearing Research

SN - 1092-4388

IS - 2

ER -