Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels

Maryn, Youri; Corthals, Paul; van Cauwenberge, Paul; Roy, Nelson; De Bodt, Marc

doi:10.1016/J.JVOICE.2008.12.014

Title

Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels

Author

Maryn, Youri

Corthals, Paul

van Cauwenberge, Paul

Roy, Nelson

De Bodt, Marc

Abstract

To improve ecological validity, perceptual and instrumental assessment of disordered voice, including overall voice quality, should ideally sample both sustained vowels and continuous speech. This investigation assessed the utility of combining both voice contexts for the purpose of auditory-perceptual ratings as well as acoustic measurement of overall voice quality. Sustained vowel and continuous speech samples from 251 subjects with (n = 229) or without (n = 22) various voice disorders were concatenated and perceptually rated on overall voice quality by five experienced voice clinicians. After removing the nonvoiced segments within the continuous speech samples, the concatenated samples were analyzed using 13 acoustic measures based on fundamental frequency perturbation, amplitude perturbation, spectral and cepstral analyses. Stepwise multiple regression analysis yielded a six-variable acoustic model for the multiparametric measurement of overall voice quality of the concatenated samples (with a cepstral measure as the main contributor to the prediction of overall voice quality). The correlation of this model with mean ratings of overall voice quality resulted in rs = 0.78. A cross-validation approach involving the iterated internal cross-correlations with 30 subgroups of 100, 50, and 10 samples confirmed a comparable degree of association. Furthermore, the ability of the model to distinguish voice-disordered from vocally normal participants was assessed using estimates of diagnostic precision including receiver operating characteristic (ROC) curve analysis, sensitivity, and specificity, as well as likelihood ratios (LRs), which adjust for base-rate differences between the groups. Depending on the cutoff criteria employed, the analyses revealed an impressive area under ROC = 0.895 as well as respectable sensitivity, specificity, and LR. 
The results support the diagnostic utility of combining voice samples from both continuous speech and sustained vowels in acoustic and perceptual analysis of disordered voice. The findings are discussed in relation to the extant literature and the need for further refinement of the acoustic algorithm.

Language

English

Source (journal)

Journal of voice. - New York, N.Y.

Publication

New York, N.Y. : 2010

ISSN

0892-1997

DOI

10.1016/J.JVOICE.2008.12.014

Volume/pages

24 :5 (2010) , p. 540-555

ISI

000281710100004

Full text (Publisher's DOI)

https://doi.org/10.1016/J.JVOICE.2008.12.014

Faculty/Department

Faculty of Medicine and Health Sciences

Research group

Translational Neurosciences (TNW)

Publication type

A1 Journal article

Subject

Human medicine

Affiliation

Publications with a UAntwerp address

Identifier

Creation

20.10.2010

Last edited

23.08.2022

To cite this reference

https://hdl.handle.net/10067/842520151162165141