Análisis acústico de la vozmedidas temporales, espectrales y cepstrales en la voz normal con el Praat en una muestra de hablantes de español

  1. Jonathan Delgado Hernández
  2. Nieves María León Gómez
  3. Alejandra Jiménez Ruiz
  4. Laura Izquierdo Arteaga
Journal:
Revista de Investigación en Logopedia

ISSN: 2174-5218

Year of publication: 2017

Volume: 7

Issue: 2

Pages: 108-127

Type: Article

More publications in: Revista de Investigación en Logopedia

Abstract

Acoustic analysis is a tool that provides objective information about the voice. In recent years, long-term average spectrum (LTAS) and cepstral measurements, such as the smoothed cepstral peak prominence (CPPs), have complemented the traditional measures, demonstrating in many studies a high correlation with the degree of severity of dysphonia. The objective of this descriptive study was to calculate the normal values of temporal, spectral and cepstral measures in a sample of 50 Spanish speakers (25 men and 25 women) in the Praat, taking into account the main factors that influence their reliability as the microphone type, the ambient noise level, the analysis program and the acoustic parameters used. Two voice samples were made for each subject: 1) sustained /a/ with which the CPPs were calculated and the parameters of the fundamental frequency (F0), noise and frequency and amplitude perturbations, and 2) a connected speech sample where CPPs and LTAS slopes were calculated. The results of the analysis with the sustained vowel show significant differences as a function of the sex in the F0, the jitter absolute and in all the parameters of the perturbation of the amplitude and the noise. In connected speech, significant differences were observed between men and women in the spectral slope obtained from the trend line through the LTAS and the CPPs

Bibliographic References

  • Awan S. N., y Roy N. (2006). Toward the development of an objective index of dysphonia severity: a four-factor acoustic model. Clinical Linguistics and Phonetics, 20, 35–49.
  • Baken, R. J., y Orlikoff, R. F. (2000). Clinical Measurement of Speech and Voice, 2ª ed. San Diego: Singular Publishing Group.
  • Balasubramanium, R. K., Bhat, J. S., Fahim, S. 3 rd , y Raju, R. 3 rd . (2011). Cepstral analysis of voice in unilateral adductor vocal fold palsy. Journal of Voice, 25, 326–329.
  • Barsties, B., y De Bodt, M. (2015). Assessment of voice quality: current state-of-theart. Auris Nasus Larynx, 42, 183-188.
  • Barsties, B., y Maryn, Y. (2016). External validation of the Acoustic Voice Quality Index version 03.01 with extended representativity. Annals of Otology, Rhinology & Laryngology, 125, 571-583.
  • Boersma P., y Weenink D. (2015). Praat: doing phonetics by computer [Computer program]. Version 5.4.22 http://www.praat.org/. Consultado el 6 de Febrero de 2017.
  • Brinca, L. F., Batista, A. P. F., Tavares, A. I., Gonçalves, I. C., y Moreno, M. L. (2014). Use of cepstral analyses for differentiating normal from dysphonic voices: A comparative study of connected speech versus sustained vowel in European Portuguese female speakers. Journal of Voice, 28, 282-286.
  • Cobeta, I., y Núñez, F. (2013). Análisis de la señal acústica. En I. Cobeta, F. Núñez y S. Fernández (Eds.). Patología de la voz (pp. 188-198). Barcelona: Marge Médica Books.
  • de Krom, G. (1993). A cepstrum-based technique for determining a harmonics-tonoise ratio in speech signals. Journal of Speech and Hearing Research, 36, 254–266.
  • Dejonckere, P. H. (1998). Cepstral voice analysis: link with perception and stroboscopy. Revue LaryngologieOtologieRhinologie (Bord), 119, 245–246.
  • Dejonckere, P., y Wieneke, G. (1996). Cepstra of normal and pathological voices: correlation with acoustic, aerodynamic and perceptual data. En M. J. Ball y M.
  • Duckworth (Eds.). Advances in Clinical Phonetics Vol. 6. (pp. 217-226). Amsterdam: John Benjamins.
  • Dejonkere, P. H. (1983). Recognition of hoarseness by means of LTAS. International Journal of Rehabilitation Research, 6, 343-345.
  • Delgado-Hernández, J., León-Gómez, N., Izquierdo-Arteaga, L., y Llanos-Fumero, Y. (2017). Análisis cepstral de la voz normal y patológica en adultos españoles. Medida de la Prominencia del Pico Cepstral Suavizado en vocales sostenidas versus habla conectada. Acta Otorrinolaringológica Española. Doi: 10.1016/j.otorri.2017.05.006.
  • Deliyski, D. D., Evans, M. K., y Shaw, H. S. (2005). Influence of data acquisition environment on accuracy of acoustic voice quality measurements. Journal of Voice, 19, 176–186.
  • Deliyski, D. D., Shaw, H. S., Evans, M. K., y Vesselinow, R. (2006). Regression tree approach to studying factors influencing acoustic voice analysis. Folia Phoniatrica et Logopaedica, 58, 274-288.
  • Deliyski, D. D., Shaw, H. S., y Evans, M. K. (2005). Adverse effects of environmental noise on acoustic voice quality measurements. Journal of Voice, 19, 15-28.
  • Eadie, T. L., y Baylor, C. R. (2006). The effect of perceptual training on inexperienced listeners judgments of dysphonic voice. Journal of Voice, 20, 527–544.
  • Elisei, N. G. (2012). Análisis acústico de la voz normal y patológica utilizando dos sistemas diferentes: ANAGRAF y PRAAT. Interdisciplinaria, 29, 271-286.
  • Ferrer, C. A., De Bodt, M., Maryn, Y., Van de Heyning, P., y Hernández-Díaz, M. E. (2007). Properties of the cepstral peak prominence and its usefulness in vocal quality measurements. En C. Manfredi (Ed.). Models and analysis of vocal emissions for biomedical applications: 5th international workshop (pp. 93-96). Florencia: Firenze University Press.
  • Heman-Ackah, Y. D., Michael, D. D., Baroody, M. M., Ostrowski, R., Hillenbrand, J., Heuer, R. J., … y Staloff, R. T. (2003). Cepstral peak prominence: a more reliable measure of dysphonia. Annals of Otology, Rhinology and Laryngology, 112, 324– 333.
  • Heman-Ackah, Y. D., Michael, D. D., y Goding G. S. Jr. (2002). The relationship between cepstral peak prominence and selected parameters of dysphonia. Journal of Voice, 16, 20–27.
  • Heman-Ackah, Y. D., Sataloff, R. T., Laureyns, G., Lurie, D., Michael, D. D., Heuer, R., … y Lyons, K. (2014). Quantifying the cepstral peak prominence, a measure of dysphonia. Journal of Voice, 28, 783-788.
  • Hillenbrand, J., Cleveland, R. A., y Erickson, R. (1994). Acoustic correlates of breathy vocal quality. Journal of Speech and Hearing Research, 37, 769–778.
  • Hillenbrand, J., y Houde, R. A. (1996). Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. Journal of Speech and Hearing Research, 39, 311–321.
  • Hochmuth, S., Brand, T., Zokoll, M. A., Castro, F. Z., Wardenga, N., y Kollmeier, B. (2012). A Spanish matrix sentence test for assessing speech reception thresholds in noise. International Journal of Audiology, 51, 536-544.
  • Ingrisano, D. R., Perry, C. K., y Jepson, K. R. (1998). Environmental noise: a threat to automatic voice analysis. American Journal of Speech-Language Pathology, 7, 91– 96.
  • Kaplan, H. M. (1971). Anatomy and physiology of speech (2 nd ed.). Nueva York: Mcgraw-hill book company.
  • Klatt, D. H., y Klatt, L. C. (1975). Analysis, synthesis, and perception on voice quality variations among female and male talkers. Journal of the Acoustical Society of America, 87, 820-857
  • Lowell, S. Y., Colton, R. H., Kelley, R. T., y Hahn, Y. C. (2011). Spectral and cepstralbased measures during continuous speech: capacity to distinguish dysphonia and consistency within a speaker. Journal of Voice, 25, 223-232.
  • Maryn, Y., Corthals, P., De Bodt, M., Van Cauwenberge, P., y Deliyski, D. (2009). Perturbation measures of voice: a comparative study between MultiDimensional Voice Program and Praat. Folia Phoniatrica et Logopaedica, 61, 217-226.
  • Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P., y Corthals, P. (2009). Acoustic measurement of overall voice quality: a meta-analysis. Journal of the Acoustical Society of America, 126, 2619–2634.
  • Maryn, Y., y Weenink, D. (2015). Objective dysphonia measures in the program Praat: smoothed cepstral peak prominence and acoustic voice quality index. Journal of Voice, 29, 35-43.
  • Moers, C., Möbius, B., Rosanowski, F., Nöth, E., Eysholdt, U., y Haderlein, T. (2012). Vowel-and text-based cepstral analysis of chronic hoarseness. Journal of Voice, 26, 416-424.
  • Núñez-Batalla, F. (2008). Función fonatoria. Tratado de otorrinolaringología y cirugía de cabeza y cuello, 2ª ed. (pp. 2503-2518). En C. Suárez, C., L. M. Gil-Carcedo, J. Marco, J. E. Medina, P. Ortega y J. Trinidad (Eds.). Madrid: Ed. Médica Panamericana.
  • Núñez-Batalla, F., Corte-Santos, P., Señaris-González, B., Rodríguez-Prado, N., y SuárezNieto, C. (2004). Evaluación espectral cuantitativa de la hipofunción vocal. Acta Otorrinolaringológica Española, 55, 327-333.
  • Núñez-Batalla, F., González-Márquez, R., Peláez-González, M. B., González-Laborda, I., Fernández-Fernández, M., y Morato-Galán, M. (2014). Acoustic voice analysis using the Praat program: comparative study with the Dr. Speech program. Acta Otorrinolaringológica española, 65, 170-176.
  • Oğuz, H., Kiliç, M. A., y Şafak, M. A. (2011). Comparison of results in two acoustic analysis programs: Praat and MDVP. Turkish Journal of Medical Sciences, 41, 835- 841.
  • Sauder, C., Bretl, M. y Eadie, T. (2017). Predicting Voice disorder status from smoothed measures of cepstral peak prominence using Praat and Analysis of Dysphonia in Speech and Voice (ADSV). Journal of Voice, 31, 557-566.
  • Titze, I. R., y Winholtz, W. S. (1993). Effects of microphone type and placement on voice perturbation measurements. Journal of Speech and Hearing Research, 36, 1177–1190.
  • Wolfe, V. I., Martin, D. P., y Palmer, C. I. (2000). Perception of dysphonic voice quality by naive listeners. Journal of Speech, Language and Hearing Research, 43, 697–705.
  • Wolfe, V., y Martin, D. (1997). Acoustic correlates of dysphonia: type and severity. Journal of Communication Disorders, 30, 403–415.