Pérez-Toro PA, Klumpp P, Vasquez-Correa JC, Schuster M, Nöth E, Orozco-Arroyave JR, Arias Vergara T (2022)
Publication Type: Conference contribution
Publication year: 2022
Publisher: Springer Science and Business Media Deutschland GmbH
Book Volume: 13502 LNAI
Pages Range: 352-363
Conference Proceedings Title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Event location: Brno, CZE
ISBN: 9783031162695
DOI: 10.1007/978-3-031-16270-1_29
Spectrograms provide a visual representation of the time-frequency variations of a speech signal. Furthermore, the color scales can be used as a pre-processing normalization step. In this study, we investigated the suitability of using different color scales for the reconstruction of spectrograms together with bottleneck features extracted from Convolutional AutoEncoders (CAEs). We trained several CAEs considering different parameters such as the number of channels, wideband/narrowband spectrograms, and different color scales. Additionally, we tested the suitability of the proposed CAE architecture for the prediction of the severity of Parkinson’s Disease (PD) and for the nasality level in children with Cleft Lip and Palate (CLP). The results showed that it is possible to estimate the neurological state for PD with Spearman’s correlations of up to 0.71 using the Grayscale, and the nasality level in CLP with F-scores of up to 0.58 using the raw spectrogram. Although the color scales improved performance in some cases, it is not clear which color scale is the most suitable for the selected application, as we did not find significant differences in the results for each color scale.
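To make the idea of a color scale as a normalization step concrete, the following is a minimal sketch, not the authors' pipeline. It computes wideband and narrowband log-magnitude spectrograms with NumPy and maps them to 8-bit grayscale. The function names, the window lengths (5 ms and 30 ms), the 10 ms hop, the 80 dB dynamic range, and the synthetic test tone are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def log_spectrogram(signal, sample_rate, window_ms, hop_ms=10.0):
    """Log-magnitude spectrogram from a Hann-windowed short-time Fourier transform.

    Short windows give a wideband spectrogram, long windows a narrowband one.
    All parameter values used here are assumptions for illustration.
    """
    win = int(round(sample_rate * window_ms / 1000.0))
    hop = int(round(sample_rate * hop_ms / 1000.0))
    window = np.hanning(win)
    n_frames = 1 + (len(signal) - win) // hop
    frames = np.stack(
        [signal[i * hop:i * hop + win] * window for i in range(n_frames)]
    )
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Transpose so rows are frequency bins and columns are time frames.
    return 20.0 * np.log10(magnitude + 1e-10).T


def to_grayscale(spec_db, dynamic_range_db=80.0):
    """Clip to a fixed range below the peak and rescale to gray levels 0..255."""
    top = spec_db.max()
    floor = top - dynamic_range_db
    scaled = (np.clip(spec_db, floor, top) - floor) / dynamic_range_db
    return np.round(scaled * 255.0).astype(np.uint8)


# Synthetic one-second 440 Hz tone standing in for a speech recording.
rate = 16000
t = np.arange(rate) / rate
tone = np.sin(2 * np.pi * 440.0 * t)

wideband = to_grayscale(log_spectrogram(tone, rate, window_ms=5.0))
narrowband = to_grayscale(log_spectrogram(tone, rate, window_ms=30.0))
print(wideband.shape, narrowband.shape)
```

The resulting arrays are bounded single-channel images. A different color scale would replace `to_grayscale` with a mapping from the same normalized values to several channels.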
APA:
Pérez-Toro, P.A., Klumpp, P., Vasquez-Correa, J.C., Schuster, M., Nöth, E., Orozco-Arroyave, J.R., & Arias Vergara, T. (2022). 50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 352-363). Brno, CZE: Springer Science and Business Media Deutschland GmbH.
MLA:
Pérez-Toro, Paula Andrea, et al. "50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders." Proceedings of the 25th International Conference on Text, Speech, and Dialogue, TSD 2022, Brno, CZE. Ed. Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala. Springer Science and Business Media Deutschland GmbH, 2022. 352-363.