On the Use of a Foundation Acoustic Model to Identify Highly Relevant Phonetic Information of Parkinson’s Speech

Escobar-Grisales D, Ríos-Urrego CD, Orozco-Arroyave JR (2025)


Publication Type: Conference contribution

Publication year: 2025

Journal

Publisher: Springer Science and Business Media Deutschland GmbH

Book Volume: 2222 CCIS

Pages Range: 71-81

Conference Proceedings Title: Communications in Computer and Information Science

Event location: Barranquilla, COL

ISBN: 9783031745942

DOI: 10.1007/978-3-031-74595-9_7

Abstract

Parkinson’s disease (PD) is a neurological condition that produces several speech deficits, typically known as hypokinetic dysarthria, affecting the production of different phonemes and resulting in an impaired speech communication. This work presents a detailed investigation based on the wav2vec 2.0 foundational model specifically tuned to perform the automatic discrimination between PD and healthy control (HC) subjects. The investigation showed that, instead of considering the complete wav2vec 2.0 architecture with 12 layers, the five layer is enough to find a model suitable to obtain good classification accuracies. Besides, this work presents a framework where frame-wise classification results are considered, enabling a detailed analysis regarding which phonemes and phonological classes are more accurate for performing the classification. All experiments are evaluated in an external and independent test set, therefore given the good results found in this work, which motivates us to continue working in this direction. For future work, we plan to modify the method to perform the time-stamp labeling to model co-articulation information in speech produced by PD patients.

Involved external institutions

How to cite

APA:

Escobar-Grisales, D., Ríos-Urrego, C.D., & Orozco-Arroyave, J.R. (2025). On the Use of a Foundation Acoustic Model to Identify Highly Relevant Phonetic Information of Parkinson’s Speech. In Juan Carlos Figueroa-García, Elvis Eduardo Gaona García, German Hernández, Diego Fernando Suero Pérez (Eds.), Communications in Computer and Information Science (pp. 71-81). Barranquilla, COL: Springer Science and Business Media Deutschland GmbH.

MLA:

Escobar-Grisales, D., C. D. Ríos-Urrego, and J. R. Orozco-Arroyave. "On the Use of a Foundation Acoustic Model to Identify Highly Relevant Phonetic Information of Parkinson’s Speech." Proceedings of the 11th Workshop on Engineering Applications, WEA 2024, Barranquilla, COL Ed. Juan Carlos Figueroa-García, Elvis Eduardo Gaona García, German Hernández, Diego Fernando Suero Pérez, Springer Science and Business Media Deutschland GmbH, 2025. 71-81.

BibTeX: Download