Speech recognition with μ-law companded features on reverberated signals

Haderlein T, Stemmer G, Nöth E, Haderlein T (2003)


Publication Status: Published

Publication Type: Conference contribution, Conference Contribution

Publication year: 2003

Publisher: Springer-Verlag

City/Town: Berlin

Book Volume: 2807

Pages Range: 173-180

Conference Proceedings Title: Proceedings on the 6th International Conference on Text, Speech, Dialogue - TSD 2003

Event location: Ceske Budejovice CZ

URI: https://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=9444285489&origin=inward

Abstract

One of the goals of the EMBASSI project is the creation of a speech interface between a user and a TV set or VCR. The interface should allow spontaneous speech recorded by microphones far away from the speaker. This paper describes experiments evaluating the robustness of a speech recognizer against reverberation. For this purpose a speech corpus was recorded with several different distortion types under real-life conditions. On these data the recognition results for reverberated signals using μ-law companded features were compared to an MFCC baseline system. Trained with clear speech, the word accuracy for the μ-law features on highly reverberated signals was 3 percent points better than the baseline result.

Authors with CRIS profile

How to cite

APA:

Haderlein, T., Stemmer, G., Nöth, E., & Haderlein, T. (2003). Speech recognition with μ-law companded features on reverberated signals. In Matouzsek V.; Mautner P. (Eds.), Proceedings on the 6th International Conference on Text, Speech, Dialogue - TSD 2003 (pp. 173-180). Ceske Budejovice, CZ: Berlin: Springer-Verlag.

MLA:

Haderlein, Tino, et al. "Speech recognition with μ-law companded features on reverberated signals." Proceedings of the 6th International Conference on Text, Speech, Dialogue - TSD 2003, Ceske Budejovice Ed. Matouzsek V.; Mautner P., Berlin: Springer-Verlag, 2003. 173-180.

BibTeX: Download