Effect of phase-sensitive environment model and higher order VTS on noisy speech feature enhancement

Stouten, V.; van Hamme, H.; Wambacq, P.

Title

Author

Stouten, V.

van Hamme, H.

Wambacq, P.

Abstract

Model-based techniques for robust speech recognition often require the statistics of noisy speech. In this paper, we propose two modifications to obtain more accurate versions of the statistics of the combined HMM (starting from a clean speech and a noise model). Usually, the phase difference between speech and noise is neglected in the acoustic environment model. However, we show how a phase-sensitive environment model can be efficiently integrated in the context of Multi-Stream Model-Based Feature Enhancement and gives rise to more accurate covariance matrices for the noisy speech. Also, by expanding the Vector Taylor Series up to the second order term, an improved noisy speech mean can be obtained. Finally, we explain how the front-end clean speech model itself can be improved by a preprocessing of the training data. Recognition results on the Aurora4 database illustrate the effect on the noise robustness for each of these modifications.

Language

English

Source (journal)

Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing. - [Piscataway, N.J.], 1998, currens

Source (book)

30th IEEE International Conference on Acoustics, Speech, and Signal, Processing, MAR 19-23, 2005, Philadelphia, PA

Publication

New york : Ieee , 2005

ISBN

0-7803-8874-7

Volume/pages

(2005) , p. 433-436

ISI

000229404200109

Publication type				P1 Proceeding

Subject				Physics

Affiliation				Publications with a UAntwerp address

Web of Science

View record in Web of Science®

View citing articles in Web of Science®

Identifier

Creation

03.01.2013

Last edited

17.06.2024

To cite this reference

https://hdl.handle.net/10067/1036920151162165141