Title
Effect of phase-sensitive environment model and higher order VTS on noisy speech feature enhancement
Author
Abstract
Model-based techniques for robust speech recognition often require the statistics of noisy speech. In this paper, we propose two modifications that yield more accurate statistics for the combined HMM (built from a clean speech model and a noise model). Usually, the phase difference between speech and noise is neglected in the acoustic environment model. We show, however, that a phase-sensitive environment model can be integrated efficiently into Multi-Stream Model-Based Feature Enhancement, where it gives rise to more accurate covariance matrices for the noisy speech. In addition, expanding the Vector Taylor Series up to the second-order term yields an improved noisy speech mean. Finally, we explain how the front-end clean speech model itself can be improved by preprocessing the training data. Recognition results on the Aurora4 database illustrate the effect of each of these modifications on noise robustness.
Language
English
Source (journal)
Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing. - [Piscataway, N.J.], 1998, currens
Source (book)
30th IEEE International Conference on Acoustics, Speech, and Signal Processing, March 19-23, 2005, Philadelphia, PA
Publication
New York : IEEE, 2005
ISBN
0-7803-8874-7
Volume/pages
(2005), p. 433-436
ISI
000229404200109