A Noise-Robust Speech Recognition Method Composed of Weak Noise Suppression and Weak Vector Taylor Series Adaptation

December 6, 2012
Shuji Komeij | NEC Corporation

This presentation proposes a noise-robust speech recognition method composed of weak noise suppression (NS) and weak Vector Taylor Series Adaptation (VTSA). The proposed method compensates defects of NS and VTSA, and gains only the advantages by them. The weak NS reduces distortion by over-suppression that may accompany noise-suppressed speech. The weak VTSA avoids over-adaptation by offsetting a part of acoustic-model adaptation that corresponds to the suppressed noise. Evaluation results with the AURORA2 database show that the proposed method achieves as much as 1.2 points higher word accuracy (87.4%) than a method with VTSA alone (86.2%) that is always better than its counterpart with NS.

Speaker Details

Shuji Komeij, a Research Member of Information and Media Laboratories, NEC Corporation, received B. Eng. and M. Eng. degrees from Tokyo University of Agriculture and Technology in 2007, and from the University of Tokyo in 2009, respectively. He has been engaged in research projects on noise robust speech recognition.

Research Area
- Human language technologies

Watch Next

MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges
May 14, 2024
Eloi Moliner,

Hannes Gamper

A Noise-Robust Speech Recognition Method Composed of Weak Noise Suppression and Weak Vector Taylor Series Adaptation

Speaker Details

Research Area

Watch Next

MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges