Recent Advances in Robust Speech Recognition Technology


Javier Ramírez, Juan Manuel Górriz

DOI: 10.2174/9781608051724111010
eISBN: 978-1-60805-172-4, 2011
ISBN: 978-1-60805-389-6

Indexed in: Scopus, EBSCO.

This E-book is a collection of articles that describe advances in speech recognition technology. Robustness in speech recognition refe...[view complete introduction]
US $
Buy Personal eBook
Order Library eBook
Order Printed Copy
Order PDF + Printed Copy (Special Offer)

*(Excluding Mailing and Handling)

🔒Secure Checkout Personal information is secured with SSL technology

Reviewing Feature Non-Linear Transformations for Robust Speech Recognition

- Pp. 190-196 (7)

Luz Garcia, Jose Carlos Segura and Angel de la Torre


The aim of Robust Speech Recognition is to reduce as much as possible the environmental mismatch between the training and test conditions in order to optimally use the acoustic models in the recognition process. There are several factors producing such mismatch: inter-speaker variability, intra-speaker variability, and changes in the speaker environment or in the channel characteristics. The changes in the environment represent a challenging area of work and constitute one of the main driving forces of research in voice processing, that nowadays faces application scenarios like mobile phones, moving cars, spontaneous speech, speech masked by other speech, speech masked by music or non-stationary noises. The different strategies that fight the effects of additive noise in the voice signal and the recognition process will be summarized in this review, focusing in the normalization techniques and particularly in the non linear transformations of the MFCC features. Histogram Equalization and Parametric Histogram Equalization with their variants and evolutions will be analyzed as main representatives of this family of non-linear feature transformations.

Purchase Chapter  Book Details


Webmaster Contact: Copyright © 2019 Bentham Science