A Case Study of Speech Recognition in Spanish: from Conventional to Deep Approach

Please use this identifier to cite or link to this item: http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1886

Title:	A Case Study of Speech Recognition in Spanish: from Conventional to Deep Approach
Authors:	Becerra de la Rosa, Aldonso De la Rosa Vargas, José Ismael González Ramírez, Efrén
Issue Date:	Oct-2016
Publisher:	IEEE
Abstract:	The aim of this paper is to exhibit a comparative case study of the conventional speech recognition GMM-HMM (Gaussian mixture model - hidden Markov model) architecture and the recent model based on deep neural networks. During years the GMM approach has controlled the speech recognition tasks, however it has been surpassed with the resurgence of artificial neural networks. To exemplify these acoustic modeling frameworks, a case study has been conducted by using the Kaldi toolkit, employing a personalized speaker-independent mid-vocabulary voice corpus for recognition of digit strings and personal name lists in latin spanish on a connected-words pone dialing task. The speech recognition accuracy obtained in the results shows a better word error rate by using the DNN acoustic modeling. A 20:71% relative improvement is obtained with DNNHMM models (3:33% WER) in respect to the lowest GMM-HMM rate (4:20% WER).
URI:	http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1886 https://doi.org/10.48779/xc36-yn86
Other Identifiers:	info:eu-repo/semantics/publishedVersion
Appears in Collections:	Documentos Académicos-- M. en Ciencias del Proc. de la Info.

Files in This Item:

File	Description	Size	Format
64_Becerra_DelaRosa_IEEEANDESCON P1 2016.pdf	Becerra_DelaRosa_IEEEANDESCON P1 2016	405,7 kB	Adobe PDF	View/Open

This item is licensed under a Creative Commons License