Speech recognition in a dialog system: from conventional to deep processing. A case study applied to Spanish

Please use this identifier to cite or link to this item: http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713

Title:	Speech recognition in a dialog system: from conventional to deep processing. A case study applied to Spanish
Authors:	Becerra, Aldonso; De la Rosa Vargas, José Ismael; González Ramírez, Efrén
Issue Date:	Aug-2018
Publisher:	Springer
Abstract:	This paper presents an overview of the automatic speech recognition (ASR) module in a spoken dialog system and how it has evolved from the conventional GMM-HMM (Gaussian mixture model - hidden Markov model) architecture toward the more recent nonlinear DNN-HMM (deep neural network - hidden Markov model) scheme. GMMs long dominated as the baseline for speech recognition, but in recent years, with the resurgence of artificial neural networks (ANNs), they have been surpassed in most recognition tasks. A notable property of ANN-based acoustic models is that their weights can be adjusted in two training steps: i) initialization of the weights (with or without pre-training) and ii) fine-tuning.
URI:	http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713 https://doi.org/10.48779/2d22-9s79
ISSN:	1380-7501; 1573-7721
Other Identifiers:	info:eu-repo/semantics/publishedVersion
Appears in Collections:	Documentos Académicos-- M. en Ciencias del Proc. de la Info.

Files in This Item:

File	Description	Size	Format
25_Becerra_DelaRosa MTAP P1 2018.pdf	Becerra_DelaRosa MTAP 2018	506.65 kB	Adobe PDF

This item is licensed under a Creative Commons License