Please use this identifier to cite or link to this item: http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713
Title: Speech recognition in a dialog system: from conventional to deep processing A case study applied to Spanish
Authors: Becerra, Aldonso
De la Rosa Vargas, José Ismael
González Ramírez, Efrén
Issue Date: Aug-2018
Publisher: Springer
Abstract: The aim of this paper is to illustrate an overview of the automatic speech recognition (ASR) module in a spoken dialog system and how it has evolved from the conventional GMM-HMM (Gaussian mixture model - hidden Markov model) architecture toward the recent nonlinear DNN-HMM (deep neural network) scheme. GMMs have dominated for a long time the baseline of speech recognition, but in the past years with the resurgence of artificial neural networks (ANNs), the former models have been surpassed in most recognition tasks. An outstanding consideration for ANNs-based acoustic model is the fact that their weights can be adjusted in two training steps: i) initialization of the weights (with or without pre-training) and ii) fine-tuning.
URI: http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713
https://doi.org/10.48779/2d22-9s79
ISSN: 1380-7501
1573-7721
Other Identifiers: info:eu-repo/semantics/publishedVersion
Appears in Collections:*Documentos Académicos*-- M. en Ciencias del Proc. de la Info.

Files in This Item:
File Description SizeFormat 
25_Becerra_DelaRosa MTAP P1 2018.pdfBecerra_DelaRosa MTAP 2018506,65 kBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons