Please use this identifier to cite or link to this item:
http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713
Title: | Speech recognition in a dialog system: from conventional to deep processing. A case study applied to Spanish |
Authors: | Becerra, Aldonso; De la Rosa Vargas, José Ismael; González Ramírez, Efrén |
Issue Date: | Aug-2018 |
Publisher: | Springer |
Abstract: | The aim of this paper is to present an overview of the automatic speech recognition (ASR) module in a spoken dialog system and how it has evolved from the conventional GMM-HMM (Gaussian mixture model - hidden Markov model) architecture toward the recent nonlinear DNN-HMM (deep neural network - hidden Markov model) scheme. GMMs long dominated as the baseline for speech recognition, but in recent years, with the resurgence of artificial neural networks (ANNs), they have been surpassed in most recognition tasks. A notable property of ANN-based acoustic models is that their weights can be adjusted in two training steps: i) initialization of the weights (with or without pre-training) and ii) fine-tuning. |
URI: | http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713 https://doi.org/10.48779/2d22-9s79 |
ISSN: | 1380-7501 1573-7721 |
Other Identifiers: | info:eu-repo/semantics/publishedVersion |
Appears in Collections: | *Documentos Académicos*-- M. en Ciencias del Proc. de la Info. |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
25_Becerra_DelaRosa MTAP P1 2018.pdf | Becerra_DelaRosa MTAP 2018 | 506.65 kB | Adobe PDF |
This item is licensed under a Creative Commons License