Please use this identifier to cite or link to this item:
http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/3429
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | 31249 | en_US |
dc.contributor.other | https://orcid.org/0000-0002-7337-8974 | en_US |
dc.coverage.spatial | Global | en_US |
dc.creator | Rodarte Rodríguez, Armando | - |
dc.creator | Becerra Sánchez, Aldonso | - |
dc.creator | De La Rosa Vargas, José I. | - |
dc.creator | Escalante García, Nivia I. | - |
dc.creator | Olvera González, José E. | - |
dc.creator | Velásquez Martínez, Emmanuel de J. | - |
dc.creator | Zepeda Valles, Gustavo | - |
dc.date.accessioned | 2023-10-30T18:55:40Z | - |
dc.date.available | 2023-10-30T18:55:40Z | - |
dc.date.issued | 2022-10-30 | - |
dc.identifier | info:eu-repo/semantics/publishedVersion | en_US |
dc.identifier.isbn | 978-3-031-20321-3 | en_US |
dc.identifier.uri | http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/3429 | - |
dc.identifier.uri | http://dx.doi.org/10.48779/ricaxcan-260 | - |
dc.description.abstract | The speech is a biological or physical feature unique to each person, and this is widely used in speaker identification tasks like access control, transaction authentication, home automation applications, among others. The aim of this research is to propose a connected-words speaker recognition scheme based on a closed-set speaker-independent voice corpus in noisy environments that can be applied in contexts such as forensic purposes. Using a KDD analysis, MFCCs were used as filtering technique to extract speech features from 158 speakers, to later carry out the speaker identification process. Paper presents a performance comparison of ANN, KNN and logistic regression models, which obtained a F1 score of 98%, 98.32% and 97.75%, respectively. The results show that schemes such as KNN and ANN can achieve a similar performance in full voice files when applying the proposed KDD framework, generating robust models applied in forensic environments. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Springer | en_US |
dc.relation | https://link.springer.com/chapter/10.1007/978-3-031-20322-0_21 | en_US |
dc.relation.ispartof | https://link.springer.com/conference/cimps | en_US |
dc.relation.isbasedon | UAZ-2022-38599 Diseño de esquemas robustos para reconocimiento de voz y sistemas End-to-End (E2E): uso de nuevas funciones de costo y algoritmos de eliminación de ruido | en_US |
dc.relation.uri | generalPublic | en_US |
dc.rights | Attribution 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/us/ | * |
dc.source | International Conference on Software Process Improvement CIMPS 2022: New Perspectives in Software Engineering, pp. 299–312 | en_US |
dc.subject.classification | INGENIERIA Y TECNOLOGIA [7] | en_US |
dc.subject.other | Artificial intelligence | en_US |
dc.subject.other | KDD | en_US |
dc.subject.other | Prototyping | en_US |
dc.subject.other | Speaker identification | en_US |
dc.subject.other | Speech processing | en_US |
dc.title | Speaker Identification in Noisy Environments for Forensic Purposes | en_US |
dc.type | info:eu-repo/semantics/conferenceProceedings | en_US |
Appears in Collections: | *Documentos Académicos*-- M. en Ciencias del Proc. de la Info. |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
82_RodarteA_DelaRosa CIMPS P1 2022_23.pdf | RodarteA_DelaRosa CIMPS 2022 | 65,04 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License