Speaker Identification in Noisy Environments for Forensic Purposes

Rodarte Rodríguez, Armando; Becerra Sánchez, Aldonso; De La Rosa Vargas, José I.; Escalante García, Nivia I.; Olvera González, José E.; Velásquez Martínez, Emmanuel de J.; Zepeda Valles, Gustavo

Please use this identifier to cite or link to this item: http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/3429

Full metadata record

DC Field	Value	Language
dc.contributor	31249	en_US
dc.contributor.other	https://orcid.org/0000-0002-7337-8974	en_US
dc.coverage.spatial	Global	en_US
dc.creator	Rodarte Rodríguez, Armando	-
dc.creator	Becerra Sánchez, Aldonso	-
dc.creator	De La Rosa Vargas, José I.	-
dc.creator	Escalante García, Nivia I.	-
dc.creator	Olvera González, José E.	-
dc.creator	Velásquez Martínez, Emmanuel de J.	-
dc.creator	Zepeda Valles, Gustavo	-
dc.date.accessioned	2023-10-30T18:55:40Z	-
dc.date.available	2023-10-30T18:55:40Z	-
dc.date.issued	2022-10-30	-
dc.identifier	info:eu-repo/semantics/publishedVersion	en_US
dc.identifier.isbn	978-3-031-20321-3	en_US
dc.identifier.uri	http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/3429	-
dc.identifier.uri	http://dx.doi.org/10.48779/ricaxcan-260	-
dc.description.abstract	The speech is a biological or physical feature unique to each person, and this is widely used in speaker identification tasks like access control, transaction authentication, home automation applications, among others. The aim of this research is to propose a connected-words speaker recognition scheme based on a closed-set speaker-independent voice corpus in noisy environments that can be applied in contexts such as forensic purposes. Using a KDD analysis, MFCCs were used as filtering technique to extract speech features from 158 speakers, to later carry out the speaker identification process. Paper presents a performance comparison of ANN, KNN and logistic regression models, which obtained a F1 score of 98%, 98.32% and 97.75%, respectively. The results show that schemes such as KNN and ANN can achieve a similar performance in full voice files when applying the proposed KDD framework, generating robust models applied in forensic environments.	en_US
dc.language.iso	eng	en_US
dc.publisher	Springer	en_US
dc.relation	https://link.springer.com/chapter/10.1007/978-3-031-20322-0_21	en_US
dc.relation.ispartof	https://link.springer.com/conference/cimps	en_US
dc.relation.isbasedon	UAZ-2022-38599 Diseño de esquemas robustos para reconocimiento de voz y sistemas End-to-End (E2E): uso de nuevas funciones de costo y algoritmos de eliminación de ruido	en_US
dc.relation.uri	generalPublic	en_US
dc.rights	Attribution 3.0 United States	*
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/us/	*
dc.source	International Conference on Software Process Improvement CIMPS 2022: New Perspectives in Software Engineering, pp. 299–312	en_US
dc.subject.classification	INGENIERIA Y TECNOLOGIA [7]	en_US
dc.subject.other	Artificial intelligence	en_US
dc.subject.other	KDD	en_US
dc.subject.other	Prototyping	en_US
dc.subject.other	Speaker identification	en_US
dc.subject.other	Speech processing	en_US
dc.title	Speaker Identification in Noisy Environments for Forensic Purposes	en_US
dc.type	info:eu-repo/semantics/conferenceProceedings	en_US
Appears in Collections:	Documentos Académicos-- M. en Ciencias del Proc. de la Info.

Files in This Item:

File	Description	Size	Format
82_RodarteA_DelaRosa CIMPS P1 2022_23.pdf	RodarteA_DelaRosa CIMPS 2022	65,04 kB	Adobe PDF	View/Open

Show simple item record

This item is licensed under a Creative Commons License