Implementation of an algorithm for users identification considering physiological problems affecting the speech
DOI:
https://doi.org/10.15332/iteckne.v14i2.1767Keywords:
Identification, physiological problems, voiceAbstract
This paper shows the design and implementation of an algorithm for users voice identification, including considerations on physiologic issues affecting the speech so that when users manifest these problems, lower rates on fake rejections decrease. For purposes of managing the contemplated physiologic problems, algorithm design also takes a standard algorithm that uses cepstral coefficients which include additional characteristics determined by voice acoustic analysis. A test including several records from people with a healthy voice and those with voice affections is carried out in order to observe the performance of the algorithm; thus, observing that when applying those characteristics in voice analysis a better result is achieved regarding the case when cepstral coefficients are implemented.Downloads
References
S. Vasuhi, V. Vaidehi, B. N. Nare, T. Treesa, ‘‘An efficient multi-modal biometric person authentication system using Fuzzy Logic’’, in IEEE Second International Conference in Advanced Computing (ICoAC), December, 2010.
H. Hollien, ‘‘Barriers to Progress in Speaker Identification’’, Linguistic Evidence in Security, Law and Intelligence, vol. 1, no. 1, pp. 1-23, 2013.
T. Kinnunen, H. Li, ‘‘An Overview of Text Independent Speaker Recognition: from Features to Supervectors’’, Speech Communication, vol. 52, no. 1, pp. 12-40, 2010.
R. Peacocke, ‘‘An Introduction to Speech and Speaker Recognition’’, Computer, vol. 26, no. 8, pp. 26-33, 1990.
R. Price, J. Willmore, W. Roberts, K. Zyga, ‘‘Genetically optimized feed forward neural networks for speaker identification’’, in Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies, Sept. 2000.
T. Gannert, ‘‘A Speaker Verification System Under the Scope: Alize’’, Master’s Thesis in Speech Technology at TMH, 2007.
J. P. Teixeira, A. Gonçalves, ‘‘Algorithm for jitter and Shimmer measurement in pathologic voices’’, Procedia Computer Science, vol. 100, pp. 271-279, 2016.
E. San Segundo, A. Tsanas, P. Gómez, ‘‘Euclidean Distances as measures of speaker similarity including identical twin pairs: A forensic investigation using source and filter voice characteristics’’, Forensic Science International, vol. 270, pp. 25-38, 2017.
H. Beigi, ‘‘Speaker Recognition’’, InTech, 2012.
A. Boles, P. Rad, ‘‘Voice biometrics: Deep learning-based voiceprint authentication system’’, in 12th System of Systems Engineering Conference (SoSE), pp. 1-6, June, 2017.
E. Ramdinmawii, V. K. Mittal, ‘‘Gender identification from speech signal by examining the speech production characteristics’’, in International Conference on Signal Processing and Communication (ICSC), pp. 244-249, December, 2016.
H. Pérez, J. Martínez, I. Espinosa, J. Rodríguez, H. Ávila, ‘‘Using acoustic paralinguistic information to assess the interaction quality in speech-based systems for elderly users’’, International Journal of Human-Computer Studies, vol. 98, pp. 1-13, 2017.
H. Beigi, ‘‘Fundamentals of Speaker Recognition’’, Springer US, 2011.
M. Fröhlich, H. Michaelis, H. Werner, ‘‘Acoustic Breathiness Measures in the description of patologic voices’’, in IEEE International Conference on Acoustics, Speech and Signal Processing, May, 1998.
D. Michaelis, T. Gramss, H. W. Strube, ‘‘Glottal-to-Noise Excitation Ratio - a New Measure for Describing Pathological Voices’’, ACUSTICA - acta acústica, vol. 83, pp. 700-706, 1997.
A. Mendoza, G. Mansilla, ‘‘Rinitis alérgica’’, Revista de la Sociedad Boliviana de Pediatría, vol. 41, no.1, pp. 50-53, 2002.
K. Aubry, A. El Sanharawi, A. Pommier, ‘‘Laringitis agudas del adulto’’, EMC – Otorrinolaringología, vol. 46, no. 1, pp. 1-9, 2017.
J. C. Casado, A. Pérez, ‘‘Trastornos de la voz: del diagnóstico al tratamiento’’, Ediciones Aljibe, 2009.
M. Farrús, J. Hernando, P. Ejarque, ‘‘Jitter and Shimmer Measurements for Speaker Recognition’’, in 8th Annual Conference of the International Speech Communication Association ISCA, pp. 778-81, Belgium, Aug. 2007.
M. Shahbakhi, D. Taheri, E. Tahami, ‘‘Speech Analysis for Diagnosis of Parkinson’s Disease Using Genetic Algorithm and Support Vector Machine’’, Journal of Biomedical Science and Engineering, vol. 7, no. 4, 2014.
A. Zewoudie, J. Luque, F. Hernando, ‘‘Jitter and Shimmer Measurements for Speaker Diarization’’, in VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop, 2014, pp. 21-30.
K. Messer, J. Matas, J. Kittler, K. Jonsson, ‘‘The extended M2VTS Database’’, in Second International Conference on Audio- and Video-based Biometric Person Authentication AVBPA, 1999, pp. 72-77.
E. D. Ellis, ‘‘Design of a Speaker Recognition Code using MATLAB’’, Department of Computer and Electrical Engineering - University of Thennessee, Knoxville Tennessee, May, 2001.
Downloads
How to Cite
Issue
Section
License
La revista ITECKNE se encuentra registrada bajo una licencia de Creative Commons Reconocimiento-NoComercial 4.0 Internacional Por lo tanto, esta obra se puede reproducir, distribuir y comunicar públicamente, siempre que se reconozca el nombre de los autores y a la Universidad Santo Tomás. Se permite citar, adaptar, transformar, autoarchivar, republicar y crear a partir del material, siempre que se reconozca adecuadamente la autoría, se proporcione un enlace a la obra original y se indique si se han realizado cambios.
La Revista ITECKNE no retiene los derechos sobre las obras publicadas y los contenidos son responsabilidad exclusiva de los autores, quienes conservan sus derechos morales, intelectuales, de privacidad y publicidad. Sin embargo esta facultada para editar, publicar, reproducir y distribuir tanto en medios impresos como digitales, además de incluir el artículo en índices internacionales y/o bases de datos, de igual manera, se faculta a la editorial para utilizar las imágenes, tablas y/o cualquier material gráfico presentado en el artículo para el diseño de carátulas o posters de la misma revista.


