Implementation of an algorithm for users identification considering physiological problems affecting the speech

  • Diego Enrique Rey-Lancheros Universidad Distrital Francisco José de Caldas. Bogotá, Colombia
  • Hernán Julian Gavilán-Acosta Universidad Distrital Francisco José de Caldas. Bogotá, Colombia
  • Helbert Eduardo Espitia-Cuchango Universidad Distrital Francisco José de Caldas. Bogotá, Colombia
Keywords: Identification, physiological problems, voice

Abstract

This paper shows the design and implementation of an algorithm for users voice identification, including considerations on physiologic issues affecting the speech so that when users manifest these problems, lower rates on fake rejections decrease. For purposes of managing the contemplated physiologic problems, algorithm design also takes a standard algorithm that uses cepstral coefficients which include additional characteristics determined by voice acoustic analysis. A test including several records from people with a healthy voice and those with voice affections is carried out in order to observe the performance of the algorithm; thus, observing that when applying those characteristics in voice analysis a better result is achieved regarding the case when cepstral coefficients are implemented.

Downloads

Download data is not yet available.

Author Biographies

Diego Enrique Rey-Lancheros, Universidad Distrital Francisco José de Caldas. Bogotá, Colombia
Ingeniero Electrónico. Universidad Distrital Francisco José de Caldas. Bogotá, Colombia.
Hernán Julian Gavilán-Acosta, Universidad Distrital Francisco José de Caldas. Bogotá, Colombia
Ingeniero Electrónico. Universidad Distrital Francisco José de Caldas. Bogotá, Colombia.
Helbert Eduardo Espitia-Cuchango, Universidad Distrital Francisco José de Caldas. Bogotá, Colombia
Ingeniero Electrónico. Universidad Distrital Francisco José de Caldas Bogotá, Colombia

References

S. Vasuhi, V. Vaidehi, B. N. Nare, T. Treesa, ‘‘An efficient multi-modal biometric person authentication system using Fuzzy Logic’’, in IEEE Second International Conference in Advanced Computing (ICoAC), December, 2010.

H. Hollien, ‘‘Barriers to Progress in Speaker Identification’’, Linguistic Evidence in Security, Law and Intelligence, vol. 1, no. 1, pp. 1-23, 2013.

T. Kinnunen, H. Li, ‘‘An Overview of Text Independent Speaker Recognition: from Features to Supervectors’’, Speech Communication, vol. 52, no. 1, pp. 12-40, 2010.

R. Peacocke, ‘‘An Introduction to Speech and Speaker Recognition’’, Computer, vol. 26, no. 8, pp. 26-33, 1990.

R. Price, J. Willmore, W. Roberts, K. Zyga, ‘‘Genetically optimized feed forward neural networks for speaker identification’’, in Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies, Sept. 2000.

T. Gannert, ‘‘A Speaker Verification System Under the Scope: Alize’’, Master’s Thesis in Speech Technology at TMH, 2007.

J. P. Teixeira, A. Gonçalves, ‘‘Algorithm for jitter and Shimmer measurement in pathologic voices’’, Procedia Computer Science, vol. 100, pp. 271-279, 2016.

E. San Segundo, A. Tsanas, P. Gómez, ‘‘Euclidean Distances as measures of speaker similarity including identical twin pairs: A forensic investigation using source and filter voice characteristics’’, Forensic Science International, vol. 270, pp. 25-38, 2017.

H. Beigi, ‘‘Speaker Recognition’’, InTech, 2012.

A. Boles, P. Rad, ‘‘Voice biometrics: Deep learning-based voiceprint authentication system’’, in 12th System of Systems Engineering Conference (SoSE), pp. 1-6, June, 2017.

E. Ramdinmawii, V. K. Mittal, ‘‘Gender identification from speech signal by examining the speech production characteristics’’, in International Conference on Signal Processing and Communication (ICSC), pp. 244-249, December, 2016.

H. Pérez, J. Martínez, I. Espinosa, J. Rodríguez, H. Ávila, ‘‘Using acoustic paralinguistic information to assess the interaction quality in speech-based systems for elderly users’’, International Journal of Human-Computer Studies, vol. 98, pp. 1-13, 2017.

H. Beigi, ‘‘Fundamentals of Speaker Recognition’’, Springer US, 2011.

M. Fröhlich, H. Michaelis, H. Werner, ‘‘Acoustic Breathiness Measures in the description of patologic voices’’, in IEEE International Conference on Acoustics, Speech and Signal Processing, May, 1998.

D. Michaelis, T. Gramss, H. W. Strube, ‘‘Glottal-to-Noise Excitation Ratio - a New Measure for Describing Pathological Voices’’, ACUSTICA - acta acústica, vol. 83, pp. 700-706, 1997.

A. Mendoza, G. Mansilla, ‘‘Rinitis alérgica’’, Revista de la Sociedad Boliviana de Pediatría, vol. 41, no.1, pp. 50-53, 2002.

K. Aubry, A. El Sanharawi, A. Pommier, ‘‘Laringitis agudas del adulto’’, EMC – Otorrinolaringología, vol. 46, no. 1, pp. 1-9, 2017.

J. C. Casado, A. Pérez, ‘‘Trastornos de la voz: del diagnóstico al tratamiento’’, Ediciones Aljibe, 2009.

M. Farrús, J. Hernando, P. Ejarque, ‘‘Jitter and Shimmer Measurements for Speaker Recognition’’, in 8th Annual Conference of the International Speech Communication Association ISCA, pp. 778-81, Belgium, Aug. 2007.

M. Shahbakhi, D. Taheri, E. Tahami, ‘‘Speech Analysis for Diagnosis of Parkinson’s Disease Using Genetic Algorithm and Support Vector Machine’’, Journal of Biomedical Science and Engineering, vol. 7, no. 4, 2014.

A. Zewoudie, J. Luque, F. Hernando, ‘‘Jitter and Shimmer Measurements for Speaker Diarization’’, in VIII Jornadas en Tecnología del Habla and IV Iberian SLTech Workshop, 2014, pp. 21-30.

K. Messer, J. Matas, J. Kittler, K. Jonsson, ‘‘The extended M2VTS Database’’, in Second International Conference on Audio- and Video-based Biometric Person Authentication AVBPA, 1999, pp. 72-77.

E. D. Ellis, ‘‘Design of a Speaker Recognition Code using MATLAB’’, Department of Computer and Electrical Engineering - University of Thennessee, Knoxville Tennessee, May, 2001.
How to Cite
Rey-Lancheros, D., Gavilán-Acosta, H., & Espitia-Cuchango, H. (1). Implementation of an algorithm for users identification considering physiological problems affecting the speech. ITECKNE, 14(2), 131-139. https://doi.org/https://doi.org/10.15332/iteckne.v14i2.1767
Section
Research and Innovation Articles