Autores
Oropeza Rodríguez José Luis
Suárez Guerra Sergio
Título Using Values of the Human Cochlea in the Macro and Micro Mechanical Model for Automatic Speech Recognition
Tipo Congreso
Sub-tipo SCOPUS
Descripción Lecture Notes in Computer Science; 13th Mexican International Conference on Artificial Intelligence, MICAI 2014
Resumen Recently the parametric representation using cochlea behavior has been used in different studies related with Automatic Speech Recognition (ASR). That is because this hearing organ in mammalians is the most important element used to make a transduction of the sound pressure that is received by the outer ear. This paper shows how the macro and micro mechanical model is used in ASR tasks. The values that Neely, Elliot and Ku founded in their works, related with the macro and micro mechanical model such as Neely were used to set the central frequencies of a bank filter to obtain parameters from the speech in a similar form as MFCC (Mel Frequency Cepstrum Coefficients) has been constructed. An approach that considers a new form to distribute the bank filter in our parametric representation is proposed. Then this distribution of the bank filter to have a different representation of the speech in frequency domain compared with MFCC is applied. The response of these three values mentioned above into macro and micro mechanical model to create the central frequencies of the bank filter were used, then the Mel scale function substituted by a representation based in the cochlear response based on the Neely model. This model was used with a set of different parameters of the cochlea, used by Nelly, Elliot and Ku in their works, such as mass, damping and stiffness; among others. A performance of 98 to 100% was reached for a task that uses Spanish isolated digits pronounced by 5 differe
Observaciones (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Code 111059
Lugar Tuxtla Gutiérrez
País Mexico
No. de páginas 242-251
Vol. / Cap. 8857
Inicio 2014-11-16
Fin 2014-11-22
ISBN/ISSN 978-331913649-3