Authors
Oropeza Rodríguez José Luis
Suárez Guerra Sergio
Jiménez Hernández Mario
Title The Place Theory as an Alternative Solution in Automatic Speech Recognition Tasks
Type Journal
Subtype ISI
Description Lecture Notes in Computer Science
Abstract Parametric representations based on cochlear behavior have recently been used in several studies on Automatic Speech Recognition (ASR). This paper shows how an alternative solution to the Lesser and Berkeley’s cochlea model, reported in the state of the art, can be applied to ASR tasks. An approach is proposed that constructs the filter bank in a new way within the parametric representation used to extract MFCC. The resulting distribution of the filter bank is then used to obtain a new representation of speech in the frequency domain. MFCC parameters normally use the Mel scale to create the filter bank; here the Mel scale function is replaced, and the center frequencies of the filter bank are instead derived from cochlear behavior according to the theory. A performance of 98.5% was reached on a task using isolated digits pronounced by 5 different speakers in Spanish and neutral recordings from the SUSAS corpus, with some advantages in comparison with MFCC.
Notes Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Location Puerto Vallarta
Country Mexico
Pages 167-174
Vol. / Ch. 8827
Start 2014-11-02
End 2014-11-05
ISBN/ISSN 978-3-319-12567-1