Speech Coding Effect on Amazigh Alphabet Speech Recognition Performance

Mohamed Hamidi, Hassan Satori, Ouissam Zealouk, Khalid Satori, LIIAN

This paper addresses the speech coding problems that arise in VoIP-based automatic speech recognition systems, where speech is coded for transmission from the user to the recognition server. We evaluate the influence of the G711 and GSM audio codecs on speech recognition performance. In our approach, Mel-Frequency Cepstral Coefficients are used as the feature extraction technique. Gaussian mixture models and hidden Markov models are used for feature modelling. Our vocabulary consists of the Amazigh letters. Our findings indicate that the best system performance was obtained with the G711 codec, 3 HMM, and 16 GMMs.
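To illustrate the kind of distortion a codec such as G711 introduces before recognition, the following is a minimal sketch of mu-law companding, the logarithmic compression idea underlying the mu-law variant of G711. It is not the authors' pipeline: the function names are hypothetical, the continuous formula is used in place of the segmented approximation specified by the standard, and samples are assumed to be normalized to [-1, 1].

```python
import math

MU = 255  # companding constant of the mu-law variant (assumed here)


def mu_law_compress(x: float) -> float:
    """Map a normalized sample in [-1, 1] to a companded value in [-1, 1]."""
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_law_expand(y: float) -> float:
    """Invert mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)


def codec_round_trip(x: float, bits: int = 8) -> float:
    """Compress, quantize uniformly to a `bits`-bit code, then expand.

    The result is what a recognizer would see after the (idealized) codec.
    """
    levels = 2 ** bits - 1
    y = mu_law_compress(x)
    code = round((y + 1) / 2 * levels)   # integer code in [0, levels]
    y_hat = code / levels * 2 - 1        # back to [-1, 1]
    return mu_law_expand(y_hat)
```

Because the compression is logarithmic, quiet samples are reconstructed with a smaller absolute error than loud ones, which is one reason features such as MFCCs computed from decoded speech can differ from those computed from the original signal.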

Volume 11 | 02-Special Issue

Pages: 1392-1400