Optimization of MLP Using Genetic Algorithms Applied to Arabic Speech Recognition

M. Ben Nasr; S. Saoud; A. Cherif

doi:10.15866/irecos.v8i2.3137

Optimization of MLP Using Genetic Algorithms Applied to Arabic Speech Recognition

M. Ben Nasr^(1*), S. Saoud⁽²⁾, A. Cherif⁽³⁾

^(*) Corresponding author

DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract

This paper presents a novel system for Arabic speech recognition of Arabic isolated words with mono-locutor and a small vocabulary. We have used a database consisting of eleven isolated Arabic words each of them was repeated twenty five times by the same -locutor. Mel Frequency Cepstral Coefficient (MFCC) and Bionic Wavelet Transform (BWT) are used for feature extraction from each recorded word. The obtained coefficients were then concatenated to construct one input node of a Multi-Layer Perceptual (MLP) used for features classification and recognition. We describe in this paper the use of Genetic Algorithm (GA) for optimizing the topology of each Multi-Layer Perceptron (MLP). So, the GA is utilized to find the optimal number of neurons in input layer and in hidden layer, the training epochs and the learning goal of network. From the results, it was observed that the integration of the GA with feed forward network can improve classification rate to 100%
Copyright © Praise Worthy Prize - All rights reserved.

Keywords

Arabic Speech Recognition; Multi-Layer Perceptron (MLP); Genetic Algorithm (GA); Mel Frequency Cepstral Coefficient MFCC; Bionic Wavelet Transform (BWT)

Full Text:

PDF

References

T. Lee and P.C. Ching, Cantonese Syllable Recognition Using Neural Networks, IEEE Transactions on Speech and Audio Processing, 1999, Vol. 7(4), pp. 466-472.

V. Skorpil and J. Stastny, Back-Propagation and K-Means Algorithms Comparison, in Proc. of 8 IEEE International Conference on Signal Processing, 2006, pp. 16-20.

L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition,Prentice-Hall, New Jersey, 1993.

C.H. Lin, C.H. Wu, P.Y. Ting and H.M. Wang, Frameworks for Recognition of Mandarin Syllables with Tones Using Sub-syllabic Units, Speech Communication, 1996, Vol. 18, pp. 175-190.

Z. Sakka, A. Kachouri and M. Samet Speech Denoising and Arabic Speaker Recognition System Using Subband Approach, International Review on Computers and Software (IRECOS), 2007, Vol. 2. n. 3,pp. 264 – 271.

B.B. Mosbah, Speech Recognition for Disabilities People, in Proc. of IEEE International Conference on Information and Communication Technologies, 2006, Vol. 1, pp. 864-869.

Dr.R.L.K.Venkateswarlu, Dr. R. V. Kumari and G.Vani Jayasri Speech Recognition using Radial basis Function Neural Network, IEEE, 2011.

C.R Houk, J. A. Joines, and M. G. Kay, A genetic algorithm for function optimization: a MATLAB implementation, North Carolina University NCSUIE technical, 1995, report 95-09.

Z. Michalewicz, Genetic Algorithms + Data Structure= Evolution Programs Adaptive, AI series, Springer Verlag, NewYork, 1996.

Md. Rabiullslam, Md. F. Rahmant and M.A. Goffar Khant, Improvement of Speech Enhancement Techniques for Robust Speaker Identification in Noise, Proceedings of 2009 12th International Conference on Computer and Information Technology (ICCIT 2009) Dhaka, Bangladesh.

A. Zabidi, et al., Mel-Frequency Cepstrum Coefficient Analysis of Infant Cry with Hypothyroidism, 5th Int. Colloquium on Signal Processing & Its Applications, Kuala Lumpur, Malaysia, 2009.

X. Yuan, Auditory Model-Based Bionic Wavelet Transform for Speech Enhancement", Master's thesis, Marquette University, Milwaukee, WI, USA, 2003.

O. Sayadi and M.B. Shamsollahi, Multiadaptive Bionic Wavelet Transform: Application to ECG Denoising and Baseline Wandering Reduction, EURASIP Journal of Applied Signal Processing, 2007.

J. Yao and Y. T. Zhang, Bionic wavelet transform: A new time-frequency method based on an auditory model, IEEE Transactions on Biomedical Engineering, 2001, vol.48, no.8, pp.856-863.

J. Yao and Y. T. Zhang, The application of bionic wavelet transform to speech signal processing in cochlear implants using neural network simulations, IEEE Transactions on Biomedical Engineering, 2002, vol.49, no.11, pp.1299-1309.

M. Talbi, L. Salhi, S. Abid, A. Cherif, Recurrent Neural Network and Bionic Wavelet Transform for speech enhancement, Int. J. Signal and Imaging Systems Engineering, 2010, vol.3, no.2, pp.93-101.

M. N. Huda, M.M Hasan, F. Hassan, M. R. A. Kotwal, G. Muhammad and C. M. Rahman, Articulatory Feature Extraction for Speech Recognition using Neural Network,International Review on Computers and Software (IRECOS), 2011,Vol. 6 N. 1 pp. 25-31.

Aggarwal, R.K., Application of genetically optimized neural networks for Hindi speech recognition system, Information and Communication Technologies (WICT), 2011 World Congress on.

G. Renner, A. Ekart, Genetic Algorithms in Computer Aided Design, Computer Aided Design, 35(8), 709-726, 2003.

R. Sahraeian, B. Zamani, A. Akbari and A. Ayatollahi, Eigenspace-Based MLLR Adaptation Using MCE, International Review on Computers and Software (IRECOS), 2010, Vol. 5 N. 6, pp. 628-634.

Refbacks

There are currently no refbacks.

Username
Password
Remember me