
Spoken Digits Recognition Using Wavelet Transform and Power Spectrum Density Estimation




DOI: https://doi.org/10.15866/irecos.v11i1.8261

Abstract


This work develops an automatic recognition system for isolated spoken words based on power spectrum density estimation and similarity measurements. A pre-processing step prepares the signal for the subsequent stages. For feature extraction, we apply the discrete wavelet transform to the speech signal and then use Welch's method to estimate the power spectrum density. At the matching stage, we measure the similarity between power spectra using a discrete-to-continuous algorithm. In our experiments, the system achieves high recognition rates with the parameters used.
Copyright © 2016 Praise Worthy Prize - All rights reserved.
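
The pipeline described in the abstract (wavelet decomposition, Welch power spectrum estimation, then spectrum matching) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Haar wavelet, the two decomposition levels, the Hann window, the 64-sample segments with 50% overlap, and in particular the cosine similarity, which merely stands in for the discrete-to-continuous matching algorithm that the abstract names but does not detail. The function names are hypothetical.

```python
import numpy as np

def haar_dwt(x):
    """One level of the Haar DWT; returns (approximation, detail) coefficients."""
    x = np.asarray(x, dtype=float)
    if len(x) % 2:                       # pad odd-length input by repeating the last sample
        x = np.append(x, x[-1])
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)

def welch_psd(x, nperseg=64, overlap=0.5):
    """Welch estimate: average of windowed periodograms over overlapping segments.

    The result is a relative spectrum (not scaled by the sampling rate).
    """
    x = np.asarray(x, dtype=float)
    step = max(1, int(nperseg * (1 - overlap)))
    window = np.hanning(nperseg)
    scale = np.sum(window ** 2)
    starts = range(0, len(x) - nperseg + 1, step)
    periodograms = [np.abs(np.fft.rfft(x[i:i + nperseg] * window)) ** 2 / scale
                    for i in starts]
    if not periodograms:
        raise ValueError("signal is shorter than one segment")
    return np.mean(periodograms, axis=0)

def features(signal, levels=2):
    """Assumed feature vector: Welch spectrum of the level-`levels` approximation."""
    approx = np.asarray(signal, dtype=float)
    for _ in range(levels):
        approx, _detail = haar_dwt(approx)
    return welch_psd(approx)

def similarity(p, q):
    """Cosine similarity between two spectra (a stand-in, not the paper's measure)."""
    return float(np.dot(p, q) / (np.linalg.norm(p) * np.linalg.norm(q)))

def recognize(signal, templates):
    """Return the label of the stored template spectrum most similar to the input."""
    feats = features(signal)
    return max(templates, key=lambda label: similarity(feats, templates[label]))
```

As a usage example, one could store `features(...)` of one reference recording per digit in a dictionary and pass it to `recognize` together with a new utterance. A real system would add the pre-processing stage (for example endpoint detection) before feature extraction.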

Keywords


Automatic Speech Recognition; Power Spectrum Density; Discrete Wavelet Transform; Similarity Measurements






