Performance Evaluation of the Hearing Impaired Speech Recognition in Noisy Environment


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


Most of the noise suppression algorithms concentrated on normal hearing listeners. This study investigates the use of Recursive least square filter in improving automatic speech recognition of the hearing impaired speech in noisy environment. The children who are deaf or hearing impaired suffers in many ways regarding education and in public places to communicate with the normal speakers since their voices are difficult to understand due to guttural and monotone nature even though they undergone speech therapy. Because some of the sounds cannot be taught, only we can feel. To develop the recognition system for their speeches in practical situation, we have analyzed the recognition accuracy of the hearing impaired speech in various noise environments where their speech becomes still worse due to background noises. Here we have considered babble, white and factory noise in different SNR levels such as 10db, 5db, 0db,-5db and the recognition results were obtained as 27.8% for babble noise, 10.6% for white noise and 13.6% for factory noise when SNR is 0db. Spectral subtraction is initially applied and due to its poor performance it is shown that, application of Adaptive recursive least square filter improves significantly the recognition performance with filter order 128, even for the low SNR level 0db, such as 70.0% for babble noise, 67.4% for white noise and 61% for factory noise. The result suggests that regardless of impaired speech, in the practical noisy environment also we can achieve comparable recognition accuracy for hearing impaired speeches.
Copyright © 2013 Praise Worthy Prize - All rights reserved.

Keywords


Mel Frequency Cepstral Coefficients (MFCC); Perceptual Linear Prediction Coefficients (PLP); Speech Recognition (SR); Deaf Or Hearing Impaired Speech; Hidden Markov Model (HMM); Hidden Markov Model Tool Kit (HTK); Recursive Least Square (RLS) Filter; Leas

Full Text:

PDF


References


Nickerson R.S.,“Characteristics of the Speech of the deaf persons”, Volta Review, V-77, pp.342-362,1965.

Pickett,J.M and Costam, A. (1968), “A Visual Speech trainer with simplified indication to vowel spectrum”, American Annals of the deaf, V-113, pp. 253-258.

Nickerson R.S, Stevens, K.N., Boothroyd. A, and Rollins, A (1974), “Some observations on timing in the Speech of deaf and hearing speakers, Report no.2905.

Ruscello,D.M., Sholtis, D.M., and Moreau. V.K., Adult’s awareness of certain articulatory gestures, Perceptual And Motor Skills,V-50, 3(2), pp.1156-1158,1980.

Arnold,D. and Tremblay.A,(1979), “Interaction of Deaf and Hearing Pre-School Children”, Journal of Commun. Diorders, 12, 1979, pp. 245-251.

C.Jeyalakshmi, V.Krishnamurthi, A.Revathi (2012), “Building robust hmm models for speech recognition of Hearing impaired”, EE Times-India,eMedia Asia Ltd, page 1-11.

Vimala.C and Radha.V, (2012), "Optimal Adaptive Filtering Technique for Tamil Speech Enhancement", International Journal of Computer applications,Volume 41– No.17, pp 23-29.

Monson H.Hayes (1996), “Statistical digital signal processing and modeling”, John Wiley & Sons, Inc.

Soni Changlani, Dr. M.K. Gupta,(2011), “Applications and Simulation of Adaptive Filter in Speech Enhancement”, International journal of Advanced Computer Engineering and Architecture, , V- 1, No.1, pp 97-101.

C.Jeyalakshmi, V. krishnamurthi, A. Revathy, Deaf speech Assessment using digital processing techniques, Signal & Image Processing : An International journal (SIPIJ), vol.1,no.1, sep2010.

Phillips C Loizou “Speech enhancement theory and practice”,(2007) 1st edition, Boca Raton, CRC Press, Releases Taylor & Francis.

Boll,S.F. (1979), "Suppression of Acoustic Noise in Speech using Spectral Subtraction", IEEE Trans ASSP, V-27(2),pp-113-120.

Ravi Bolimera, Siva Prasad Nandyala T. Kishore Kumar (2012), “Speech Enhancement using Spectral Subtraction, Affine Projection Algorithms and Classical Adaptive Filters”, Proc. of the Intl. Conf. on Advances in Electronics, Electrical and Computer Science Engineering.

Anuradha R. Fukane, Shashikant L. Sahare, (2012),“Different Approaches of Spectral Subtraction method for Enhancing the Speech Signal in Noisy Environments”, International Journal of Scientific & Engineering Research, V- 2, Issue 5.

Yi Hu and Philipos C. Loizou,(2007),“Subjective comparison and evaluation of speech enhancement algorithms” IEEE Trans. Speech Audio Proc, V-49(7),pp 588–601.

Murty, K.S.R., & Yegnanarayana, B, (2006), “Combining evidence from residual phase and MFCC features for speaker recognition”, IEEE Signal Processing Letters, vol.13, No.1, , pp 52–55.

Shaughnessy D.O., Speech communication: human and machine, Addison-Wesley, 2003.

Lim Sin Chee,Ooi Chia Ai, M.Hariharan and Sazali Yaacob, (2009), “MFCC based recognition of repetitions and prolongations in stuttered speech using K-NN and LDA” Proc. of SCORED, vol.16-18, (IEEE) pp 416–419.

Umesh S. and Cohen L. and Nelson D. "Fitting the mel scale", Proc. ICASSP, vol.1 ,1999 (IEEE) pp 217–220.

H.Hermansky, “Perceptual linear predictive (PLP) analysis of speech”, J. Acoust. Soc. Am., vol. 87, pp.1738- 1752, 1990.

Steve Young, Gunnar Evermann, Thomas Hain, Dan Kershaw, Gareth Moore, Julian Odell, Dave Ollason, Dan Povey, Valtcho Valtchev, Phil Woodland, The HTK Book, version 3.2.1, copyright 1995-1999 Microsoft Corporation copyright 2001-2002 Cambridge University Engineering department.

L.R.Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition” Proceedings of the IEEE, vol.77, no. 8, Feb 1989.

Picone J, Signal modeling techniques in speech recognition, Proceedings of the IEEE, Vol.81,No.9, 1993, pp 1215–1247.

Pujol P, Pol S, Nadeu C, Hagen A, Bourlard H, “Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system”, IEEE Transactions on Speech and Audio processing, vol.13, Issue.1, page14-22, 2005.

S. Chehresa, M. H. Savoji, Speech Enhancement Based On Gaussian Mixture Modeling and Wiener Filtering, (2012) International Journal on Communications Antenna and Propagation (IRECAP), 2 (2), pp. 111-122.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize