Open Access Open Access  Restricted Access Subscription or Fee Access

Speech Signal Enhancement Using Empirical Mode Decomposition and Adaptive Method Based on the Signal for Noise Ratio Objective Evaluation


(*) Corresponding author


Authors' affiliations


DOI: https://doi.org/10.15866/irecos.v9i8.1588

Abstract


This paper introduces a new speech enhancement algorithm. It is mainly based on the empirical mode decomposition (EMD) and the improved thresholding technique applied on the selected intrinsic mode functions (IMFs). The proposed technique is a fully data driven approach. Firstly, the noisy speech signal is iteratively decomposed via the EMD algorithm into N oscillatory components called IMFs. Secondly, an adaptive method is based on the SNR_OUT. Objective evaluation is applied, to lead the IMFs to be thresholded by giving the highest SNR_OUT value. This is meant to reconstruct a set of N enhanced speech signals. The first is obtained by thresholding only the first IMF and by keeping the others unprocessed. The second is reconstructed by gathering the sum of the two first thresholded modes and the rest of the signal. This procedure is repeated consequently until obtaining a reconstructed signal from all thresholded IMFs and the residue. The SNR_OUT value is calculated then at every iteration. Once this thresholding procedure is completed, we retain only the case corresponding to the maximum SNR_OUT. The proposed algorithm is evaluated by using the speech signals taken from the NOISEUS database and corrupted with the additive white Gaussian noise. Our approach holds also a comparison with the other states of the art speech enhancement techniques based on the EMD analysis process.
Copyright © 2014 Praise Worthy Prize - All rights reserved.

Keywords


Speech Signal; Empirical Mode Decomposition; Improved Thresholding; SNR Out

Full Text:

PDF


References


M. Berouti, R. Schwartz and J. Makhoul, Enhancement of Speech Corrupted by Acoustic Noise, IEEE International Conference on ICASSP'79, pp. 208-211, Washington, USA, 1979.

S. F. Boll, Suppression of Acoustic Noise in Speech using Spectral Subtraction, IEEE Transactions on Acoustics Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, 1979.
http://dx.doi.org/10.1109/TASSP.1979.1163209

N. Virag, Single channel speech enhancement based on masking properties of the human auditory system, IEEE Transactions Speech Audio Process, vol. 7, no. 2, pp. 126–137, 1999.
http://dx.doi.org/10.1109/89.748118

L. Yang and P. C. Loizou, A geometric approach to spectral subtraction, Speech Communication, vol. 50, Issue 6, pp. 453-466, 2008.

K. Paliwal, K. Wojcicki and B. Schwerin, Single-channel speech enhancement using spectral subtraction in the short-time modulation domain, Speech communication, vol. 52, Issue 5, pp. 450-475, 2010.
http://dx.doi.org/10.1016/j.specom.2010.02.004

J. G. Proakis, C. Rader and F. Ling et al., Advanced Topics in Signal Processing, New York, Macmillan, 1992.

K. K. Paliwal and A. Basu, A speech enhancement method based on Kalman filtering, IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 177-180 Dallas, Texas, 1987.

J. S. Lim and V. Oppenheim, Enhancement and Bandwidth Compression of Noisy Speech, Proceeding of the IEEE, vol. 67, no. 12, pp. 1586- 1604, 1997.
http://dx.doi.org/10.1109/PROC.1979.11540

Y. Ephraim and D. Malah, Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, IEEE Trans. Acoust., Speech, Signal Process, vol. ASSP-33, no. 2, pp.443-445, 1985.
http://dx.doi.org/10.1109/TASSP.1985.1164550

Y. Ephraim and D. Malah, Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, IEEE Trans. Acoust., Speech, Signal Process, vol. ASSP-32, no. 6, pp.1109 -1121, 1984.
http://dx.doi.org/10.1109/TASSP.1984.1164453

T. FILLON, Traitement Numérique du Signal Acoustique pour une Aide aux Malentendants, Ph.D. dissertation, École Doctorale d'Informatique, Télécommunications et Électronique de Paris, décembre 2004.

D. Tufts, R. Kumaresan and I. Kirsteins, Data adaptive signal estimation by singular value decomposition of a data matrix, proceeding of the IEEE, vol. 70, no. 6, pp. 684-685, 1982.

Y. Ephraim and H. L. Van Trees, A signal subspace approach for speech enhancement, IEEE Transactions on Speech and Audio Processing, vol. 3, no. 4, pp. 251-266, 1995.
http://dx.doi.org/10.1109/89.397090

U. Mittal and N. Phamdo, Signal/noise KLT based approach for enhancing speech degraded by colored noise, IEEE Transactions on Speech and Audio Processing, vol. 8, no. 2, pp. 159-167, 2000.
http://dx.doi.org/10.1109/89.824700

T. Hu, L. C. De Silva and K. Sengupta, A hybrid approach of NN and HMM for facial emotion classication, Pattern Recognition Letters, vol. 23, no. 11, pp. 1303-1310, 2002.
http://dx.doi.org/10.1016/S0167-8655(02)00079-X

H. Lev-Ari and Y. Ephraim, Extension of the signal subspace speech enhancement approach to colored noise, IEEE Signal Processing Letters, vol. 10, no. 4, pp. 104-106, 2003.
http://dx.doi.org/10.1109/LSP.2003.808544

Y. Hu, Subspace and Multitaper, Methods for Speech Enhancement, Ph.D. Dissertation. Univ. of Texas at Dallas, 2003.

Justin, J., Vennila, I., Enhancement of speech signals using weighted mask and neuro-fuzzy classifier, (2013) International Review on Computers and Software (IRECOS), 8 (11), pp. 2704-2717.

Balaji, V.R., Subramanian, S., A discrete fractional cosine transform based speech enhancement system through Adaptive Kalman filter Combined with perceptual weighting filter with pitch synchronous analysis, (2013) International Review on Computers and Software (IRECOS), 8 (9), pp. 2288-2295.

Yao, C.-C., Hung, R.-W., A hybrid microphone array filter for speech enhancement, (2011) International Review on Computers and Software (IRECOS), 6 (5), pp. 640-651.

Z. Sakka, A. Kachouri, M. Samet, Speech Denoinsing and Arabic Speaker Recognition System Using Subband Approach, (2007) International Review on Computers and Software (IRECOS), 2 (3), pp. 264 - 271.

S. G. Mihov, R. M. Ivanov and A. N. Popov, Denoising Speech Signals by Wavelet Transform, Annual Journal of Electronics, 2009, ISSN 1313-1842.

D.L. Donoho, Denoising by Soft thresholding, IEEE Trans. on Information Theory, vol. 41, no. 3, pp. 613-627, 1995.
http://dx.doi.org/10.1109/18.382009

N. E. Huang, Z. Shen, S. R. Long, M. C. Wu, H. H. Shin, Q. Zheng, N. C. Yen, C. C. Tung, and H. H. Liu, The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proc. R. Soc. Lond. A, Math. Phys. Sci., vol. 454, no. 1971, pp. 903–995, 1998.

I. Y. Soon, S. N. Koh, and C. K. Yeo, Noisy speech enhancement using discrete cosine transform, Speech Communication, vol. 24, no. 3, pp. 249–257, 1998.
http://dx.doi.org/10.1016/S0167-6393(98)00019-3

K. Khaldi, A. O. Boudraa, A. Bouchikhi, M. T. and H. Alouane, Speech Enhancement via EMD," Proc. EURASIP J. Adv. Signal Process, vol. 2008, 2008.

J. Tariqullah and W. Wang, Empirical mode decomposition for joint denoising and dereverberation, Proc. EUSIPCO 2011, Barcelona, Spain, 2011.

A. O. Boudraa and J.C. Cexus, EMD-Based Signal Filtering, IEEE Transactions on Instrumentation and Measurement, vol. 56, no. 6, pp. 2196–2202, 2007.
http://dx.doi.org/10.1109/TIM.2007.907967

A.O. Boudraa, J.C. Cexus and Z. Saidi, EMD-Based Signal Noise Reduction, International Journal of Signal Processing, vol. 1 no. 1, pp. 33-37, 2004.

H. Issaoui, A. Bouzid and N. Ellouze, Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode Functions, Signal Processing: An International Journal (SPJI), vol. 5, Issue 3, pp. 93-100, 2011.

L. She, Z. Xu, S. Zhang and Y. Song, Denoising of ECG Based on EMD Improved-thresholding And Mathematical Morphology Operation, Proc. IEEE BMEIN 2010, pp. 838–842, 2010.

A. M. Atto, D. Pastor and G. Mercier, Detection threshold for non-parametric estimation, Signal, Image and Video Processing, Springer, vol. 2, no. 3, pp. 207–223, 2008.
http://dx.doi.org/10.1007/s11760-008-0051-x


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize