Multi Channel Voice Active Detection Using Instance Filed Auto-Interrelation Function


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


This research work presents a method of Voice Activity Detection (VAD) in a multi-channel communication for vigorous noise reduction using the Noise Strong Voice Speech Discovery system. This method is based on the amalgamation of two phases. The initial phase utilizes the utmost apex of the normalized Instance Filed Auto-Interrelation function in multi channel communication. The second phase uses a strange mixing of cross- interrelation and zero-transit rate of the standardize auto-interrelation to the suitable measure of signal pitch and periodicity for strong noise reduction. Existing VAD coding techniques in multi channel communication are not appropriate to specific location and disgrace the speech quality while decoding. The score outcomes of the two phases are combined using the TotalWeigh Fusion to create the proposed Auto-Interrelation Zero Transit Rate (AIZT) Voice Active Detection. Accuracy of the AIZT is compared with the standardized LTSV-based VAD scheme and VAD methods to outperform the best accomplishing method. Performance of the Auto-Interrelation Zero Transit Rate (AIZT) Voice Active Detection is measured in terms of multi channel communication efficiency, noise level. Experiments are conducted in the M2VTS audio-visual database, TIMIT dataset and the Dutch subset of a test database to evaluate the communication in multiple channels.
Copyright © 2013 Praise Worthy Prize - All rights reserved.

Keywords


Voice Active Detection; Multi Channel Communication; Utmost Apex; Auto-Interrelation Zero Transit Rate; Total Weigh Fusion; Instance Filed

Full Text:

PDF


References


Yesenia Lacouture-Parodi., Emanu¨el A. P. Habets., Jacob Benesty., “Multichannel Noise Reduction Wiener Filter In The Karhunen-Loe`Ve Expansion Domain,” IEEE Transaction on 2012

Xianglong Liu., Yuan Liang., Yihua Lou., He Li., Baosong Shan., “Noise-Robust Voice Activity Detector Based on Hidden Semi-Markov Models.,” International Conference on Pattern Recognition, 2010

Yasunari Obuchi., Ryu Takeda., and Naoyuki Kanda., “Voice Activity Detection Based on Augmented Statistical Noise Suppression,” Signal & Information Processing Association Conference (APSIPA ASC), 2012

Lee Ngee Tan., Bengt J. Borgstrom., and Abeer Alwan., “Voice Activity Detection Using Harmonic Frequency Components In Likelihood Ratio Test,” IEEE Transaction on ICASSP 2010

M. H. Moattar., and M. M. Homayounpour., and Nima Khademi Kalantari., “A New Approach For Robust Realtime Voice Activity Detection Using Spectral Pattern,” IEEE 978-1-4244-4296-6/10/$25.00, 2010

Prasanta Kumar Ghosh., Andreas Tsiartas., and Shrikanth Narayanan., “Robust Voice Activity Detection Using Long-Term Signal Variability,” IEEE Transactions On Audio, Speech, And Language Processing, Vol. 19, No. 3, March 2011

Mehrez Souden., Jingdong Chen., Jacob Benesty., and Sofiène Affes., “An Integrated Solution for Online Multichannel Noise Tracking and Reduction,” IEEE Transactions On Audio, Speech, And Languagen Processing, vol. 19, no. 7, September 2011.

M. H. Moattar., and M. M. Homayounpour., “A Simple But Efficient Real-Time Voice Activity Detection Algorithm,” European Signal Processing Conference (EUSIPCO 2009)

C. B. O. Lopes., A. L. Gonc¸alves., J. Scharcanski., and C. R. Jung., “Color-Based Lips Extraction Applied To Voice Activity Detection”, IEEE on Image Processing, 2011

Kim Ngo., Marc Moonen., Søren Holdt Jensen., and Jan Wouters., “A Flexible Speech Distortionweighted Multi-Channel Wiener Filter For Noise Reduction In Hearing Aids,” IEEE on Acoustics, Speech and Signal Processing (ICASSP), 2011

Datao You., Jiqing Han., Guibin Zheng., Tieran Zheng., “Sparse Power Spectrum Based Robust Voice Activity Detector”, IEEE ICASSP 2012

Vaclav Bolom., Jan Ingerle., Pavel Sovka., “Derivation of Criterion Suitable for Evaluation of Multi channel Noise Reduction Systems for Speech Processing,” IEEE on Radio Engineering, Vol. 18, No. 4, December 2009

Larkhoon Leem., Hyungmin Cho., Jason Bau., Quinn A. Jacobson., Subhasish Mitra1., “ERSA: Error Resilient System Architecture for Probabilistic Applications,” IEEE 978-3-9810801-6-2/DATE10, 2010


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize