Open Access Open Access  Restricted Access Subscription or Fee Access

Stop Consonant-Short Vowel (SCSV) Classification for Tamil Speech Utterances


(*) Corresponding author


Authors' affiliations


DOI: https://doi.org/10.15866/irecos.v11i2.8481

Abstract


Tamil Language is one of the ancient Dravidian languages spoken in south India. Most of the Indian languages are syllabic in nature and syllables are in the form of Consonant-Vowel (CV) units. In Tamil language, CV pattern occurs in the beginning, middle and end of a word. In this work, CV units formed with Stop Consonant – Short Vowel (SCSV) were considered for classification task. The work carried out in three stages, Vowel Onset Point (VOP) detection, CV segmentation and classification. VOP is an event at which the consonant part ends and vowel part begins. VOPs are identified using linear Prediction residuals which provide significant characteristics of the excitation source. To segment the CV units, fixed length spectral frames before and after VOPs are considered. Both production based features - Linear Predictive Cepstral Coefficients (LPCC) and perception based features - Mel Frequency Cepstral Coefficients (MFCC) are extracted and given as input to the classifiers designed with multilayer perceptron and support vector machine. A speech corpus of 200 Tamil words uttered by 15 native speakers was used, which covers all SCSV units formed with Tamil stop consonants (/k/,/ch/,/d/,/t/,/p/) and short vowels (/a/,/i/, /u/, /e/, /o/). The classifiers are trained and tested for its performance using various measures. The results indicate that the model built with MFCC using support vector machine RBF kernel outperforms.
Copyright © 2016 Praise Worthy Prize - All rights reserved.

Keywords


Consonant-Vowel Segmentation; Vowel Onset Point; Linear Prediction Residual; Mel Frequency Cepstral Coefficients; Multilayer Perceptron; Support Vector Machine

Full Text:

PDF


References


C. C. Sekhar, S. M. Santhosh and B. Yegnanarayana, A modular approach for recognition of isolated stop-consonant-vowel (SCV) utterances in Indian Languages,(1995) Journal of the Acoustic Society of India, 23 (1), pp.28-35.

S. Ilakkuvanar, Tholkappiyam (in English), (1963) India:Kural Neri Publishing House.

K. Karunakaran, and V. Jeya, moZhiyiyal, (1997) India: Chennai Kavitha Pathippakam.

A .Rathinavelu, G.Anupriya, and A.S.Muthanantha Murugavel, Speech Recognition Model for Tamil Stops, (2007) International conference of Computational Intelligence and Intelligent Systems (ICCIIS’07),London.

K. Geetha, and E. Chandra, Monosyllable Isolated Word Recognition for Tamil language using Continuous Density Hidden Markov Model, (2015) IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT), pp.1-6.
http://dx.doi.org/10.1109/icecct.2015.7226056

C. C. Sekhar and B. Yegnanarayana, Recognition of stop-consonant-vowel (SCV) segments in continuous speech using neural network models, (1996) Journal of the Institution of Electronics and Telecommunication Engineers, 42 (4) and 5, July-October, pp. 269-280.
http://dx.doi.org/10.1080/03772063.1996.11415933

S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, Detection of vowel onset points in continuous speech using autoassociative neural network models, (2004) In Proceedings of International conference on spoken language processing, Korea, October, pp. 1081-1084.

S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, Spotting multilingual consonant-vowel units of speech using neural network models, (2005) ISCA Tutorial and Research Workshop on Nonlinear Speech Processing, Barcelona, pp. 287-297, April.
http://dx.doi.org/10.1007/11613107_27

A. K. Vuppala, K. Sreenivasa Rao and Saswat Chakrabarti, Improved consonant-vowel recognition for low bit-rate coded speech, (2011) Wiley International Journal of Adaptive Control and Signal Processing, 26 (4), pp. 333-349.
http://dx.doi.org/10.1002/acs.1286

T. M. Thasleema and N. K. Narayanan, Wavelet transform based consonant - vowel (CV) classification using SVMs,(2012) ICONIP 2012, Part II, LNCS 7664, Springer, pp. 250-257.
http://dx.doi.org/10.1007/978-3-642-34481-7_31

V. Anantha Natarajan and S. Jothilakshmi, Segmentation of continuous speech into consonant and vowel units using formant frequencies, (2012) International Journal of Computer applications, 56 (15), October.
http://dx.doi.org/10.5120/8968-3185

S. R. M. Prasanna, S. V. Gangashetty, and B. Yegnanarayana, Significance of vowel onset point for speech analysis, (2001) In Proceedings of international conference on signal processing and communications, pp. 81–88.

D. J. Hermes, Vowel onset detection, (1990) The Journal of the Acoustical Society of America, 87, pp. 866-873.
http://dx.doi.org/10.1121/1.398896

S. V. Gangashetty, C. C. Sekhar and B. Yegnanarayana, Detection of vowel onset points in continuous speech using autoassociative neural network models, (2004) In Proceedings of the international conference on spoken language processing, pp. 401-410.

S. V. Gangashetty, C. C. Sekhar, and B. Yegnanarayana, Spotting multilingual consonant-vowel units of speech using neural network models, (2005) In Nonlinear Analyses and Algorithms for Speech Processing, LNCS 3817, Springer, pp. 303-317.
http://dx.doi.org/10.1007/11613107_27

A. K. Vuppala and K. Sreenivasa Rao, vowel onset point detection for noisy speech using spectral energy at formant frequencies, (2013) International Journal of Speech Technology, Springer, 16, pp. 229-235, June.
http://dx.doi.org/10.1007/s10772-012-9179-8

A. Akila and E.Chandra E, Syllable Based Continuous Speech Recognizer with Varied Length Maximum Likelihood Character Segmentation, (2013) IEEE Conference ICACCI 2013, pp.935-940, August.
http://dx.doi.org/10.1109/icacci.2013.6637302

S. R. M. Prasanna and B. Yegnanarayana, Detection of vowel onset point events using excitation source information, (2005) In Proceedings of Interspeech, pp. 1133–1136.

D. Gabor, Theory of Communication,(1946) Journal of IEE, 93, pp. 429-457.

A. K. Vuppala, K. Sreenivasa Rao and Saswat Chakrabarti, Spotting and recognition of consonant-vowel units from continuous speech using accurate vowel onset points,( 2012) Circuits, Systems, and Signal Processing, 31(4), pp. 1459-1474.
http://dx.doi.org/10.1007/s00034-012-9391-4

Khalid A. Darabkh, Ala F. Khalifeh, Iyad F. Jafar, Baraa A. Bathech, and Saed W. Sabah, A Yet Efficient Communication System with Hearing-Impaired People Based on Isolated Words of Arabic Language, (2013) IAENG International Journal of Computer Science, 40(3), August.

M. Anusuya and S. Katti, Front end analysis of speech recognition: a review, (2011) Int. J. Speech Technology, 14(2), pp. 99–145.
http://dx.doi.org/10.1007/s10772-010-9088-7

L. Rabiner and B.H. Juang, Fundamentals of Speech Recognition,( 1993) Signal Processing Series, New Jersey: PTR Prentice Hall.

S.B. Davis, and P. Mermelstein,(1980) Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4),pp. 357–366.
http://dx.doi.org/10.1109/tassp.1980.1163420

M. Tim Jones, “Artificial Intelligence: A Systems Approach, Jones & Bartlett Learning, USA, 2015

I. Steinwart, and A. Christmann, Support Vector Machines (Information Science and Statistics), (2008), Springer
http://dx.doi.org/10.1007/978-0-387-77242-4

J.Bell, Support Vector Machines, in Machine Learning: Hands-On for Developers and Technical Professionals, (2014),John Wiley & Sons, Inc, Indianapolis, IN, USA
http://dx.doi.org/10.1002/9781119183464.ch7

K. P. Soman, R. Loganathan, and V. Ajay, Machine Learning with SVM and Other Kernal Methods , (2009), Prentice-Hall of India.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize