A Novel Approach for English to Dravidian Language Rule Based Machine Translation


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


In this paper, propose a method for translating text from English to Tamil which is one of the Dravidian languages. Rule based machine translation technique is used here, which involves the formation of rules which helps in re-ordering of the syntactic structures of the source language sentence along with its dependency information which bring that close to the structure of the target sentence. The parser identifies the syntactical elements in English sentences and suggests its Dravidian language translation taking into account various grammatical forms of those Dravidian languages. The usage of the parser in developing the syntactic structure plays a major role in the translation process. There are mainly two types of rules used here, one is transfer link rule and the other is morphological rules. In this method, the transfer link rules are used for generating target structure. Morphological rules are used for assigning morphological features. Context Free Grammars (CFG) is used in generation of the language structures. By using this approach, given English text can be translated to its Tamil equivalent.
Copyright © 2013 Praise Worthy Prize - All rights reserved.

Keywords


Natural Language Processing; Machine Translation; Reordering; Parsing; Morphological Analysis; Lexicalization; Transliteration

Full Text:

PDF


References


S. Samantaray, “A Data mining approach for resolving cases of Multiple Parsing in Machine Aided Translation of Indian Languages”, In proceedings of Fourth IEEE International Conference on Information Technology, 2007. ITNG '07, 2-4 April 2007, pp.401 – 405, doi:10.1109/ITNG.2007.

Al-Taani, A.T., A new approach to the extraction of multiword terms from arabic texts, (2010) International Review on Computers and Software (IRECOS), 5 (2), pp. 129-133.

RemyaRajan, Remya Sivan, RemyaRavindran and K.P Soman, “Rule Based Machine Translation from English to Malayalam”, in the proceedings of the International Conference on Advances in Computing, Control, and Telecommunication Technologies, pp.439-441, 2009.

Poornima C , Dhanalakshmi V, Anand Kumar M and Soman K P, “Rule based Sentence Simplification for English to Tamil Machine Translation System”, International Journal of Computer Applications (0975 – 8887),Volume 25– No.8, July 2011.

R. Harshawardhan, Mridula Sara Augustine and K.P. Soman, “Phrase based English – Tamil Translation System by Concept Labeling using Translation Memory”, International Journal of Computer Applications (0975 – 8887), Volume 20– No.3, April 2011.

Rahul.C, Dinunath.K, RemyaRavindran, K.P.Soman, Department of Computational Engineering &NetworkingAmritaVishwaVidyapeetham, “Rule Based Reordering and Morphological Processing For English-Malayalam Statistical Machine Translation”, in the proceedings of the International Conference on Advances in Computing, Control, and Telecommunication Technologies, pp.458-460, 2009.

Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M.,Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C.,Bojar, O., Constantin, A. &Herbst, E. (2007). Moses: Open source toolkit for statistical machine translation. In ACL, The Association for Computer Linguistics.

Papineni, K., Roukos, S., Ward, T. & Zhu, W.J. (2001). Blue: a method for automatic evaluation of machine translation. In ACL ’02: Proceedings of the40th Annual Meeting on Association for Computational Linguistics, 311–318,Association for Computational Linguistics, Morristown, NJ, USA.

Yamada, K. & Knight, K. (2001). A syntax-based statistical translation model. In ACL ’01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, 523–530, Association for Computational Linguistics, Morristown, NJ, USA

Dorr, B.J., Jordan, P.W. & Benoit, J.W. (1999). A survey of current paradigms in machine translation. Advances in Computers, 49, 2–68

Lehmann, T. (1989). A Grammar of Modern Tamil .Pondicherry Institute of Linguistics and Culture.

Koehn, P., Och, F.J. &Marcu, D. (2003). Statistical phrase-based translation.InNAACL ’03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, 48–54, Association for Computational Linguistics, Morristown, NJ,USA.

Yang Liu, Dept. of Comput. Sci., Texas Univ., Richardson, TX, Shriberg, E. “Comparing Evaluation Metrics for Sentence Boundary Detection”, IEEE International Conference on Acoustics, Speech and Signal Processing, 2007.ICASSP 2007.

Mnasri, Z., Boukadida, F., Ellouze, N., Modeling segmental duration by statistical learning for an Arabic text-to-speech system, (2009) International Review on Computers and Software (IRECOS), 4 (5), pp. 533-542.

Wang, Y., Acero, A., and Chelba, C, “Is Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy”, IEEE Workshop on Automatic Speech Recognition and Understanding, St. Thomas, US Virgin Islands, 2003.

Subramanian, K. BBN Technol., Cambridge, Stallard, D. ,Prasad, R. , Saleem, S. , Natarajan, P., “Semantic translation error rate for evaluating translation systems”, IEEE Workshop on Automatic Speech Recognition and Understanding, 2007.

Myers, Jerome L., Well, Arnold D, “Research Design and Statistical Analysis”,Second Edition, Lawrence Erlbaum,pp. 508, ISBN 0-8058-4037-0,2003.

Maritz.J.S. “Distribution-Free Statistical Methods”, Chapman &Hall.ISBN 0-412-15940-6, page 217, 1981.

Boukadida, F., Ellouze, N., Modeling arabic prosody for a text-to-speech system, (2009) International Review on Computers and Software (IRECOS), 4 (3), pp. 337-343.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize