Authorship Attribution in Tamil Language Email for Forensic Analysis


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


This paper presents Authorship attribution (AA) to Tamil language email. This work presents generation of representative signatures of Tamil emails using lexical and syntactic based methods. The signature of each email has large dimensions. In order to make it suitable for subsequent processing, conversion of large dimension of the signature into 2-dimensional pattern using Fisher’s linear discriminant function (FLD) method is given. The 2-dimensional patterns of the signatures are used as training data for the radial basis function (RBF) network and echo state neural network (ESNN). The improved classification of Tamil email is shown by transformation of patterns using FLD followed by training using RBF, as well as, training ESNN. The paper presents a new technique for building signature database and for optimal AA in Tamil email forensics.
Copyright © 2013 Praise Worthy Prize - All rights reserved.

Keywords


Echo State Neural Network; Tamil Email; Lexical Features; Syntactic Features; Discriminant Function; Radial Basis Function

Full Text:

PDF


References


Abbasi, A., & Chen, H., (2005), Applying authorship analysis to extremist-group Web forum messages, IEEE Intelligent Systems, 20(5), 67-75.

Argamon, S., Koppel, M., Fine, J., & Shimoni, A., (2003), Gender, Genre, and Writing Style in Formal Written Texts, Text and Talk, 23(3), 321-346.

Argamon, S., Koppel, M., Pennebaker, J., & Schler, J., (2009), Automatically Profiling the Author of an Anonymous Text, Communications of the ACM.,52(2),119-123.

Baayen, H., van Halteran, H., Neijt, A., & Tweedie, F. (2002), An Experiment in Authorship Attribution, JADT 2002 : 6th Journ´ees internationales d’Analyse statistique des Donn´ees Textuelles.

Bagavandas, M., Hameed Abdul & Manimannan, G., (2009), Neural Computation in Authorship Attribution: The Case of Selected Tamil Articles, Journal of Quantitative Linguistics, 16(2), 115-131.

Binongo, J. N. G., (2003), Who wrote the 15th Book of Oz? An application of multivariate analysis to authorship attribution, Chance 16(2), 9-17.

Corney, M., de Vel, O., Anderson, A., & Mohay, G., (2002), Gender-Preferential Text Mining of E-mail Discourse, ACSAC '02 Proceedings of the 18th Annual Computer Security Applications Conference, 282.

De Vel, O., Anderson, A., Corney, M., Mohay, G. M. (2001), Mining e-mail content for author identification forensics. ACM SIGMOD, 30(4),55-64.

Diederich, J., Kindermann, J., Leopold, E., and Paass, G., (2003), Authorship Attribution with Support Vector Machines, Journal Applied Intelligence archive, 19(1-2),109-123.

Farkhund Iqbal, Hamad Binsalleeh, Benjamin C.M. Fung & Mourad Debbabi. (2010). Mining writeprints from anonymous e-mails for forensic investigation, Digital Investigation, 1-9.

Farkhund Iqbal, Rachid Hadjidj, Benjamin, C.M., Fung, & Mourad Debbabi. (2008). A novel approach of mining write-prints for authorship attribution in e-mail forensics, Digital Investigation, 42-51.

Genkin, A., David D. Lewis, & Madigan, D., (2007), Large-scale Bayesian logistic regression for text categorization, American Statistical Association and the American Society for Quality TECHNOMETRICS, 49(3),291-304.

Graham, N., Hirst, G., & Marthi, B., (2005), Segmenting documents by stylistic character, Natural Language Engineering, 11(4), 397-415.

Grieve, J., (2007). Quantitative authorship attribution: An evaluation of techniques. Literary and Linguistic Computing, 22 (3), 251-270.

Holmes, D. I., Gordon, L., & Wilson, C. (2001), A Widow and her Soldier: Stylometry and the American Civil War, Literary and Linguistic Computing, 16(4), 403-420.

Holmes, D. I., Robertson, M., & Paez, R., (2001), Stephen Crane and the New-York Tribune: A case study in traditional and non-traditional authorship attribution, Computers and the Humanities, 35(3), 315-331.

Holmes, D.I.,.(1992)., A stylometric analysis of Mormon scripture and related texts. Journal of the Royal Statistical Society, Series A, 155(1), 91–120.

Jaeger H., (2001) ,Short term memory in echo state networks, GMD Report 152, German National Research Center for Information Technology.

Jaeger H., (2001), The echo state approach to analyzing and training recurrent neural networks, GMD Report 148, German National Research Center for Information Technology.

Koppel, M., & Schler, J., (2003), Exploiting Stylistic Idiosyncrasies for Authorship Attribution, in Proceedings of IJCAI'03 Workshop on Computational Approaches to Style Analysis and Synthesis, 69-72.

Koppel, M., Argamon, S., Shimoni, A.R., (2002), Automatically categorizing written texts by author gender,Literary and Linguistic Computing 17(4), 401-412.

Luyckx, K., & Daelemans W., (2008), Authorship Attribution and Verification with Many Authors and Limited Data.. Presented at the 20th Belgian-Netherlands Conference on Artificial Intelligence (BNAIC 2008), Enschede, The Netherlands.

Madigan, D., Genkin, A., Lewis, D.D., Argamon, S., Fradkin, D., & Ye, L., (2005), Author Identification on the Large Scale, Proc. of Classification Society of N. America.

Mary Amala Bai V., Manimegalai D, (2013), A Document Level Measure for Text Categorization, International Review on Computers and Software,8(6),1374-1381.

Purushothaman S.and Suganthi D., (2008), fMRI segmentation using echo state neural network, International Journal of Image Processing,.2(1), .1-9.

Sambasiva Rao Baragada, Ramakrishna, S., Rao, M.S., & Purushothaman, S., (2009) Implementation of Radial Basis Function Neural Network for Image Steganalysis. International Journal of Computer Science and Security, 2, 12-22.

Stamatatos, (2009). A survey of modern authorship attribution methods, Journal of the American Society for Information Science and Technology, 60(3), 538–556.

Zhao, Y., & Zobel, J., (2005), Effective authorship attribution using function word, in 'Proc. 2nd AIRS Asian Information Retrieval Symposium', Springer, 174-190.

Zheng, R., Li, J., Chen, H., & Huang, Z., (2006). A framework for authorship identification of online messages: Writing style features and classification techniques. Journal of the American Society of Information Science and Technology, 57(3), 378-393.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize