A Framework for SMS Spam and Phishing Detection in Malay Language: a Case Study

(*) Corresponding author

Authors' affiliations

DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)


Short Message Service (SMS) spam and SMS phishing has been increase nowadays especially in Malay language which is the first language for Malaysia country. Currently, many SMS spam in others language has been proposed, however not yet for Malay language and we are the first to propose these. In addition, this paper also analyst on several frameworks of SMS spam filtering for our SMS spam and phishing detection framework. From the analysis, the chosen framework has been enhanced for Malay SMS spam and phishing. The enhancement has been done on classification phase where our framework proposed dual classification. The classification 1 will classify the SMS into ham and scam SMS. For classification 2, the scam SMS will be classified again into SMS spam and SMS phishing. After dual classifications phase completed, the Malay SMS has been examined using Naïve Bayes and J48 unsupervised Machine Learning techniques. The result shows high accuracy in detecting Malay SMS ham, spam and phishing.
Copyright © 2014 Praise Worthy Prize - All rights reserved.


Detection; Filtering; Phishing; Security; SMS; Spam

Full Text:



S. S. Chandran and S. Murugappan, "Spam detection and elimination of messages from twitter," International Review on Computers and Software, vol. 8, pp. 2438-2443, 2013.

F-Secure, "Mobile Threat Report Q3 2012," F-Secure Labs2012.

M. Boodae, "Mobile Users Three Times More Vulnerable to Phishing Attacks," in Trusteer vol. 2012, ed, 2011.

I. Lookout, "Lookout Mobile Threat Report August 2011," 2011.

I. Joe and H. Shim, "An SMS Spam Filtering System Using Support Vector Machine," in Future Generation Information Technology. vol. 6485, T.-h. Kim, et al., Eds., ed: Springer Berlin Heidelberg, 2010, pp. 577-584.

K. Dunham, "Chapter 6 - Phishing, SMishing, and Vishing," in Mobile Malware Attacks and Defense, D. Ken, Ed., ed Boston: Syngress, 2009, pp. 125-196.

A. Kwee, et al., "Sentence-Level Novelty Detection in English and Malay," in Advances in Knowledge Discovery and Data Mining. vol. 5476, T. Theeramunkong, et al., Eds., ed: Springer Berlin Heidelberg, 2009, pp. 40-51.

Y. Hård af Segerstad, Use and Adaptation of Written Language to the Conditions of Computer-Mediated Communication: University of Gothenburg, 2002.

T. Ogle, "Creative Uses of Information Extracted from SMS Messages," Undergraduate, Computer Science, The University of Sheffield, 2005.

Cédrick Fairon and S. Paumier, "A Translated Corpus of 30,000 French SMS," in In Proceedings of Language Resources and Evaluation., 2006.

M. Choudhury, et al., "Investigation and modeling of the structure of texting language," Int. J. Doc. Anal. Recognit., vol. 10, pp. 157-174, 2007.

S. C. Herring and A. Zelenkauskaite, "Symbolic capital in a virtual heterosexual market abbreviation and insertion in Italian iTV SMS," Written Communication, vol. 26, pp. 5-31, 2009.

C. Bach and J. Gunnarsson, "Extraction of trends is SMS text," Master's thesis, Lund University, 2010.

D. Pietrini, "X’6:-(?”: The sms and the triumph of informality and ludic writing," Italienisch, vol. 46, pp. 92-101, 2001.

P. Schlobinski, et al., "Simsen. Eine Pilotstudie zu sprachlichen und kommunikativen Aspekten in der SMS-Kommunikation," Networx 22. Online-Publikationen zum Thema Sprache und Kommunikation im Internet, 2001.

T. Shortis, "’New Literacies’ and Emerging Forms: Text Messaging on Mobile Phones," presented at the International Literacy and Research Network Conference on Learning., 2001.

N. Doring, "1 bread, sausage, 5 bags of apples I.L.Y" - communicative functions of text messages (SMS)," Zeitschrift für Medienpsychologie 3, 2002.

Y. Hard af Sergerstad, Use and Adaptation of Written Language to the Conditions of Computer-Mediated Communication: University of Gothenburg, 2002.

E.-L. Kasesniemi and P. Rautiainen, "Mobile culture of children and teenagers in Finland," in Perpetual contact, ed: Cambridge University Press, 2002, pp. 170-192.

R. Grinter and M. Eldridge, "Wan2tlk?: everyday text messaging," presented at the Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Ft. Lauderdale, Florida, USA, 2003.

C. a. A. B. Thurlow, "Generation Txt? The sociolinguistics of young people's text messaging," Discourse Analysis Online 1(1), 30., 2003.

Yijue How and M.-Y. Kan, "Optimizing Predictive Text Entry for Short Message Service on Mobile Phones," presented at the In Proceedings of HCII, 2005.

Rich Ling and N. S. Baron, "Text Messaging and IM: Linguistic Comparison of American College Data," 2007.

M. Žic Fuchs and N. Tuđman Vuković, "Communication technologies and their influence on language: Reshuffling tenses in Croatian SMS text messaging," Jezikoslovlje, pp. 109-122, 2008.

D. Gibbon and M. Kul, "Economy Strategies in Restricted Communication Channels. A study of Polish short teхt messages," 2008.

A. Deumert and S. Oscar Masinyana, "Mobile language choices The use of English and isiXhosa in text messages (SMS) Evidence from a bilingual South African sample," English World-Wide, vol. 29, pp. 117-147, 2008.

I. Hutchby and V. Tanna, "Aspects of sequential organization in text message exchange," Discourse & Communication, vol. 2, pp. 143-164, 2008.

J. Walkowska, "Gathering and Analysis of a Corpus of Polish SMS Dialogues," Challenging Problems of Science. Computer Science. Recent Advances in Intelligent Information Systems, pp. 145-157, 2009.

C. Tagg, "A Corpus Linguistics Study of SMS Text Messaging," Doctor of Philosophy, Department of English, The University of Birmingham, Birmingham, 2009.

F. W. Elvis, "The sociolinguistics of mobile phone sms usage in cameroon and nigeria," The International Journal of Language Society and Culture, vol. 28, pp. 25-40, 2009.

S. N. Barasa, Language, mobile phones and internet: a study of SMS texting, email, IM and SNS chats in computer mediated communication (CMC) in Kenya, 2010.

A. B. Bodomo, "The Grammar of Mobile Phone Written Language," Chapter, vol. 7, pp. 110-198, 2010.

W. Liu and T. Wang, "Index-based online text classification for sms spam filtering," Journal of Computers, vol. 5, pp. 844-851, 2010.

S. Sotillo, "SMS Texting Practices and Communicative Intention," Chapter, vol. 16, pp. 252-265, 2010.

C. Dürscheid and E. Stark, "SMS4science: An international corpus-based texting project and the specific challenges for multilingual Switzerland," Digital Discourse: Language in the New Media: Language in the New Media, p. 299, 2011.

K. V. Lexander, "Names U ma puce: multilingual texting in Senegal," Working paper2011.

J. Elizondo, "Not 2 Cryptic 2 DCode: Paralinguistic Restitution, Deletion, and Nonstandard Orthography in Text Messages," Ph. D. thesis, Swarthmore College, 2011.

T. Chen and M.-Y. Kan, "Creating a live, public short message service corpus: the NUS SMS corpus," Language Resources and Evaluation, vol. 47, pp. 299-335, 2013/06/01 2013.

O. Salem, et al., "Awareness Program and AI based Tool to Reduce Risk of Phishing Attacks," in Computer and Information Technology (CIT), 2010 IEEE 10th International Conference on, 2010, pp. 1418-1423.

Q. Xu, et al., "SMS Spam Detection using Content-less Features," Intelligent Systems, IEEE, vol. PP, pp. 1-1, 2012.

J. W. Yoon, et al., "Hybrid spam filtering for mobile communication," Computers & Security, vol. 29, pp. 446-459, 2010.

H. Peizhou, et al., "A Novel Method for Filtering Group Sending Short Message Spam," in Convergence and Hybrid Information Technology, 2008. ICHIT '08. International Conference on, 2008, pp. 60-65.

G. V. Cormack, et al., "Content based SMS spam filtering," presented at the Proceedings of the 2006 ACM symposium on Document engineering, Amsterdam, The Netherlands, 2006.

M. Taufiq Nuruzzaman, et al., "Simple SMS spam filtering on independent mobile phone," Security and Communication Networks, vol. 5, pp. 1209-1220, 2012.

G. V. Cormack, et al., "Feature engineering for mobile (SMS) spam filtering," presented at the Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, Amsterdam, The Netherlands, 2007.

K. Yadav, et al., "SMSAssassin: crowdsourcing driven mobile-based system for SMS spam filtering," presented at the Proceedings of the 12th Workshop on Mobile Computing Systems and Applications, Phoenix, Arizona, 2011.

Y. C. Lim, et al., "Application of Genetic Algorithm in unit selection for Malay speech synthesis system," Expert Systems with Applications, vol. 39, pp. 5376-5383, 2012.

F. S. Tsai, et al., "Multilingual novelty detection," Expert Systems with Applications, vol. 38, pp. 652-658, 2011.

T. Subramaniam, et al., "Naïve Bayesian Anti-spam Filtering Technique for Malay Language."

T. S. Guzella and W. M. Caminhas, "A review of machine learning approaches to spam filtering," Expert Systems with Applications, vol. 36, pp. 10206-10222, 2009.

M. Z. Rafique, et al., "Application of evolutionary algorithms in detecting SMS spam at access layer," presented at the Proceedings of the 13th annual conference on Genetic and evolutionary computation, Dublin, Ireland, 2011.

M. Z. R. que and M. Farooq, "SMS Spam Detection By Operating On Byte-Level Distributions Using Hidden Markov Models (HMMS)," presented at the Virus Bulletin Conference September 2010, 2010.

G. Yan, et al., "SMS-Watchdog: Profiling Social Behaviors of SMS Users for Anomaly Detection

Recent Advances in Intrusion Detection." vol. 5758, E. Kirda, et al., Eds., ed: Springer Berlin / Heidelberg, 2009, pp. 202-223.

J. M. G. Hidalgo, et al., "Content based SMS spam filtering," presented at the Proceedings of the 2006 ACM symposium on Document engineering, Amsterdam, The Netherlands, 2006.

Y. Xiang, et al., "Filtering mobile spam by support vector machine " presented at the Conference on Computer Sciences, Software Engineering, Information Technology, E-Business and Applications (3rd: 2004 : Cairo, Egypt), Cairo, Egypt, 2004.

C. Jie, et al., "Spam Filter for Short Messages Using Winnow," in Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on, 2008, pp. 454-459.

K. Yadav, et al., "Take Control of Your SMSes: Designing an Usable Spam SMS Filtering System," in Mobile Data Management (MDM), 2012 IEEE 13th International Conference on, 2012, pp. 352-355.

W. Ningning, et al., "Real-time monitoring and filtering system for mobile SMS," in Industrial Electronics and Applications, 2008. ICIEA 2008. 3rd IEEE Conference on, 2008, pp. 1319-1324.

J. Huang, et al., "A Bayesian Approach for Text Filter on 3G Network," in Wireless Communications Networking and Mobile Computing (WiCOM), 2010 6th International Conference on, 2010, pp. 1-5.

H. Najadat, et al., "Mobile SMS Spam Filtering based on Mixing Classifiers."

T. M. Mahmoud and A. M. Mahfouz, "SMS Spam Filtering Technique Based on Artificial Immune System," IJCSI International Journal of Computer Science Issues, vol. 9, 2012.

T. Charninda, et al., "Content based hybrid sms spam filtering system," 2014.

N. Saxena and N. S. Chaudhari, "SecureSMS: A secure SMS protocol for VAS and other applications," Journal of Systems and Software, vol. 90, pp. 138-150, 2014.

G. C. C. F. Pereira, et al., "SMSCrypto: A lightweight cryptographic framework for secure SMS transmission," Journal of Systems and Software, vol. 86, pp. 698-706, 2013.

J. Choi and H. Kim, "A Novel Approach for SMS security," International Journal of Security & Its Applications, vol. 6, 2012.


  • There are currently no refbacks.

Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2023 Praise Worthy Prize