Open Access Open Access  Restricted Access Subscription or Fee Access

The Implementation of Using Medical Ontologies in Plagiarism Detection

Khaled Omar(1*), Bassel Alkhatib(2), Mayssoon Dashash(3), Fadel Alhassan(4)

(1) Faculty of informatics Engineering, Damascus university, Syrian Arab Republic
(2) Faculty of informatics Engineering, Damascus university, Syrian Arab Republic
(3) Faculty of Dentistry, Damascus University, Syrian Arab Republic
(4) Higher Institute for Applied Sciences and Technology, Syrian Arab Republic
(*) Corresponding author


DOI: https://doi.org/10.15866/irecos.v11i3.8681

Abstract


This paper aims to present a new algorithm implemented for detecting plagiarism using semantic web tools and notions. For increasing detection accuracy domain ontology could be used in addition to global semantic resources. Using global semantic resources increases the effect of ambiguity therefore disambiguation technique was used. Not all semantically similar texts are plagiarized. So, other detection techniques were used to reduce false positive results.  Given that our work has been implemented in medical disciplines and for texts written in English, it presents a generic algorithm that can be adapted for different disciplines and languages. For medical discipline, a set of medical Ontologies were used for enriching extracted medical terms. In addition, WordNet was used for enriching global terms. The test results of the algorithm shows that it was able to detect advanced types of plagiarism that are out of the reach of classical methods such as: using word synonyms, word re-ordering, text re-styling and other natural languages techniques which are usually used to hide the plagiarism action.
Copyright © 2016 Praise Worthy Prize - All rights reserved.

Keywords


Semantic Web; Medical Ontologies; Plagiarism Detection in Medical Sciences

Full Text:

PDF


References


Vinod K.R., Sandhya. S, Sathish Kumar D, Harani A, David Banji and Otilia JF Banji, “Plagiarism history, detection and prevention “, Hygeia: journal for drugs and medicines, Vol.3-Issue.1-pp. 1- 4, 2011.

Carroll, J. (2002) A Handbook for Deterring Plagiarism in Higher Education. Oxford: Oxford Brookes University.

Tachaphetpiboon, S.; Facundes, N.; Amornraksa, T., Plagiarism indication by syntactic-semantic analysis, Asia-Pacific Conference on Communications, pp.237-240, 2007.
http://dx.doi.org/10.1109/apcc.2007.4433544

Osman, A.H., Salim, N., BinWahlan, S., Hentabli, H. (2011). Conceptual Similarity and Graph-Based Method for Plagiarism Detection. Journal of Theoretical and Applied Information Technology. Volume32 (2): pp. 135 – 145.(Scopus Indexed)

M. Shenoy, K.C.Shet, and U. D. Acharya, “S Emantic P Lagiarism Detection System Using Ontology Mapping,” Advanced Computing, vol. 3, no. 3, pp. 59–62, 2012.

Y. Palkovskii, A. Belov, and I. Muzyka, "Using WordNet-based Semantic Similarity Measurement in External Plagiarism DetectionNotebook for PAN at CLEF 2011", in Proc. CLEF (Notebook Papers/Labs/Workshop), 2011.

Salha Alzahrani Naomie Salim “Fuzzy Semantic Based String Similarity for Extrinsic Plagiarism Detection” Lab Report for PAN at CLEF 2010.

Gipp, B., & Beel, J. (2010).” Citation based plagiarism detection - A new approach to identify plagiarized work language independently”. HT’10 - Proceedings of the 21st ACM Conference on Hypertext and Hypermedia, (June), 273–274.
http://dx.doi.org/10.1145/1810617.1810671

Princeton University, WordNet a large lexical database of English.[online] https://wordnet.Princeton.Edu [Accessed 15-March 2015].

EMBL-EBI, Ontology Lookup Service.[online] http://www.ebi. ac.uk/ontology-lookup/init.do#soft [Accessed 15-August 2015 ].

EMBL-EBI, Ontology Lookup Service.[online] http://www.ebi. ac.uk /ontology-lookup/ontologyList.do [Accessed 15-August 2015].

Gene Ontology Consortium, [online] Available at: ,http://geneontology.org/ [Accessed 1-March 2016].

The Infectious Disease Ontology, [online] Available at: http://infectiousdiseaseontology.org/page/Main_Page[Accessed 1-March 2016].

Ontology Design Patterns .org(ODP), [online] Available at: http://ontologydesignpatterns.org/wiki/Ontology:Foundational_Model_of_Anatomy_(FMA) [Accessed 1-March 2016].

Disease Ontology, [online] Available at: http://disease-ontology.org/[Accessed 1-March 2016].

Cell Ontology, [online] Available at: http://obofoundry. org/ontology/cl.html [Accessed 1-March 2016].

The OBO Foundry. [online] Available at: http://www.obofoundry.org/ [Accessed 15-March 2015].

The Stanford Natural Language Processing Group.[online] http://nlp.stanford.edu/software/ [Accessed 15-August 2015 ].

Wikipedia, the free encyclopedia.[online] https://en.Wikipedia. org /wiki/ Stop_words / [Accessed 20-August 2015 ].

The Stanford Natural Language Processing Group. Stanford Log-linear Part-Of-Speech Tagger [online] http://nlp.stanford.edu/ software/tagger.shtml/ [Accessed 1-August 2015].

The Stanford Natural Language Processing Group. Stanford Dependencies [online] http://nlp.stanford.edu/software/stanford-dependencies.shtml/ [Accessed 1-August 2015].

Satanjeev Banerjee, Ted Pedersen An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet.
http://dx.doi.org/10.1007/3-540-45715-1_11

Rajesh Thiagarajan, Geetha Manjunath, and Markus Stumptner ,” Computing Semantic Similarity Using Ontologies”, HP Laboratories HPL-2008-87.

Abrate, M., Bacciu, C., Marchetti, A., and Tesconi, M. (2012). Wordnet atlas: a web application for visualizing wordnet as a zoomable map. In GWC 2012 6th International Global Wordnet Conference, page 23.

Madylova, A. Oguducu, S. G., "A taxonomy based semantic similarity of documents using the cosine measure," in Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on, vol., no., pp.129-134, 14-16 Sept. 2009.
http://dx.doi.org/10.1109/iscis.2009.5291865

Faisal Rahutomo , Teruaki Kitasuka, Masayoshi Aritsugi,” Semantic Cosine Similarity”, The 7th International Student Conference on Advanced Science and Technology ICAST 2012, At Seoul, South Korea.
http://dx.doi.org/10.1145/2428736.2428784

Wikipedia, Cosine similarity.[online] https://en.wikipedia.org /wiki/Cosine_similarity / [Accessed 1-August 2015 ].

Bing, API Basics. [online] Available at: http://www.bing.com/ developers/s/APIBasics.html [Accessed 15-March 2015].

NETBEANS, [online] Available at: https://netbeans.org/ [Accessed 1-March 2014].

Europe PMC, [online] Available at: https://europepmc.org/ [Accessed 1-March 2014].

PubMed.gov, [online] Available at: http://www.ncbi.nlm.nih.gov/ pubmed [Accessed 20-March 2014].

Wikipedia, Precision and recall.[online] https://en.wikipedia.org/ wiki/Precision_and_recall / [Accessed 1-August 2015].

Omar, K., AlKhatib, B., Dashash, M., The Implementation of Plagiarism Detection System in Health Sciences Publications in Arabic and English Languages, (2013) International Review on Computers and Software (IRECOS), 8 (4), pp. 915-919.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2019 Praise Worthy Prize