An Efficient Ontology Based Concept Indexing and Clustering for Biomedical Documents



Abstract


Conventional document clustering techniques aim to group documents into semantic classes based on the cluster hypothesis. Most existing techniques rely either on single-term keywords with frequency analysis or on a phrase-based approach that uses n-grams of the document. Accurate document clustering is often infeasible because of the curse of dimensionality that arises from the high-dimensional feature space. This paper proposes a two-step process for clustering text documents: concept-based indexing, which uses a domain ontology as background knowledge for concept extraction, followed by clustering of the documents. The results of the proposed method are compared with those of a traditional indexing technique, Latent Semantic Indexing (LSI). To demonstrate the efficiency of the proposed technique, the biomedical domain is chosen, with MeSH as the ontology. The experimental results show that the proposed method outperforms both the traditional term-based method and LSI.
Copyright © 2014 Praise Worthy Prize - All rights reserved.

Keywords


Latent Semantic Indexing; Ontology; Document Clustering; FM index; Silhouette Index
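
The two-step process described in the abstract (concept-based indexing, then clustering) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The lexicon `TERM_TO_CONCEPT`, its concept labels, the sample documents, the similarity threshold, and the greedy single-pass clustering are all assumptions made for illustration; the entries are not real MeSH descriptors, and the paper's actual concept extraction and clustering algorithm are not specified here.

```python
import math
from collections import Counter

# Hypothetical toy lexicon mapping surface terms to concept labels.
# Illustrative only: these are NOT real MeSH descriptors or identifiers.
TERM_TO_CONCEPT = {
    "heart attack": "C_MYOCARDIAL_INFARCTION",
    "myocardial infarction": "C_MYOCARDIAL_INFARCTION",
    "high blood pressure": "C_HYPERTENSION",
    "hypertension": "C_HYPERTENSION",
    "tumor": "C_NEOPLASM",
    "neoplasm": "C_NEOPLASM",
}


def term_index(text):
    """Traditional term-based index: raw counts of whitespace tokens."""
    return Counter(text.lower().split())


def concept_index(text):
    """Concept-based index: counts of concepts whose terms occur in the text.

    Uses naive substring matching, which is enough for this toy example.
    """
    lowered = text.lower()
    counts = Counter()
    for term, concept in TERM_TO_CONCEPT.items():
        occurrences = lowered.count(term)
        if occurrences:
            counts[concept] += occurrences
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def greedy_cluster(vectors, threshold=0.5):
    """Assumed stand-in clustering: each document joins the first cluster
    whose first member is at least `threshold` similar, else starts a new one."""
    clusters = []
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vectors[cluster[0]], vec) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


DOCS = [
    "Patient admitted after heart attack",
    "Myocardial infarction risk factors",
    "Neoplasm growth in tissue",
    "Tumor biopsy results",
]

term_clusters = greedy_cluster([term_index(d) for d in DOCS])
concept_clusters = greedy_cluster([concept_index(d) for d in DOCS])
print("term-based:   ", term_clusters)
print("concept-based:", concept_clusters)
```

In this toy setting, documents that describe the same condition with different wording share no tokens, so the term-based index keeps them apart, while the concept-based index maps the synonyms to one concept and groups them. It also shows why concept indexing can reduce dimensionality: six surface terms collapse into three concepts.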



