Open Access Open Access  Restricted Access Subscription or Fee Access

Dynamic Clustering for Improved Web Usage Mining Using WebBLUEGILL


(*) Corresponding author


Authors' affiliations


DOI: https://doi.org/10.15866/irecos.v9i11.3895

Abstract


Every year zettabytes of information are generated making difficult to find information on the web. Next e-business requirement will be personalizing or customizing the web pages according to the requirement of individuals. Web personalization provides recommendations that will be potentially liked by the user using web usage mining techniques. To support improved web personalization this paper proposes clustering algorithms handling two main challenges of web usage mining techniques: scalability and ability to handle dynamic data sets. Hybridization of the proposed WebBLUEGILL with the traditional k-means and the spherical k-means improves scalability. The WebBLUEGILL algorithm performs dynamic clustering with dynamic data items. The WebBLUEGILL algorithm uses swarm intelligence approach that gains inspiration by the Bluegill fish’s foraging behavior. This dynamic clustering approach can address the dynamic nature of online usage data by managing new data items and by handling budding and vanishing clusters.
Copyright © 2014 Praise Worthy Prize - All rights reserved.

Keywords


Cluster Optimization; Dynamic Clustering; Swarm Intelligence; User Profiles; Web Usage Mining

Full Text:

PDF


References


R. Cooley, B. Mobasher, and J. Srivastava, “Web Mining: Information and Pattern Discovery on the World Wide Web,”Proc. Ninth IEEE Int’l Conf. Tools with AI (ICTAI ’97), pp. 558-567,1997.
http://dx.doi.org/10.1109/tai.1997.632303

O. Nasraoui, R. Krishnapuram, and A. Joshi, “Mining Web Access Logs Using a Relational Clustering Algorithm Based on a Robust Estimator,” Proc. Eighth Int’lWorld Wide Web Conf. (WWW ’99),pp. 40-41, 1999.

O. Nasraoui, R. Krishnapuram, H. Frigui, and A. Joshi, “ExtractingWeb User Profiles Using Relational Competitive Fuzz Clustering, ”Int’l J. Artificial Intelligence Tools, vol. 9, no. 4,pp. 509-526, 2000.
http://dx.doi.org/10.1142/s021821300000032x

J. Srivastava, R. Cooley, M. Deshpande, and P.-N. Tan, “Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data,” SIGKDD Explorations, vol. 1, no. 2, pp. 1-12, Jan. 2000.
http://dx.doi.org/10.1145/846183.846188

A. J. Abrantes and J. S. Marques, “A Method for Dynamic Clustering of Data”, Proc. Ninth British Conference,1998.
http://dx.doi.org/10.5244/c.12.16

Martin Ester, Hans peter Kriegel, Michael Wimmer, and Xiaowej Xu, “ Incremental Clustering for Mining in a Data Warehousing Enviornment”, Proc 24th INt. Conf. Very Large DataBases, VLDB, pages 323-333, 1998.

J. Kennedy and R. Eberhart. “Particle swarm optimization”, Proc of IEEE International Conference on Neural Networks, vol 4, pages 1942-1948, 1995.
http://dx.doi.org/10.1109/icnn.1995.488968

N. Labroche,N. Monmarche and G. Venturini, “ Antclust: Ant clustering and web usage mining”, proc. Of GECC)-2003,pages 25-36,Springer,LNCS,2003.
http://dx.doi.org/10.1007/3-540-45105-6_3

O. Nasraoui, C. Cardona,C.Rojas and F. Gonzalez, “ Mining evolving user profiles in noisy web clickstream data with a scalable immune system clustering algorithm, Proc of WebKDD 2003,pages 71-81,Washington DC,Aug 2003.

Yixin chen and Li Tu, “Density based clustering for real time stream data”.Proc of 13th ACM SIGKDD International Conference on Knowledge discovery and data mining (KDD’07), pages 133-142,2007.
http://dx.doi.org/10.1145/1281192.1281210

Esin Saka and Olfa Nasraoui,” Simultaneous clustering and visualization of web usage data using swarm based intelligence”, Proc of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI’08), 2008.
http://dx.doi.org/10.1109/ictai.2008.100

O. Nasraoui, M. Soliman, E. Saka, A. Badia and R. Germain, “A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites”, IEEE Transactions on Knowledge and Data Engineering, vol. 20, no. 2,pp. 202-215, Feb. 2008
http://dx.doi.org/10.1109/tkde.2007.190667

J. MacQueen. Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematica Statistics and Probability, volume 1, pages 281~297, 1967.

Esin Saka and Olfa Nasraoui. Improvements in flock-based collaborative clustering algorithms, Computational Intelligence Collaboration, Fusion and Emergence, Intelligent Systems Reference Library, pages 639-672. Springer, 2009.
http://dx.doi.org/10.1007/978-3-642-01799-5_20

http://bcs.whfreeman.com/thelifewire/content/chp53/5302002.html

Mohamed Jafar, O.A., Sivakumar, R., Hybrid fuzzy based nature-inspired clustering algorithms with validity measures, (2014) International Review on Computers and Software (IRECOS), 9 (6), pp. 928-937.

Long, F., Zhang, Y., Bin, L., A method for mining association rules based on cloud computing, (2011) International Review on Computers and Software (IRECOS), 6 (6), pp. 1112-1116.

Ambika, M., Latha, K., Web mining: The demystification of multifarious aspects, (2014) International Review on Computers and Software (IRECOS), 9 (1), pp. 135-141.

Hashemi, A., Nadimi, M., Recommender systems based on web usage mining: A survey, (2012) International Review on Computers and Software (IRECOS), 7 (6), pp. 2981-2986.

Jalali, M., Mustapha, N., Recommendation systems through semantic web usage mining, (2011) International Review on Computers and Software (IRECOS), 6 (6), pp. 882-886.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize