Mining of Optimized Multi Relational Relation Patterns for Prediction System


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


Prediction is arguably considered the main goal of data mining, with the greatest potential payoff. The two principal prediction problems are classification and regression. In many data mining tools that support classification tasks, training data are stored in a single table containing both the target field and the attributes. Generally, only intra-tuple relationships between the attributes and the target field are found, while inter-tuple relationships are not considered and relationships between several tuples of distinct tables are not even able explore. Disregarding inter-table relationships can be a severe limitation in many real-world applications that involve the prediction of numerical values from data that are naturally organized in a relational model involving several tables. In this paper, we have concentrated on classification task of multi-relational data. We have planned to develop an optimized rule generation with the newly designed fitness function of genetic algorithm. Once the rule is mined, the relevant attributes will be identified from multi relational patterns. Then, the database is converted to single table with all the relevant attributes from multiple tables. Finally, the neural network will be used to training the converted data based on the association relationship. Then, in the testing phase, the prediction will be done with the help of trained neural network. In the experimental analysis, the proposed approach has gained an average accuracy of 95% in the prediction process by neural network.
Copyright © 2013 Praise Worthy Prize - All rights reserved.

Keywords


Prediction; Multi-Relational Data; Pattern Mining; Optimization; Artificial Neural Network

Full Text:

PDF


References


Osmar R. Z., “Introduction to Data Mining”, In: Principles of Knowledge Discovery in Databases. CMPUT690, University of Alberta, Canada, 1999.

Kantardzic, Mehmed. “Data Mining: Concepts, Models, Methods, and Algorithms”, John Wiley and Sons, 2003.

E. Wainright Martin, Carol V. Brown, Daniel W. DeHayes, Jeffrey A. Hoffer and William C. Perkins, “Managing information technology”, Pearson Prentice-Hall 2005.

Andrew Kusiak and Matthew Smith, “Data mining in design of products and production systems”, in proceedings of Annual Reviews in control, vol. 31, no. 1, pp. 147- 156, 2007.

Mahesh Motwani, J.L. Rana and R.C Jain, "Use of Domain Knowledge for Fast Mining of Association Rules", in Proceedings of the International Multi-Conference of Engineers and Computer Scientists, 2009.

Oded Z. Maimon and Lior Rokach, “Data mining and knowledge discovery handbook”, Springer 2005.

Saso Dzeroski, "Multi-relational data mining: an introduction", ACM SIGKDD Explorations Newsletter, COLUMN: Multi Relational Data Mining (MRDM), vol. 5, no. 1, pp. 1 - 16, July 2003.

J. Knobbe, H. Blockeel, A. Siebes, and Van der Wallen D., “Multi-relational data mining”, In Proceedings of Benelearn 99, 1999.

Saso Dzeroski and Luc De Raedt, "Multi-Relational Data Mining: a Workshop Report", ACM SIGKDD Explorations Newsletter, vol. 4, no. 2, pp.122 - 124, December 2002.

Annalisa Appice, Michelangelo Ceci, and Donato Malerba, “Mining Relational Association Rules for Propositional Classification”, Lecture Notes in Computer Science, vol. 3673, pp. 522-534, 2005.

Wrobel, S., “Inductive logic programming for knowledge discovery in databases”, In: Dzeroski, S., N. Lavrac (eds.): Relational Data Mining, Springer: Berlin, pp. 74-101, 2001.

R. Agrawal, T. Imielinski and A. Swami, “Mining association rules between sets of items in large databases”, In proceedings of the international Conference on Management of Data, ACM SIGMOD, pp. 207–216, Washington, DC, May 1993.

Satheesh, A. Mishra, D.K. and Patel, R. "Classification Rule Mining for Object Oriented Databases: A Brief Review", in proceedings of the First International Conference on Computational Intelligence, Communication Systems and Networks, pp. 259 - 263, Indore, July 2009.

Michelangelo Ceci and Annalisa Appice, “Spatial Associative Classification: Propositional vs Structural approach”, Journal of Intelligent Information Systems, vol. 27, no. 3, pp: 191- 213, November 2006.

Saso Dzeroski and Luc De Raedt, "Multi-Relational Data Mining", in proceedings of ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24 - 27, 2003.

Anton Flank, "Multi-relational Association Rule Mining", 2004.

Bao Liang, Xiaoguang Hong, Lei Zhang and Shuai Li, "Extended MRI-Cube Algorithm for Mining Multi-Relational Patterns", in proceedings of the 2008 The 9th International Conference for Young Computer Scientists, pp. 1132-1136, 2008.

Seid, D.Y and Mehrotra, S., "Efficient relationship pattern mining using multi-relational iceberg-cubes", Fourth IEEE International Conference on Data Mining, pp. 515 - 518, 2004.

Toshiyuki Makino, Nobuhiro Inuzuka, "Implementing Pattern Mining Using Extended Attribute Expression on Relational DB", in proceedings of third International Conference on Knowledge Discovery and Data Mining, pp. 502-505, 2010.

Patterson D W, Chan K H, Tan C M. 1993, Time Series Forecasting with neural nets: a comparative study. Proc. the international conference on neural network applictions to signal processing. NNASP 1993 Singapore pp 269-274.

Nobuhiro Inuzuka, Jun-ichi Motoyama, Shinpei Urazawa and Tomofumi Nakano, "Relational Pattern Mining Based on Equivalent Classes of Properties Extracted from Samples", Lecture Notes in Computer Science, vol. 5012, pp. 582-591, 2008.

Xingquan Zhu and Xindong Wu, "Discovering Relational Patterns across Multiple Databases”, in procdings of the IEEE 23rd International Conference on Data Engineering, pp.726-735, 2007.

Edwards, T., Tansley, D, S. W., Davey, N., Frank, R. J.(1997). Traffic Trends Analysis using Neural Networks. Proceedings of the International Workshop on Applications of Neural Networks to Telecommuncations 3. pp. 157-164.

Bengio, S., Fessant F., Collobert D. A Connectionist System for Medium-Term Horizon Time Series Prediction. In Proc. Intl. Workshop Application Neural Networks to Telecoms pp308-315, 1995.

Dawit Yimam Seid, Sharad Mehrotra, "Efficient Relationship Pattern Mining using Multi-relational Data Cubes", Technical Report , Department of Information and Computer Science University of California, Irvine, 2004.

Rayner Alfred, Dimitar Kazakov, "Aggregating Multiple Instances in Relational Database Using Semi-Supervised Genetic Algorithm-based Clustering Technique", in proceedings of Eleventh East-European Conference on Advances in Databases and Information Systems, Varna, Bulgaria, 2007.

Hendrik Blockeel and Saso Dzeroski, “Multi-Relational Data Mining”, in Proceedings of ACM SIGKDD Conference, pp.126-128, 2005.

Saso Dzeroski, Luc De Raedt, "MultiRelational Data Mining: The Current Frontiers", ACM SIGKDD Explorations Newsletter (2002), pp: 2002- 2003, August 2002.

Wei Zhang, "Multi-Relational Data Mining Based on Higher-Order Inductive Logic Programming," 2009 WRI Global Congress on Intelligent Systems, vol. 2, pp: 453-458, 2009.

Yin, X. Han, J., "Exploring the Power of Heuristics and Links in Multi-relational Data Mining", Lecture Notes In Computer Science, No: 4994, pp: 17-27, 2008.

Y. Kavurucu, P. Senkul, and I.H. Toroslu, "ILP-based concept discovery in multi-relational data mining", An International Journal of Expert Systems with Applications, vol. 36, no. 9, pp: 11418- 11428, 2009.

Jingfeng Guo, Weifeng Bian, Jing Li, Yanshan Univ and Qinhuangdao, "Multi relational Association Rule Mining with Guidance of User", in proc. of 4th Intl. Conference on Fuzzy Systems and Knowledge Discovery, vol. 2, pp: 704- 709, 24- 27 August, Haikou, 2007.

Ankur M. Teredesai, Muhammad A. Ahmad, Juveria Kanodia and Roger S. Gaborski, "CoMMA: a framework for integrated multimedia mining using multi-relational associations", Knowledge and Information Systems, vol. 10, no. 2, pp: 135- 162, August 2006.

Pizzi, L.C., Ribeiro, M.X., Vieira, M.T.P., "Analysis of Hepatitis Dataset using Multi-relational Association Rules", The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD) Discovery Challenge, 2005.

Donato Malerba, Michelangelo Ceci and Annalisa Appice, "A relational approach to probabilistic classification in a transductive setting", Engineering Applications of Artificial Intelligence, vol. 22, no. 1, pp. 1109- 116, February 2009.

Hongyu Guo and Herna L. Viktor, "Multi-relational classification: a multiple view approach", Journal of Knowledge and Information Systems, vol. 17, no. 3, pp: 287- 312, February, 2008.

Chien- I Lee, Cheng- Jung Tsai, Tong- Qin Wu and Wei- Pang Yang, "An approach to mining the multi-relational imbalanced database", Journal of Expert Systems with Applications, vol. 34, no. 4, pp. 3021- 3032, May 2008.

Xiaoxin Yin, Jiawei Han, Jiong Yang and Yu Philip S., "Efficient classification across multiple database relations: A crossmine approach", IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 6, pp. 770- 783, 2006.

Vaibhav Kant Singh, Vijay Shah, Yogendra Kumar Jain, Anupam Shukla, A.S. Thoke, Vinay Kumar Singh, Chhaya Dule and Vivek Parganiha, "Proposing an Efficient Method for Frequent Pattern Mining", World Academy of Science, Engineering and Technology, vol.61, pp. 384-390, 2008.

Widrow v, Rumelhart D.E, Lehr. M.A, “Neural Networks: Application in industry, business and science”, Communication of the ACM, Vol 37, No 2, pp: 93-105, 1994.

Financial data from http://lisp.vse.cz/pkdd99/Challenge/chall.htm

Peijie Huang, Piyuan Lin, Shangwei Yan, Meiyan Xiao, " Building Model-driven Decision Support System for Seasonal Broiler Growth Performance Prediction", IRECOS, Vol. 7 N. 2, pp. 891-902, 2012.

Yilmaz, G., Badur, B.Y., Mardikyan, S., Development of a constraint based sequential pattern mining tool, (2011) International Review on Computers and Software (IRECOS), 6 (2), pp. 191-198.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize