A Hybrid System of Hadoop and DBMS for Earthquake Precursor Application


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


Compared with traditional data warehouse applications, big data analytics are huge and complex, and requires massive performance and scalability. In this paper, we explore the feasibility and versatility of building a hybrid system that not only retains the analytical DBMS, but also could handle the demands of rapidly exploding data applications. We propose a hybrid system prototype which takes DBMS as the underlying storage and execution units, and Hadoop as an index layer and a cache. Experiments show that our system meets the demand, and will be appropriate for analogous big data analysis applications.
Copyright © 2013 Praise Worthy Prize - All rights reserved.

Keywords


MapReduce; Parallel Database; Global Index Access

Full Text:

PDF


References


J. Gantz, C. Chute, and A. Manfrediz. The diverse and exploding digital universe, 2008: IDC white paper.

J. Lin and C. Dyer. Data-Intensive Text Processing with MapReduce, 2010: Morgan & Claypool.

R.E. Bryant. Date-Intensive Supercomputing: The Case for DISC, 2007: CMU Tech Report.

A. Zhou. Data intensive computing-challenges of data management techniques. Communications of the CCF, 2009. 5(7): p. 50-53.

Worldwide LHC Computing Grid. Available from: http://public.web.cern.ch/public/en/LHC/Computing-en.html.

Facebook, Hadoop, and Hive. Available from: http://www.dbms2.com/2009/05/11/facebook-hadoop-and-hive/.

S. Ghemawat, H. Gobioff, and S. Leung. The Google file system. in Proceedings of the 19th ACM Symposium on Operationg System Principles (SOSP' 03). 2003. New York, USA.

J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. in Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI' 04). 2004. San Francisco, California, USA.

J. Dean and S. Ghemawat. MapReduce: a flexible data processing tool. Communications of the ACM, 2010. 53(1): p. 72-77.

Hadoop: Open-source implementation of MapReduce. Available from: http://hadoop.apache.org.

The HDFS Project. Available from: http://hadoop.apche.org/hdfs.

K. Shvachko, H. Huang, S. Radia, et al. The hadoop distributed filesystem. in Proceedings of the 26th IEEE Symposium on Massive Storage Systems and Technologies (MSST' 10) 2010.

Zhou, L., Fang, Z., Xiang, L., Cai, R., Hu, N., Performance optimization of processing small files based on HDFS, (2012) International Review on Computers and Software (IRECOS), 7 (6), pp. 3386-3391.

Long, F., Zhang, Y., Bin, L., A method for mining association rules based on cloud computing, (2011) International Review on Computers and Software (IRECOS), 6 (6), pp. 1112-1116.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize