A Cloud Database Management System (CDBMS) is one of the services offered by various cloud service providers. Cloud providers must cope with different users, different kinds of data, and different processing and analysis requirements. Traditional database management systems are insufficient to handle such a variety of data, users and requirements. Hence, at the conceptual layer of a CDBMS, traditional SQL, Oracle and many other database languages cannot provide adequate services to their users. HIVE and Pig are languages suited to the cloud environment that can handle such huge amounts of data. In this paper, we compare the performance of a 3-node cluster with that of a cloud-based cluster provided by Amazon Web Services (AWS). We compare the processing of structured data using different queries provided by the HIVE tool on the 3-node cluster and on the AWS cluster. We conclude that HIVE queries on the AWS cluster give better results than on the 3-node cluster.
- Executing HIVE Queries on 3-Node Cluster and AWS Cluster—Comparative Analysis
Mohammad Najmud Doja
- Springer India