Skip to main content
Top

2015 | OriginalPaper | Chapter

An Efficient Data Integration Framework in Cloud Using MapReduce

Authors : P. Srinivasa Rao, M. H. M. Krishna Prasad, K. Thammi Reddy

Published in: Computational Intelligence Techniques for Comparative Genomics

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In Bigdata applications, providing security to massive data is an important challenge because working with such data requires large scale resources that must be provided by cloud service provider. Here, this paper demonstrates a cloud implementation and technologies using big data and discusses how to protect such data using hashing and how users can be authenticated. In particular, technologies using big data such as the Hadoop project of Apache are discussed, which provides parallelized and distributed data analyzing and processing of petabyte of data, along with a summarized view of monitoring and usage of Hadoop cluster. In this paper, an algorithm called FNV hashing is introduced to provide integrity of the data that has been outsourced to cloud by the user. The data within Hadoop cluster can be accessed and verified using hashing. This approach brings out to enable many new security challenges over the cloud environment using Hadoop distributed file system. The performance of the cluster can be monitored by using ganglia monitoring tool. This paper designs an evaluation cloud model which will provide quantity related results for regularly checking accuracy and cost. From the results of the experiment found out that this model is more accurate, cheaper and can respond in real time.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference Svantesson D, Clarke R (2010) Privacy and consumer risks in cloud computing. Comput Law Secur Review 26(4):391–397CrossRef Svantesson D, Clarke R (2010) Privacy and consumer risks in cloud computing. Comput Law Secur Review 26(4):391–397CrossRef
4.
go back to reference King NJ, Raja VT (2012) Protecting the privacy and security of sensitive customer data in the cloud. Comput Law Secur Rev 28(3):308–319CrossRef King NJ, Raja VT (2012) Protecting the privacy and security of sensitive customer data in the cloud. Comput Law Secur Rev 28(3):308–319CrossRef
5.
go back to reference Breitinger F, Stivaktakis G, Baier H (2013) A framework to test algorithms of similarity hashing. Digit Invest 10:S50–S58CrossRef Breitinger F, Stivaktakis G, Baier H (2013) A framework to test algorithms of similarity hashing. Digit Invest 10:S50–S58CrossRef
6.
go back to reference Rupesh M, Chitre DK (2012) Data leakage and detection of guilty agent. Int J Sci Eng Res 3(6) Rupesh M, Chitre DK (2012) Data leakage and detection of guilty agent. Int J Sci Eng Res 3(6)
9.
go back to reference Zhao J, Wang L, Tao J, Chen J, Sun W, Ranjan R, Kołodziej J, Streit A, Georgakopoulos D (2014) A security framework in GHadoop for bigdata computing across distributed Cloud data centers. Comput Syst Sci 80:994–1007CrossRefMATH Zhao J, Wang L, Tao J, Chen J, Sun W, Ranjan R, Kołodziej J, Streit A, Georgakopoulos D (2014) A security framework in GHadoop for bigdata computing across distributed Cloud data centers. Comput Syst Sci 80:994–1007CrossRefMATH
10.
go back to reference Wang L, Tao J, Ranjan R, Marten H, Streit A, Chen D, Chen J (2013) G-Hadoop: mapreduce across distributed data centers from data-intensive computing. Future Gener Comput Syst 29(3):739CrossRef Wang L, Tao J, Ranjan R, Marten H, Streit A, Chen D, Chen J (2013) G-Hadoop: mapreduce across distributed data centers from data-intensive computing. Future Gener Comput Syst 29(3):739CrossRef
11.
go back to reference Caballer M, de Alfonso C, Molto G, Romero E, Blanquer I, Garcia A (2014) Code cloud: A platform to enable execution of programming models on the Clouds. J Syst Softw 93:187–198CrossRef Caballer M, de Alfonso C, Molto G, Romero E, Blanquer I, Garcia A (2014) Code cloud: A platform to enable execution of programming models on the Clouds. J Syst Softw 93:187–198CrossRef
12.
go back to reference AL-Saiyd NA, Sail N (2013) Data integrity in cloud computing security. Theor Appl Inform Technol 58 AL-Saiyd NA, Sail N (2013) Data integrity in cloud computing security. Theor Appl Inform Technol 58
13.
go back to reference Dillibabu M, Kumari S, Saranya T, Preethi R (2013) Assured protection and veracity for cloud data using Merkle hash tree algorithm. Indian J Appl Res 3:1–3 Dillibabu M, Kumari S, Saranya T, Preethi R (2013) Assured protection and veracity for cloud data using Merkle hash tree algorithm. Indian J Appl Res 3:1–3
14.
go back to reference Mounika CH, RamaDevi L, Nikhila P (2013) Sample load rebalancing for distributed hash table in cloud. ISRO J Comput Eng 13:60–65CrossRef Mounika CH, RamaDevi L, Nikhila P (2013) Sample load rebalancing for distributed hash table in cloud. ISRO J Comput Eng 13:60–65CrossRef
Metadata
Title
An Efficient Data Integration Framework in Cloud Using MapReduce
Authors
P. Srinivasa Rao
M. H. M. Krishna Prasad
K. Thammi Reddy
Copyright Year
2015
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-287-338-5_11

Premium Partner