Skip to main content

2015 | OriginalPaper | Buchkapitel

MapReduce-DBMS: An Integration Model for Big Data Management and Optimization

verfasst von : Dhouha Jemal, Rim Faiz, Ahcène Boukorca, Ladjel Bellatreche

Erschienen in: Database and Expert Systems Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The data volume and the multitude of sources have an exponential number of technical and application challenges. In the past, Big Data solutions have been presented as a replacement for the Parallel Database Management Systems. However, Big Data solutions can be seen as a complement to a RDBMS for analytical applications, because different problems require complex analysis capabilities provided by both technologies. The aim of his work is to integrate a Big Data solution and a classic DBMS, in a goal of queries optimization. We propose a model for OLAP queries process. Then, we valid the proposed optimized model through experiments showing the gain of the execution cost saved up.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ordonez, C.: Can we analyze big data inside a DBMS?. In: Proceedings of the Sixteenth International Workshop on Data Warehousing and OLAP, ACM, pp. 85–92 (2013) Ordonez, C.: Can we analyze big data inside a DBMS?. In: Proceedings of the Sixteenth International Workshop on Data Warehousing and OLAP, ACM, pp. 85–92 (2013)
2.
Zurück zum Zitat Fan, W., Bifet, A.: Mining big data: current status, and forecast to the future. In: SIGKDD Explorations, vol. 14, issue 2 (2011) Fan, W., Bifet, A.: Mining big data: current status, and forecast to the future. In: SIGKDD Explorations, vol. 14, issue 2 (2011)
3.
Zurück zum Zitat Doulkeridis, C., Nørvåg, K.: A survey of large-scale analytical query processing in MapReduce. VLDB J. 23(3), 355–380 (2014)CrossRef Doulkeridis, C., Nørvåg, K.: A survey of large-scale analytical query processing in MapReduce. VLDB J. 23(3), 355–380 (2014)CrossRef
4.
Zurück zum Zitat Dean, J., Ghemawats, S.: Mapreduce: Simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)CrossRef Dean, J., Ghemawats, S.: Mapreduce: Simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)CrossRef
5.
Zurück zum Zitat Brown, P.G.: Object-Relational Database Development: A Plumber’s Guide. Prentice Hall PTR, Upper Saddle River (2000) Brown, P.G.: Object-Relational Database Development: A Plumber’s Guide. Prentice Hall PTR, Upper Saddle River (2000)
6.
Zurück zum Zitat Douglas, K., Douglas, S.: PostgreSQL: A Comprehensive Guide to Building, Programming, and Administring PostreSQL Databases, 1st edn. Sams Publishing, (2003) Douglas, K., Douglas, S.: PostgreSQL: A Comprehensive Guide to Building, Programming, and Administring PostreSQL Databases, 1st edn. Sams Publishing, (2003)
7.
Zurück zum Zitat McClean, A., Conceicao, R., O’halloran, M.: A comparison of MapReduce and parallel database management systems. In: ICONS 2013 The Eighth International Conference on Systems, pp. 64–68 (2013) McClean, A., Conceicao, R., O’halloran, M.: A comparison of MapReduce and parallel database management systems. In: ICONS 2013 The Eighth International Conference on Systems, pp. 64–68 (2013)
8.
Zurück zum Zitat Nance, C., Losser, T., Iype, R., Harmon, G.: NOSQL VS RDBMS – why there is room for both. In: Proceedings of the Southern Association for Information Systems Conference Savannah USA, pp. 111–116 (2013) Nance, C., Losser, T., Iype, R., Harmon, G.: NOSQL VS RDBMS – why there is room for both. In: Proceedings of the Southern Association for Information Systems Conference Savannah USA, pp. 111–116 (2013)
9.
Zurück zum Zitat Stonebraker, M., Abadi, D., Dewitt, D., Madden, S., Paulsone, E., Pavlo, A., Rasin, A.: Mapreduce and parallel dbmss: friends or foes? Commun. ACM 53(1), 64–71 (2010)CrossRef Stonebraker, M., Abadi, D., Dewitt, D., Madden, S., Paulsone, E., Pavlo, A., Rasin, A.: Mapreduce and parallel dbmss: friends or foes? Commun. ACM 53(1), 64–71 (2010)CrossRef
10.
Zurück zum Zitat Gruska, N., Martin, P.: Integrating MapReduce and RDBMSs. In: Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research, IBM Corp, pp. 212–223 (2010) Gruska, N., Martin, P.: Integrating MapReduce and RDBMSs. In: Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research, IBM Corp, pp. 212–223 (2010)
11.
Zurück zum Zitat Yui, M., Kojima, I.: A database-hadoop hybrid approach to scalable machine learning. In: Big Data 2013 IEEE International Congress on IEEE, pp. 1–8 (2013) Yui, M., Kojima, I.: A database-hadoop hybrid approach to scalable machine learning. In: Big Data 2013 IEEE International Congress on IEEE, pp. 1–8 (2013)
12.
Zurück zum Zitat Abouzeid, A., Pawlikowski, K., Abadi, D., Silberschatz, A., Rasin, A.: Hadoopdb: an architectural hybrid of mapreduce and dbms technologies for analytical workloads. In: Proceedings of the VLDB Endowment, pp. 922–933 (2009) Abouzeid, A., Pawlikowski, K., Abadi, D., Silberschatz, A., Rasin, A.: Hadoopdb: an architectural hybrid of mapreduce and dbms technologies for analytical workloads. In: Proceedings of the VLDB Endowment, pp. 922–933 (2009)
13.
Zurück zum Zitat Pavlo, A., Rasin, A., Madden, S., Stonebraker, M., Dewitt, D., Paulson, E., Shrinivas, L., Abadi, D.: A comparison of approaches to large scale data analysis. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pp. 165–178 (2009) Pavlo, A., Rasin, A., Madden, S., Stonebraker, M., Dewitt, D., Paulson, E., Shrinivas, L., Abadi, D.: A comparison of approaches to large scale data analysis. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pp. 165–178 (2009)
14.
Zurück zum Zitat Chandrasekar, S., Dakshinamurthy, R., Seshakumar, P.G., Prabavathy, B., Babu, C.: A novel indexing scheme for efficient handling of small files in hadoop distributed file system. In: Computer Communication and Informatics (ICCCI), 2013 International Conference, pp. 1–8 (2013) Chandrasekar, S., Dakshinamurthy, R., Seshakumar, P.G., Prabavathy, B., Babu, C.: A novel indexing scheme for efficient handling of small files in hadoop distributed file system. In: Computer Communication and Informatics (ICCCI), 2013 International Conference, pp. 1–8 (2013)
15.
Zurück zum Zitat Pavlo, A., Paulson, E., Rasin, A., Abadi, D.J., DeWitt, D.J., Madden, S., Stonebraker, M.: A comparison of approaches to large-scale data analysis. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data ACM, pp. 165–178 (2009) Pavlo, A., Paulson, E., Rasin, A., Abadi, D.J., DeWitt, D.J., Madden, S., Stonebraker, M.: A comparison of approaches to large-scale data analysis. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data ACM, pp. 165–178 (2009)
16.
Zurück zum Zitat Boukorca, A., Faget, Z., Bellatreche, L.: What-if physical design for multiple query plan generation. In: Decker, H., Lhotská, L., Link, S., Spies, M., Wagner, R.R. (eds.) DEXA 2014, Part I. LNCS, vol. 8644, pp. 492–506. Springer, Heidelberg (2014) Boukorca, A., Faget, Z., Bellatreche, L.: What-if physical design for multiple query plan generation. In: Decker, H., Lhotská, L., Link, S., Spies, M., Wagner, R.R. (eds.) DEXA 2014, Part I. LNCS, vol. 8644, pp. 492–506. Springer, Heidelberg (2014)
17.
Zurück zum Zitat Brighen, A.: Conception de bases de données volumineuses sur le cloud. In: Doctoral dissertation, Université Abderrahmane Mira de Béjaia (2012) Brighen, A.: Conception de bases de données volumineuses sur le cloud. In: Doctoral dissertation, Université Abderrahmane Mira de Béjaia (2012)
18.
Zurück zum Zitat Fabrizio, M., Domenico, T., Paolo, T.: P2P-MapReduce: parallel data processing in dynamic cloud environments. J. Comput. Syst. Sci. 78(5), 1382–1402 (2012)CrossRef Fabrizio, M., Domenico, T., Paolo, T.: P2P-MapReduce: parallel data processing in dynamic cloud environments. J. Comput. Syst. Sci. 78(5), 1382–1402 (2012)CrossRef
Metadaten
Titel
MapReduce-DBMS: An Integration Model for Big Data Management and Optimization
verfasst von
Dhouha Jemal
Rim Faiz
Ahcène Boukorca
Ladjel Bellatreche
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-22852-5_36