Published in: The Journal of Supercomputing 11/2019

29 July 2019

A dynamic data dissemination mechanism for Cassandra NoSQL data store

Authors: Elahe Khatibi, Seyedeh Leili Mirtaheri

Abstract

Cassandra, a NoSQL database, was introduced to overcome the limitations of traditional relational databases for big data and real-time applications, whose defining traits are high-speed data production (volume) and diverse data formats (variety). The dynamic nature of distributed data, distributed systems, and the applications built on them leads to skewed data access patterns, which cause data imbalance and, in turn, progressive performance deterioration. In this study, we propose a dynamic data dissemination (D3) strategy that adapts to the dynamic behavior of the distributed environment, including the diverse and time-varying popularity of data requests and heterogeneous node capacity. The evaluation results show improved performance.

Metadata
Title
A dynamic data dissemination mechanism for Cassandra NoSQL data store
Authors
Elahe Khatibi
Seyedeh Leili Mirtaheri
Publication date
29 July 2019
Publisher
Springer US
Published in
The Journal of Supercomputing, Issue 11/2019
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-019-02959-7
