Skip to main content
Erschienen in: Arabian Journal for Science and Engineering 8/2022

30.11.2021 | Research Article-Computer Engineering and Computer Science

A Distributed Data Storage Strategy Based on LOPs

verfasst von: Qianqiu Wang, Xiaoping Ye, Xianlu Luo, Lunjie Li, Hainan Chen

Erschienen in: Arabian Journal for Science and Engineering | Ausgabe 8/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Distributed data management requires data partitioning and deployment at the data storage level, and data querying requires the configuration and integration of query subresults at each site. The data partitioning strategy is closely related to the overhead of the distributed system. It is necessary to determine the appropriate data partitioning strategy and update strategy according to the application. This paper proposes a widely distributed storage and processing scheme for a distributed linear order partition (DLOP) based on time stamps. This scheme proposes two kinds of partition strategy based on the characteristics of an "equivalent division" of a linear order partition (LOP), namely, partitioning based on time interval equilibrium and partitioning based on query expectation. Each site in the distributed system is uniformly configured with an index-based data query mechanism to complete the distributed management of data. The corresponding experiments verify the practicability and efficiency of the proposed storage strategy and show that the proposed method is effective for the self-scalability of the data scale and reduces the cluster hardware configuration requirements.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang, Y.; et al.: Parallel processing systems for big data: a survey. Proc. IEEE 104(11), 2114–2136 (2016)CrossRef Zhang, Y.; et al.: Parallel processing systems for big data: a survey. Proc. IEEE 104(11), 2114–2136 (2016)CrossRef
2.
Zurück zum Zitat Polato, I.; et al.: A comprehensive view of Hadoop research—a systematic literature review. J. Netw. Comput. Appl. 46, 1–25 (2014). Author 1, A.; Author 2, B. Book Title, 3rd ed.; Publisher: Publisher Location, Country, 2008; pp. 154–196 Polato, I.; et al.: A comprehensive view of Hadoop research—a systematic literature review. J. Netw. Comput. Appl. 46, 1–25 (2014). Author 1, A.; Author 2, B. Book Title, 3rd ed.; Publisher: Publisher Location, Country, 2008; pp. 154–196
3.
Zurück zum Zitat Challa, J.S.; et al.: DD-Rtree: a dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithms. In: 2016 IEEE International Conference on Big Data (Big Data). IEEE (2016) Challa, J.S.; et al.: DD-Rtree: a dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithms. In: 2016 IEEE International Conference on Big Data (Big Data). IEEE (2016)
4.
Zurück zum Zitat Cangir, O.F.; Cankur, O.; Ozsoy, A.: A taxonomy for Blockchain based distributed storage technologies. Inf. Process. Manag. 58(5), 102627 (2021)CrossRef Cangir, O.F.; Cankur, O.; Ozsoy, A.: A taxonomy for Blockchain based distributed storage technologies. Inf. Process. Manag. 58(5), 102627 (2021)CrossRef
5.
Zurück zum Zitat Fan, W.; et al.: Method of maintaining data consistency in microservice architecture. In: 2018 IEEE 4th International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing, (HPSC) and IEEE International Conference on Intelligent Data and Security (IDS). IEEE Computer Society (2018) Fan, W.; et al.: Method of maintaining data consistency in microservice architecture. In: 2018 IEEE 4th International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing, (HPSC) and IEEE International Conference on Intelligent Data and Security (IDS). IEEE Computer Society (2018)
6.
Zurück zum Zitat Benerjee, K.G.; Gupta, M.K.: Trade-off for heterogeneous distributed storage systems between storage and repair cost. Prob. Inf. Transm. 57(1), 33–53 (2021)MathSciNetCrossRef Benerjee, K.G.; Gupta, M.K.: Trade-off for heterogeneous distributed storage systems between storage and repair cost. Prob. Inf. Transm. 57(1), 33–53 (2021)MathSciNetCrossRef
7.
Zurück zum Zitat Ruty, G.; Baccouch, H.; Nguyen, V., et al.: Popularity-based full replica caching for erasure-coded distributed storage systems. Clust. Comput. 2021, 1–14 (2021) Ruty, G.; Baccouch, H.; Nguyen, V., et al.: Popularity-based full replica caching for erasure-coded distributed storage systems. Clust. Comput. 2021, 1–14 (2021)
8.
Zurück zum Zitat Hall, R.J.: Tools for predicting the reliability of large-scale storage systems. ACM Trans. Storage (TOS) 12(4), 1–30 (2016)CrossRef Hall, R.J.: Tools for predicting the reliability of large-scale storage systems. ACM Trans. Storage (TOS) 12(4), 1–30 (2016)CrossRef
9.
Zurück zum Zitat Kruglik, S.; Frolov, A.: An information-theoretic approach for reliable distributed storage systems. J. Commun. Technol. Elect. 65(12), 1505–1516 (2020)CrossRef Kruglik, S.; Frolov, A.: An information-theoretic approach for reliable distributed storage systems. J. Commun. Technol. Elect. 65(12), 1505–1516 (2020)CrossRef
10.
Zurück zum Zitat Yu, L.; et al.: Stochastic load balancing for virtual resource management in datacenters. IEEE Trans. Cloud Comput. 8(2), 459–472 (2016)CrossRef Yu, L.; et al.: Stochastic load balancing for virtual resource management in datacenters. IEEE Trans. Cloud Comput. 8(2), 459–472 (2016)CrossRef
11.
Zurück zum Zitat Kaur, S.; Sharma, T.: Efficient load balancing using improved central load balancing technique. In: 2018 2nd International Conference on Inventive Systems and Control (ICISC). IEEE (2018) Kaur, S.; Sharma, T.: Efficient load balancing using improved central load balancing technique. In: 2018 2nd International Conference on Inventive Systems and Control (ICISC). IEEE (2018)
12.
Zurück zum Zitat Qin, X.P.; Wang, H.J.; Li, F.R.; et al.: New landscape of data management technologies. J. Softw. 24(2), 175–197 (2013)CrossRef Qin, X.P.; Wang, H.J.; Li, F.R.; et al.: New landscape of data management technologies. J. Softw. 24(2), 175–197 (2013)CrossRef
13.
Zurück zum Zitat Mishra, S.; Suman, A.C.: An efficient method of partitioning high volumes of multidimensional data for parallel clustering algorithms (2016). arXiv:1609.06221 Mishra, S.; Suman, A.C.: An efficient method of partitioning high volumes of multidimensional data for parallel clustering algorithms (2016). arXiv:​1609.​06221
14.
Zurück zum Zitat Alarabi, L.; Mokbel, M.F.; Musleh, M.: St-hadoop: a mapreduce framework for spatio-temporal data. GeoInformatica 22(4), 785–813 (2018)CrossRef Alarabi, L.; Mokbel, M.F.; Musleh, M.: St-hadoop: a mapreduce framework for spatio-temporal data. GeoInformatica 22(4), 785–813 (2018)CrossRef
15.
Zurück zum Zitat Mahmud, M.S.; et al.: A survey of data partitioning and sampling methods to support big data analysis. Big Data Min. Analyt. 3(2), 85–101 (2020)CrossRef Mahmud, M.S.; et al.: A survey of data partitioning and sampling methods to support big data analysis. Big Data Min. Analyt. 3(2), 85–101 (2020)CrossRef
16.
Zurück zum Zitat Emara, X.Z.T.Z.; He, C.W.H.: A random sample partition data model for big data analysis (2017). arXiv:1712.04146 Emara, X.Z.T.Z.; He, C.W.H.: A random sample partition data model for big data analysis (2017). arXiv:​1712.​04146
17.
Zurück zum Zitat Alsmirat, M.; Jararweh, Y.; Al-Ayyoub, M.: Speeding DBLP querying using hadoop and spark//IOP conference series: materials science and engineering. IOP Publ. 459(1), 012003 (2018) Alsmirat, M.; Jararweh, Y.; Al-Ayyoub, M.: Speeding DBLP querying using hadoop and spark//IOP conference series: materials science and engineering. IOP Publ. 459(1), 012003 (2018)
18.
Zurück zum Zitat Hu, X.; Xu, H.; Jia, J.; et al.: Research on distributed storage and query optimization of multi-source heterogeneous meteorological data. In: Proceedings of the 2018 International Conference on Cloud Computing and Internet of Things. ACM, pp. 12–18 (2018) Hu, X.; Xu, H.; Jia, J.; et al.: Research on distributed storage and query optimization of multi-source heterogeneous meteorological data. In: Proceedings of the 2018 International Conference on Cloud Computing and Internet of Things. ACM, pp. 12–18 (2018)
19.
Zurück zum Zitat Xue, J.; Xu, C.; Bai, L.: DStore: a distributed system for outsourced data storage and retrieval. Futur. Gener. Comput. Syst. 99, 106–114 (2019)CrossRef Xue, J.; Xu, C.; Bai, L.: DStore: a distributed system for outsourced data storage and retrieval. Futur. Gener. Comput. Syst. 99, 106–114 (2019)CrossRef
20.
Zurück zum Zitat Kolomvatsos, K.: A distributed, proactive intelligent scheme for securing quality in large scale data processing. Computing 101(11), 1687–1710 (2019)CrossRef Kolomvatsos, K.: A distributed, proactive intelligent scheme for securing quality in large scale data processing. Computing 101(11), 1687–1710 (2019)CrossRef
21.
Zurück zum Zitat Rafique, A.; Van Landuyt, D.; Joosen, W.: Persist: policy-based data management middleware for multi-tenant saas leveraging federated cloud storage. J. Grid Comput. 16(2), 165–194 (2018)CrossRef Rafique, A.; Van Landuyt, D.; Joosen, W.: Persist: policy-based data management middleware for multi-tenant saas leveraging federated cloud storage. J. Grid Comput. 16(2), 165–194 (2018)CrossRef
22.
Zurück zum Zitat Rafique, A.; Van Landuyt, D.; Truyen, E.; Reniers, V.; Joosen, W.: SCOPE: self-adaptive and policy-based data management middleware for federated clouds. J. Internet Serv. Appl. 10(1), 1–19 (2019)CrossRef Rafique, A.; Van Landuyt, D.; Truyen, E.; Reniers, V.; Joosen, W.: SCOPE: self-adaptive and policy-based data management middleware for federated clouds. J. Internet Serv. Appl. 10(1), 1–19 (2019)CrossRef
24.
Zurück zum Zitat Li, R.; He, H.; Wang, R.; Ruan, S.; Sui, Y.; Bao, J.; Zheng, Y.: Trajmesa: a distributed nosql storage engine for big trajectory data. In: 2020 IEEE 36th international conference on data engineering (ICDE). IEEE, pp. 2002–2005 (2020) Li, R.; He, H.; Wang, R.; Ruan, S.; Sui, Y.; Bao, J.; Zheng, Y.: Trajmesa: a distributed nosql storage engine for big trajectory data. In: 2020 IEEE 36th international conference on data engineering (ICDE). IEEE, pp. 2002–2005 (2020)
25.
Zurück zum Zitat Ye, X.; Tang, Y.; Lin, Y.; Chen, Z.; Zhang, Z.; Chen, R.: Study and implementation of temporal index TD index. Sci. Sin. (Inf.) 8(45), 1025–1045 (2015) Ye, X.; Tang, Y.; Lin, Y.; Chen, Z.; Zhang, Z.; Chen, R.: Study and implementation of temporal index TD index. Sci. Sin. (Inf.) 8(45), 1025–1045 (2015)
26.
Zurück zum Zitat Ye, X.P.; Tang, Y.; Zhang, Z.B.; Chen, Z.Y.; Lin, Y.C.: Study and implementation on semantics-based cooperative temporal XML index. J. Comput. 37(9), 1911–1921 (2014) Ye, X.P.; Tang, Y.; Zhang, Z.B.; Chen, Z.Y.; Lin, Y.C.: Study and implementation on semantics-based cooperative temporal XML index. J. Comput. 37(9), 1911–1921 (2014)
27.
Zurück zum Zitat Ye, X.P.; Tang, Y.; Lin, Y.C.; Chen, Z.Y.; Zhang, Z.B.: Study and application of temporal quasi-order data structure. J. Softw. 25(11), 2587–2601 (2014)MATH Ye, X.P.; Tang, Y.; Lin, Y.C.; Chen, Z.Y.; Zhang, Z.B.: Study and application of temporal quasi-order data structure. J. Softw. 25(11), 2587–2601 (2014)MATH
28.
Zurück zum Zitat Allen, J.F.: Maintaining knowledge about temporal intervals. Read. Qual. Reason. Phys. Syst. 26(11), 361–372 (1990) Allen, J.F.: Maintaining knowledge about temporal intervals. Read. Qual. Reason. Phys. Syst. 26(11), 361–372 (1990)
Metadaten
Titel
A Distributed Data Storage Strategy Based on LOPs
verfasst von
Qianqiu Wang
Xiaoping Ye
Xianlu Luo
Lunjie Li
Hainan Chen
Publikationsdatum
30.11.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
Arabian Journal for Science and Engineering / Ausgabe 8/2022
Print ISSN: 2193-567X
Elektronische ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-021-06371-3

Weitere Artikel der Ausgabe 8/2022

Arabian Journal for Science and Engineering 8/2022 Zur Ausgabe

Research Article-Computer Engineering and Computer Science

Prostate Segmentation via Dynamic Fusion Model

Research Article-Computer Engineering and Computer Science

Enhanced UAVs Mobility Models for Surveillance and Intruders Detection Missions

RESEARCH ARTICLE-COMPUTER ENGINEERING AND COMPUTER SCIENCE

Upgrading the Quality of Power Using TVSS Device and PFC Converter Fed SBLDC Motor

Research Article-Computer Engineering and Computer Science

Learning Deep Pyramid-based Representations for Pansharpening

Research Article-Computer Engineering and Computer Science

IRText: An Item Response Theory-Based Approach for Text Categorization

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.