Skip to main content
Top
Published in: Service Oriented Computing and Applications 4/2019

05-07-2019 | SPECIAL ISSUE PAPER

Hydrological stream data pipeline framework based on IoTDB

Authors: YuanSheng Lou, Yu Qin, Feng Ye, Peng Zhang, Yong Chen

Published in: Service Oriented Computing and Applications | Issue 4/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

With the increasing amount of hydrological data in Chuhe river basin, the traditional relational database has been unable to meet the needs of users, which not only makes it difficult to achieve low latency and high throughput in the real-time transmission of hydrological data, but also causes the phenomenon of long time or even system crash when querying large amount of annual water-level data. To solve this problem, this paper proposes a stream data pipeline framework based on timeseries databases IoTDB and Kafka, which can provide services for hydrological early warning and anomaly detection researchers. Based on the hydrological sensor data of Chuhe river, the processing scenarios of sensor stream data are set and compared with other NoSQL (HBase, MongoDB, RiakTS and Redis) in different scenarios. The performance and workload of different NoSQL in this data pipeline are tested. Finally, it is docked with Flink real-time stream data processing platform and compared with other data pipelines. The experimental results show that the stream data pipeline composed of IoTDB, Kafka and Flink is outstanding in data acquisition, transmission, incremental query and data analysis.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Tang E, Fan Y (2017) Performance comparison between five NoSQL databases. In: International conference on cloud computing & big data. IEEE Tang E, Fan Y (2017) Performance comparison between five NoSQL databases. In: International conference on cloud computing & big data. IEEE
2.
go back to reference Kang L, Deolalikar V, Pradhan N (2015) Big data gathering and mining pipelines for CRM using open-source. In: IEEE international conference on big data Kang L, Deolalikar V, Pradhan N (2015) Big data gathering and mining pipelines for CRM using open-source. In: IEEE international conference on big data
3.
go back to reference Raj P (2018) A detailed analysis of NoSQL and NewSQL databases for bigdata analytics and distributed computing. Adv Comput 109:1–48CrossRef Raj P (2018) A detailed analysis of NoSQL and NewSQL databases for bigdata analytics and distributed computing. Adv Comput 109:1–48CrossRef
4.
go back to reference Lawlor B, Lynch R, Mac MA, Walsh P (2018) Field of genes: using apache kafka as a bioinformatic data repository. Gigascience 7(4):giy036CrossRef Lawlor B, Lynch R, Mac MA, Walsh P (2018) Field of genes: using apache kafka as a bioinformatic data repository. Gigascience 7(4):giy036CrossRef
6.
go back to reference Freire SM, Teodoro D, Wei-Kleiner F, Sundvall E, Karlsson D, Lambrix P (2016) Comparing the performance of nosql approaches for managing archetype-based electronic health record data. PLoS ONE 11(3):e0150069CrossRef Freire SM, Teodoro D, Wei-Kleiner F, Sundvall E, Karlsson D, Lambrix P (2016) Comparing the performance of nosql approaches for managing archetype-based electronic health record data. PLoS ONE 11(3):e0150069CrossRef
7.
go back to reference Nguyen CN, Kim JS, Hwang S (2016) KOHA: building a kafka-based distributed queue system on the fly in a Hadoop cluster. Foundations and applications of self* systems. In: IEEE International Workshops on IEEE Nguyen CN, Kim JS, Hwang S (2016) KOHA: building a kafka-based distributed queue system on the fly in a Hadoop cluster. Foundations and applications of self* systems. In: IEEE International Workshops on IEEE
8.
go back to reference Yi M, Ting X, Shao-Bin L (2017) Research on NoSQL distributed big data mining method in complex attribute environment. Sci Technol Eng Yi M, Ting X, Shao-Bin L (2017) Research on NoSQL distributed big data mining method in complex attribute environment. Sci Technol Eng
9.
go back to reference O’Donovan P, Leahy K, Bruton K (2015) An industrial big data pipeline for data-driven analytics maintenance applications in large-scale smart manufacturing facilities. J. Big Data 2(1):25CrossRef O’Donovan P, Leahy K, Bruton K (2015) An industrial big data pipeline for data-driven analytics maintenance applications in large-scale smart manufacturing facilities. J. Big Data 2(1):25CrossRef
10.
go back to reference Nallakaruppan MK, Kumaran US (2018) Quick fix for obstacles emerging in management recruitment measure using IOT-based candidate selection. Serv. Oriented Comput Appl 12(3–4):275–284CrossRef Nallakaruppan MK, Kumaran US (2018) Quick fix for obstacles emerging in management recruitment measure using IOT-based candidate selection. Serv. Oriented Comput Appl 12(3–4):275–284CrossRef
11.
go back to reference Zhang Q, Li S, Li Z (2015) CHARM: a cost-efficient multi-cloud data hosting scheme with high availability. IEEE Trans Cloud Comput 3(3):1CrossRef Zhang Q, Li S, Li Z (2015) CHARM: a cost-efficient multi-cloud data hosting scheme with high availability. IEEE Trans Cloud Comput 3(3):1CrossRef
12.
go back to reference Al-Sakran A, Qattous H, Hijjawi M (2018) A proposed performance evaluation of NoSQL databases in the field of IoT. In: The 8th international conference on computer science and information technology (CSIT 2018). IEEE Computer Society Al-Sakran A, Qattous H, Hijjawi M (2018) A proposed performance evaluation of NoSQL databases in the field of IoT. In: The 8th international conference on computer science and information technology (CSIT 2018). IEEE Computer Society
13.
go back to reference Veloudis S, Paraskakis I, Petsos C (2017) Cloud service broker-age: enhancing resilience in virtual enterprises through service governance and quality assurance. Serv. Oriented Comput Appl 11(4):445–458CrossRef Veloudis S, Paraskakis I, Petsos C (2017) Cloud service broker-age: enhancing resilience in virtual enterprises through service governance and quality assurance. Serv. Oriented Comput Appl 11(4):445–458CrossRef
14.
go back to reference Feng Y, Peng Z, Sheng G, Yong C (2019) Intelligent Chuhe system based on the new generation of big data processing engine Flink. Water Resour Prot 2:90–94 Feng Y, Peng Z, Sheng G, Yong C (2019) Intelligent Chuhe system based on the new generation of big data processing engine Flink. Water Resour Prot 2:90–94
15.
go back to reference Reniers V, Rafique A, Van Landuyt D, Joosen W (2017) Object-nosql database mappers: a benchmark study on the performance overhead. J Internet Serv Appl 8(1):1CrossRef Reniers V, Rafique A, Van Landuyt D, Joosen W (2017) Object-nosql database mappers: a benchmark study on the performance overhead. J Internet Serv Appl 8(1):1CrossRef
Metadata
Title
Hydrological stream data pipeline framework based on IoTDB
Authors
YuanSheng Lou
Yu Qin
Feng Ye
Peng Zhang
Yong Chen
Publication date
05-07-2019
Publisher
Springer London
Published in
Service Oriented Computing and Applications / Issue 4/2019
Print ISSN: 1863-2386
Electronic ISSN: 1863-2394
DOI
https://doi.org/10.1007/s11761-019-00267-9

Other articles of this Issue 4/2019

Service Oriented Computing and Applications 4/2019 Go to the issue

Premium Partner