Skip to main content
Top
Published in: Neural Computing and Applications 17/2020

17-12-2019 | S.I. : Green and Human Information Technology 2019

High-performance IoT streaming data prediction system using Spark: a case study of air pollution

Authors: Ho-Yong Jin, Eun-Sung Jung, Duckki Lee

Published in: Neural Computing and Applications | Issue 17/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Internet-of-Things (IoT) devices are becoming prevalent, and some of them, such as sensors, generate continuous time-series data, i.e., streaming data. These IoT streaming data are one of Big Data sources, and they require careful consideration for efficient data processing and analysis. Deep learning is emerging as a solution to IoT streaming data analytics. However, there is a persistent problem in deep learning that it takes a long time to learn neural networks. In this paper, we propose a high-performance IoT streaming data prediction system to improve the learning speed and to predict in real time. We showed the efficacy of the system through a case study of air pollution. The experimental results show that the modified LSTM autoencoder model shows the best performance compared to a generic LSTM model. We noticed that achieving the best performance requires optimizing many parameters, including learning rate, epoch, memory cell size, input timestep size, and the number of features/predictors. In that regard, we show that the high-performance data learning/prediction frameworks (e.g., Spark, Dist-Keras, and Hadoop) are essential to rapidly fine-tune a model for training and testing before real deployment of the model as data accumulate.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
8.
go back to reference Salinas D, Flunkert V, Gasthaus J (2017) DeepAR: probabilistic forecasting with autoregressive recurrent networks. arXiv:1704.04110 [cs, stat] Salinas D, Flunkert V, Gasthaus J (2017) DeepAR: probabilistic forecasting with autoregressive recurrent networks. arXiv:​1704.​04110 [cs, stat]
9.
go back to reference Bui TC, Le VD, Cha SK (2018) A deep learning approach for forecasting air pollution in South Korea using LSTM. arXiv:1804.07891 [cs, stat] Bui TC, Le VD, Cha SK (2018) A deep learning approach for forecasting air pollution in South Korea using LSTM. arXiv:​1804.​07891 [cs, stat]
13.
go back to reference Srivastava N, Mansimov E, Salakhutdinov R (2015) Unsupervised learning of video representations using LSTMs. In: Proceedings of the 32nd international conference on international conference on machine learning, vol 37, ICML’15, JMLR.org. Event-place, Lille, France, pp 843–852. http://dl.acm.org/citation.cfm?id=3045118.3045209. Accessed 16 Dec 2019 Srivastava N, Mansimov E, Salakhutdinov R (2015) Unsupervised learning of video representations using LSTMs. In: Proceedings of the 32nd international conference on international conference on machine learning, vol 37, ICML’15, JMLR.org. Event-place, Lille, France, pp 843–852. http://​dl.​acm.​org/​citation.​cfm?​id=​3045118.​3045209. Accessed 16 Dec 2019
Metadata
Title
High-performance IoT streaming data prediction system using Spark: a case study of air pollution
Authors
Ho-Yong Jin
Eun-Sung Jung
Duckki Lee
Publication date
17-12-2019
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 17/2020
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04678-9

Other articles of this Issue 17/2020

Neural Computing and Applications 17/2020 Go to the issue

Premium Partner