Skip to main content
Top
Published in: Evolutionary Intelligence 4/2022

06-06-2021 | Special Issue

Research on automatic cleaning algorithm of multi-dimensional network redundant data based on big data

Author: Jie Fang

Published in: Evolutionary Intelligence | Issue 4/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In order to realize the research on network redundant data cleaning based on big data, this paper designs a set of redundant data cleaning framework according to the data processing flow before data analysis. According to the spatial correlation of redundant data, a method of data cleaning is designed. In the data cleaning method, appropriate cleaning algorithms are designed for abnormal data and missing data respectively, in which mathematical probability design is applied to abnormal data to delete the data with obvious deviation from the normal data value. The spatial model and algorithm are designed by applying spatial correlation to the missing data to fill the missing data value after the redundant data is cleaned by other steps in the method. The accuracy of the model is compared with that of the common data prediction algorithm, and the accuracy between the algorithm and the redundant data set is verified.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Chai Q, Zheng W, Pan J, Lu S, Wen J (2018) Research on state monitoring and fault handling methods of intelligent distribution network based on big data analysis. Modern Electron Technol 10(4):3137–3147 Chai Q, Zheng W, Pan J, Lu S, Wen J (2018) Research on state monitoring and fault handling methods of intelligent distribution network based on big data analysis. Modern Electron Technol 10(4):3137–3147
2.
go back to reference Shen X, Li Y, Ma Y, Yang J (2019) Application of environmental monitoring system based on GIS technology in comprehensive pipe gallery. Municipal Technol 124(5):936–939 Shen X, Li Y, Ma Y, Yang J (2019) Application of environmental monitoring system based on GIS technology in comprehensive pipe gallery. Municipal Technol 124(5):936–939
3.
go back to reference Liu B, Fu Z, Wang Y, Wang P, Gao X (2018) Big data mining technology based on parallel computing and its application in power plant boiler performance optimization. Chin J Power Eng 38(6):431–439 Liu B, Fu Z, Wang Y, Wang P, Gao X (2018) Big data mining technology based on parallel computing and its application in power plant boiler performance optimization. Chin J Power Eng 38(6):431–439
4.
go back to reference Wang H, Li Z, Zhang X (2017) An adaptive audit method for data integrity in cloud storage. Comput Res Dev 54(1):172–179 Wang H, Li Z, Zhang X (2017) An adaptive audit method for data integrity in cloud storage. Comput Res Dev 54(1):172–179
5.
go back to reference Zhang S, Wang Z, Wang B (2017) Integrity detection scheme of power consumption information collection terminal based on trusted computing. Electric Power Autom Equip 12:117–124 Zhang S, Wang Z, Wang B (2017) Integrity detection scheme of power consumption information collection terminal based on trusted computing. Electric Power Autom Equip 12:117–124
6.
go back to reference Zhang R, Ma Z (2017) Simulation research on missing optimization detection of big data network information system. Comput Simul 56(9):69–81 Zhang R, Ma Z (2017) Simulation research on missing optimization detection of big data network information system. Comput Simul 56(9):69–81
7.
go back to reference Zhou J, Wang J, He T, Wang J, Li P (2018) Multi-sensor data fusion of greenhouse environment based on spatio-temporal correlation. Jiangsu Agricult Sci 89(5):31–42 Zhou J, Wang J, He T, Wang J, Li P (2018) Multi-sensor data fusion of greenhouse environment based on spatio-temporal correlation. Jiangsu Agricult Sci 89(5):31–42
8.
go back to reference Wu F (2018) Data science and big data technology: the sweet pastry in emerging majors. Friends High School Stud 63(1):1–7 Wu F (2018) Data science and big data technology: the sweet pastry in emerging majors. Friends High School Stud 63(1):1–7
9.
go back to reference Ramírez-Gallego S, Krawczyk B, García S, Woźniak M, Herrera F (2017) A survey on data preprocessing for data stream mining: current status and future directions. Neurocomputing 239:39–57CrossRef Ramírez-Gallego S, Krawczyk B, García S, Woźniak M, Herrera F (2017) A survey on data preprocessing for data stream mining: current status and future directions. Neurocomputing 239:39–57CrossRef
10.
go back to reference Sun X, Li P, Liu Y (2019) Design and implementation of smart home control system based on the internet of things. Electron Technol Softw Eng 62(7):4430–4442 Sun X, Li P, Liu Y (2019) Design and implementation of smart home control system based on the internet of things. Electron Technol Softw Eng 62(7):4430–4442
11.
go back to reference Chen W (2019) Research and analysis of building energy consumption monitoring system based on internet things technology. Green Build 01:3650–3652 Chen W (2019) Research and analysis of building energy consumption monitoring system based on internet things technology. Green Build 01:3650–3652
12.
go back to reference Wang L, Chen Q, Gao H, Ma Z, Zhang Y, He D (2018) Intelligent substation fault tracking architecture based on big data mining technology. Autom Electric Power Syst 42(03):84–91 Wang L, Chen Q, Gao H, Ma Z, Zhang Y, He D (2018) Intelligent substation fault tracking architecture based on big data mining technology. Autom Electric Power Syst 42(03):84–91
13.
go back to reference Li H, Wan X (2017) Research on mass data sharing technology based on OS2 master station system. Electron Design Eng 20:1–6 Li H, Wan X (2017) Research on mass data sharing technology based on OS2 master station system. Electron Design Eng 20:1–6
14.
go back to reference Li H, Zhang L (2017) Multi-tenant data integrity verification scheme based on two-layer authentication tree. Chin Sci Technol Paper 107(8):203–216 Li H, Zhang L (2017) Multi-tenant data integrity verification scheme based on two-layer authentication tree. Chin Sci Technol Paper 107(8):203–216
15.
go back to reference Shah JS, Rai SN, DeFilippis AP, Hill BG, Bhatnagar A, Brock GN (2017) Distribution based nearest neighbor imputation for truncated high dimensional data with applications to pre-clinical and clinical metabolomics studies. BMC Bioinform 18(1):1–13CrossRef Shah JS, Rai SN, DeFilippis AP, Hill BG, Bhatnagar A, Brock GN (2017) Distribution based nearest neighbor imputation for truncated high dimensional data with applications to pre-clinical and clinical metabolomics studies. BMC Bioinform 18(1):1–13CrossRef
16.
go back to reference Marshall DD, Powers R (2017) Beyond the paradigm: combining mass spectrometry and nuclear magnetic resonance for metabolomics. Prog Nuclear Magn Resonance Spectrosco 100:1–16CrossRef Marshall DD, Powers R (2017) Beyond the paradigm: combining mass spectrometry and nuclear magnetic resonance for metabolomics. Prog Nuclear Magn Resonance Spectrosco 100:1–16CrossRef
17.
go back to reference Xu Y (2019) The application and prospect analysis of the Internet of Things technology in the stadium system. Dig Commun World 3(08):80–89 Xu Y (2019) The application and prospect analysis of the Internet of Things technology in the stadium system. Dig Commun World 3(08):80–89
18.
go back to reference Yi T, Xi C, Weidong L, Baochang C, Liuqing D, Liyun S, Lihong H (2017) Global and untargeted metabolomics evidence of the protective effect of different extracts of Dipsacus asper Wall. ex C.B. Clarke on estrogen deficiency after ovariectomia in rats. J Ethnopharmacol 199:20–29CrossRef Yi T, Xi C, Weidong L, Baochang C, Liuqing D, Liyun S, Lihong H (2017) Global and untargeted metabolomics evidence of the protective effect of different extracts of Dipsacus asper Wall. ex C.B. Clarke on estrogen deficiency after ovariectomia in rats. J Ethnopharmacol 199:20–29CrossRef
19.
go back to reference Wang Z, Guo Z, Yang H, Liu B (2019) Analysis of the effect of population structure changes on medical and health expenditure based on vector autoregressive model. China Health Stat 37(2):307–332 Wang Z, Guo Z, Yang H, Liu B (2019) Analysis of the effect of population structure changes on medical and health expenditure based on vector autoregressive model. China Health Stat 37(2):307–332
20.
go back to reference Tao Y, Zhang H, Xu J (2018) Application research of outlier detection in big data analysis. Inf Sci 14(03):373–377 Tao Y, Zhang H, Xu J (2018) Application research of outlier detection in big data analysis. Inf Sci 14(03):373–377
21.
go back to reference Hao S, Li G, Feng J, Wang N (2018) Overview of structured data cleaning technology. J Tsinghua Univ (Nat Sci Ed) 26(1):65–74 Hao S, Li G, Feng J, Wang N (2018) Overview of structured data cleaning technology. J Tsinghua Univ (Nat Sci Ed) 26(1):65–74
22.
go back to reference Qu C, Zhang Y, Wang Y, Zhao Y (2018) Energy Internet power energy big data cleaning model based on Spark framework. Electr Meas Instrum 86(3):221–236 Qu C, Zhang Y, Wang Y, Zhao Y (2018) Energy Internet power energy big data cleaning model based on Spark framework. Electr Meas Instrum 86(3):221–236
23.
go back to reference Xu S, Mi W, Xu Z, Bo Z (2017) A dynamic data integrity verification scheme in smart grid. Comput Eng 12(8):366–371 Xu S, Mi W, Xu Z, Bo Z (2017) A dynamic data integrity verification scheme in smart grid. Comput Eng 12(8):366–371
Metadata
Title
Research on automatic cleaning algorithm of multi-dimensional network redundant data based on big data
Author
Jie Fang
Publication date
06-06-2021
Publisher
Springer Berlin Heidelberg
Published in
Evolutionary Intelligence / Issue 4/2022
Print ISSN: 1864-5909
Electronic ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-021-00620-y

Other articles of this Issue 4/2022

Evolutionary Intelligence 4/2022 Go to the issue

Premium Partner