Skip to main content
Top
Published in: The Journal of Supercomputing 9/2019

21-06-2019

Research on SVM environment performance of parallel computing based on large data set of machine learning

Authors: Yunlu Gong, Lianguo Jia

Published in: The Journal of Supercomputing | Issue 9/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The support vector machine (SVM) algorithm is widely used in various fields because of its good classification effect, simplicity and practicability. However, the support vector machine calculates the support vector by quadratic programming, and the solution of quadratic programming will calculate the n-order matrix. When the amount of data is large, the calculation and storage of the n-order matrix will make the optimization speed very slow, even lead to memory overflow and interrupt operation. Using the big data computing platform Spark to improve the support vector machine algorithm can solve the above problems, but it’s not competent for multi-classification problems. Therefore, this paper starts with constructing multiple classifiers, combines the Spark framework of big data programming model and the classification characteristics of support vector machine to realize a parallel one-to-many SVM optimization algorithm based on large data sets and compares them through UCI data sets. In the experiments, the one-to-many support vector machine improved by Spark is obviously better than the one-to-many support vector machine in the single-machine environment. The simulation results show that the proposed algorithm has better performance.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Danchin A, Ouzounis C, Tokuyasu T et al (2018) No wisdom in the crowd: genome annotation in the era of big data—current status and future prospects[J]. Microb Biotechnol 11(4):588–605CrossRef Danchin A, Ouzounis C, Tokuyasu T et al (2018) No wisdom in the crowd: genome annotation in the era of big data—current status and future prospects[J]. Microb Biotechnol 11(4):588–605CrossRef
2.
go back to reference Grover P, Kar AK (2017) Big data analytics: a review on theoretical contributions and tools used in literature[J]. Glob J Flex Syst Manag 18(3):1–27CrossRef Grover P, Kar AK (2017) Big data analytics: a review on theoretical contributions and tools used in literature[J]. Glob J Flex Syst Manag 18(3):1–27CrossRef
3.
go back to reference Zhou S, Hu C et al (2016) Research on supply chain risk assessment with intuitionistic fuzzy information[J]. J Intell Fuzzy Syst 30(6):3367–3372CrossRef Zhou S, Hu C et al (2016) Research on supply chain risk assessment with intuitionistic fuzzy information[J]. J Intell Fuzzy Syst 30(6):3367–3372CrossRef
4.
go back to reference Li D, Li M, Liu J (2018) Evolutionary trust scheme of certificate game in mobile cloud computing[J]. Soft Comput 22(7):2245–2255CrossRef Li D, Li M, Liu J (2018) Evolutionary trust scheme of certificate game in mobile cloud computing[J]. Soft Comput 22(7):2245–2255CrossRef
5.
go back to reference Rad MH, Patooghy A, Fazeli M (2017) An Efficient Programming Skeleton for Clusters of Multi-Core Processors[J]. Int J Parallel Program 2:1–16 Rad MH, Patooghy A, Fazeli M (2017) An Efficient Programming Skeleton for Clusters of Multi-Core Processors[J]. Int J Parallel Program 2:1–16
6.
go back to reference Dastgeer U, Kessler C (2016) Smart Containers and Skeleton Programming for GPU-Based Systems[J]. Int J Parallel Program 44(3):1–25CrossRef Dastgeer U, Kessler C (2016) Smart Containers and Skeleton Programming for GPU-Based Systems[J]. Int J Parallel Program 44(3):1–25CrossRef
7.
go back to reference Kim Y, Lee J, Kim D et al (2017) ScaleGPU: gPU Architecture for Memory-Unaware GPU Programming[J]. IEEE Comput Archit Lett 13(2):101–104CrossRef Kim Y, Lee J, Kim D et al (2017) ScaleGPU: gPU Architecture for Memory-Unaware GPU Programming[J]. IEEE Comput Archit Lett 13(2):101–104CrossRef
8.
go back to reference Sitaridi EA, Ross KA (2016) GPU-accelerated string matching for database applications[J]. VLDB J 25(5):719–740CrossRef Sitaridi EA, Ross KA (2016) GPU-accelerated string matching for database applications[J]. VLDB J 25(5):719–740CrossRef
9.
go back to reference Fang Y, Chen Q, Xiong NN et al (2017) RGCA: a reliable GPU cluster architecture for large-scale internet of things computing based on effective performance-energy optimization[J]. Sensors 17(8):1799CrossRef Fang Y, Chen Q, Xiong NN et al (2017) RGCA: a reliable GPU cluster architecture for large-scale internet of things computing based on effective performance-energy optimization[J]. Sensors 17(8):1799CrossRef
10.
go back to reference Washington ID, Swartz CLE (2017) A parallel structure exploiting nonlinear programming algorithm for multiperiod dynamic optimization[J]. Comput Chem Eng 103:151–164CrossRef Washington ID, Swartz CLE (2017) A parallel structure exploiting nonlinear programming algorithm for multiperiod dynamic optimization[J]. Comput Chem Eng 103:151–164CrossRef
11.
go back to reference Gerardo G, Martinez-Velasco JA (2017) Evaluation of MATPOWER and OpenDSS load flow calculations in power systems using parallel computing[J]. J Eng 2017(6):195–204 Gerardo G, Martinez-Velasco JA (2017) Evaluation of MATPOWER and OpenDSS load flow calculations in power systems using parallel computing[J]. J Eng 2017(6):195–204
12.
go back to reference Kumar N, Lee JH, Chilamkurti N et al (2016) Energy-efficient multimedia data dissemination in vehicular clouds: stochastic-reward-nets-based coalition game approach[J]. IEEE Syst J 10(2):847–858CrossRef Kumar N, Lee JH, Chilamkurti N et al (2016) Energy-efficient multimedia data dissemination in vehicular clouds: stochastic-reward-nets-based coalition game approach[J]. IEEE Syst J 10(2):847–858CrossRef
13.
go back to reference Madar V, Batista S (2016) FastLSU: a more practical approach for the Benjamini-Hochberg FDR controlling procedure for huge-scale testing problems[J]. Bioinformatics 32(11):1716–1723CrossRef Madar V, Batista S (2016) FastLSU: a more practical approach for the Benjamini-Hochberg FDR controlling procedure for huge-scale testing problems[J]. Bioinformatics 32(11):1716–1723CrossRef
14.
go back to reference Allombert V, Gava F, Tesson J (2016) Multi-ML: programming multi-BSP algorithms in ML[J]. Int J Parallel Program 45(2):340–361CrossRef Allombert V, Gava F, Tesson J (2016) Multi-ML: programming multi-BSP algorithms in ML[J]. Int J Parallel Program 45(2):340–361CrossRef
15.
go back to reference Tetko IV, Varbanov HP, Galanski M et al (2016) Prediction of logP for Pt(II) and Pt(IV) complexes: comparison of statistical and quantum-chemistry based approaches[J]. J Inorg Biochem 156:1–13CrossRef Tetko IV, Varbanov HP, Galanski M et al (2016) Prediction of logP for Pt(II) and Pt(IV) complexes: comparison of statistical and quantum-chemistry based approaches[J]. J Inorg Biochem 156:1–13CrossRef
16.
go back to reference Janka E, Vincze F, Ádány Róza et al (2018) Is the definition of Roma an important matter? The parallel application of self and external classification of ethnicity in a population-based health interview survey[J]. Int J Environ Res Public Health 15(2):353CrossRef Janka E, Vincze F, Ádány Róza et al (2018) Is the definition of Roma an important matter? The parallel application of self and external classification of ethnicity in a population-based health interview survey[J]. Int J Environ Res Public Health 15(2):353CrossRef
17.
go back to reference Zhang C, Yang Y, Du Z et al (2016) Particle swarm optimization algorithm based on ontology model to support cloud computing applications[J]. J Ambient Intell Humaniz Comput 7(5):633–638CrossRef Zhang C, Yang Y, Du Z et al (2016) Particle swarm optimization algorithm based on ontology model to support cloud computing applications[J]. J Ambient Intell Humaniz Comput 7(5):633–638CrossRef
18.
go back to reference Kim JS, Lee S, Chung MY (2018) Time-division random-access scheme based on coverage level for cellular internet-of-things in 3GPP networks[J]. Pervasive Mob Comput 44:45–57CrossRef Kim JS, Lee S, Chung MY (2018) Time-division random-access scheme based on coverage level for cellular internet-of-things in 3GPP networks[J]. Pervasive Mob Comput 44:45–57CrossRef
19.
go back to reference Magnus JR, Luca GD (2016) Weighted-average least squares (WALS): a survey[J]. J Econ Surv 30(1):117–148CrossRef Magnus JR, Luca GD (2016) Weighted-average least squares (WALS): a survey[J]. J Econ Surv 30(1):117–148CrossRef
20.
go back to reference Chang C, Srirama S, Ling S (2015) Service discovery and trust in mobile social network in proximity[J]. Comput Sci SI:58–62 Chang C, Srirama S, Ling S (2015) Service discovery and trust in mobile social network in proximity[J]. Comput Sci SI:58–62
21.
go back to reference Ripepi V, Cignoni M, Tosi M et al (2016) The VST survey of the SMC and the Magellanic bridge (STEP): first results[J]. Universe of Digital Sky Surv 42(5):809–811 Ripepi V, Cignoni M, Tosi M et al (2016) The VST survey of the SMC and the Magellanic bridge (STEP): first results[J]. Universe of Digital Sky Surv 42(5):809–811
22.
go back to reference Babiceanu RF, Seker R (2016) Big Data and virtualization for manufacturing cyber-physical systems: a survey of the current status and future outlook[J]. Comput Ind 81(C):128–137CrossRef Babiceanu RF, Seker R (2016) Big Data and virtualization for manufacturing cyber-physical systems: a survey of the current status and future outlook[J]. Comput Ind 81(C):128–137CrossRef
23.
go back to reference Smith C, Albarghouthi A (2016) MapReduce program synthesis[J]. Acm Sigplan Not 51(6):326–340CrossRef Smith C, Albarghouthi A (2016) MapReduce program synthesis[J]. Acm Sigplan Not 51(6):326–340CrossRef
24.
go back to reference Wu X, Chen L, Wei D et al (2016) Static analysis of runtime errors in interrupt-driven programs via sequentialization[J]. Acm Trans Embed Comput Syst 15(4):70 Wu X, Chen L, Wei D et al (2016) Static analysis of runtime errors in interrupt-driven programs via sequentialization[J]. Acm Trans Embed Comput Syst 15(4):70
25.
go back to reference Bhatia V, Rani R (2018) Ap-FSM: a parallel algorithm for approximate frequent subgraph mining using pregel[J]. Expert Syst Appl 106:217–232CrossRef Bhatia V, Rani R (2018) Ap-FSM: a parallel algorithm for approximate frequent subgraph mining using pregel[J]. Expert Syst Appl 106:217–232CrossRef
26.
go back to reference Zaharia M, Xin RS, Wendell P et al (2016) Apache spark: a unified engine for big data processing[J]. Commun ACM 59(11):56–65CrossRef Zaharia M, Xin RS, Wendell P et al (2016) Apache spark: a unified engine for big data processing[J]. Commun ACM 59(11):56–65CrossRef
27.
go back to reference Zhou S, Qian S et al (1934) A novel bearing multi-fault diagnosis approach based on weighted Permutation entropy and an improved SVM ensemble classifier[J]. Sensors 2018:18 Zhou S, Qian S et al (1934) A novel bearing multi-fault diagnosis approach based on weighted Permutation entropy and an improved SVM ensemble classifier[J]. Sensors 2018:18
Metadata
Title
Research on SVM environment performance of parallel computing based on large data set of machine learning
Authors
Yunlu Gong
Lianguo Jia
Publication date
21-06-2019
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 9/2019
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-019-02894-7

Other articles of this Issue 9/2019

The Journal of Supercomputing 9/2019 Go to the issue

Premium Partner