Skip to main content
Erschienen in: Neural Computing and Applications 5/2019

04.07.2018 | S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

Optimized feature selection algorithm based on fireflies with gravitational ant colony algorithm for big data predictive analytics

verfasst von: Osama AlFarraj, Ahmad AlZubi, Amr Tolba

Erschienen in: Neural Computing and Applications | Ausgabe 5/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Big data is an important and complex dataset consisting of a large volume of data that helps to collect, store, and analyze data, depending on its applications and predictive analytics. During the predictive process, the method examines different quantities of data, which are difficult to process because their high dimensionality leads to difficulties in examining the correlations among the data. This paper introduces a method of optimized feature selection and soft computing techniques for reducing the dimensionality of the dataset. Initially, the data were collected from various resources that contained some inconsistent data, reducing the system’s efficiency. Then, the inconsistent and noise data were removed by applying a normalized approach. Next, the optimized features were selected using the fireflies gravitational ant colony optimization (FGACO) approach. This optimized feature selection method successfully examines the characteristics and importance of the feature during the selection process. The selected feature consists of all details about particular predictive analytics. The system’s efficiency was then evaluated using different datasets. The experimental results show that FGACO performs better in terms of the sensitivity, specificity, accuracy, and the number of selected features based on time.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Wang Y, Kung L, Byrd TA (2018) Big data analytics: understanding its capabilities and potential benefits for healthcare organizations. Technol Forecast Soc Change 126:3–13CrossRef Wang Y, Kung L, Byrd TA (2018) Big data analytics: understanding its capabilities and potential benefits for healthcare organizations. Technol Forecast Soc Change 126:3–13CrossRef
3.
Zurück zum Zitat Dey N, Hassanien AE, Bhatt C, Ashour A, Satapathy SC (eds) (2018) Internet of things and big data analytics toward next-generation intelligence. Springer, Berlin Dey N, Hassanien AE, Bhatt C, Ashour A, Satapathy SC (eds) (2018) Internet of things and big data analytics toward next-generation intelligence. Springer, Berlin
4.
Zurück zum Zitat Yang C, Huang Q, Li Z, Liu K, Hu F (2017) Big data and cloud computing: innovation opportunities and challenges. Int J Digit Earth 10(1):13–53CrossRef Yang C, Huang Q, Li Z, Liu K, Hu F (2017) Big data and cloud computing: innovation opportunities and challenges. Int J Digit Earth 10(1):13–53CrossRef
9.
Zurück zum Zitat Gunasekaran A, Papadopoulos T, Dubey R, Wamba SF, Childe SJ, Hazen B, Akter S (2017) Big data and predictive analytics for supply chain and organizational performance. J Bus Res 70:308–317CrossRef Gunasekaran A, Papadopoulos T, Dubey R, Wamba SF, Childe SJ, Hazen B, Akter S (2017) Big data and predictive analytics for supply chain and organizational performance. J Bus Res 70:308–317CrossRef
11.
Zurück zum Zitat Ramírez-Gallego S, Krawczyk B, García S, Woźniak M, Herrera F (2017) A survey on data preprocessing for data stream mining: current status and future directions. Neurocomputing 239:39–57CrossRef Ramírez-Gallego S, Krawczyk B, García S, Woźniak M, Herrera F (2017) A survey on data preprocessing for data stream mining: current status and future directions. Neurocomputing 239:39–57CrossRef
12.
Zurück zum Zitat Ayhan S, Pesce J, Comitz P, Sweet D, Bliesner S, Gerberick G (2013) Predictive analytics with aviation big data. In: Integrated communications, navigation and surveillance conference (ICNS) Ayhan S, Pesce J, Comitz P, Sweet D, Bliesner S, Gerberick G (2013) Predictive analytics with aviation big data. In: Integrated communications, navigation and surveillance conference (ICNS)
13.
Zurück zum Zitat Saravanakumar NM, Eswaric T, Sampath P, Lavanya S (2015) Predictive methodology for diabetic data analysis in big data. Procedia Comput Sci 50:203–208CrossRef Saravanakumar NM, Eswaric T, Sampath P, Lavanya S (2015) Predictive methodology for diabetic data analysis in big data. Procedia Comput Sci 50:203–208CrossRef
16.
Zurück zum Zitat Gulati H (2015) Predictive analytics using data mining technique. J Comput Sustain Glob Dev (INDIACom) 713-716 Gulati H (2015) Predictive analytics using data mining technique. J Comput Sustain Glob Dev (INDIACom) 713-716
17.
Zurück zum Zitat Muthukrishnan R, Rohini R (2017) LASSO: a feature selection technique in predictive modeling for machine learning. IEEE International Conference on Advances in Computer Applications, pp 18–20 Muthukrishnan R, Rohini R (2017) LASSO: a feature selection technique in predictive modeling for machine learning. IEEE International Conference on Advances in Computer Applications, pp 18–20
18.
Zurück zum Zitat Weyland D (2015) A critical analysis of the harmony search algorithm—how not to solve sudoku. Oper Res Perspect 2:97–105MathSciNetCrossRef Weyland D (2015) A critical analysis of the harmony search algorithm—how not to solve sudoku. Oper Res Perspect 2:97–105MathSciNetCrossRef
20.
Zurück zum Zitat Nair PS, VariBench Vihinen M (2013) A benchmark database for variations. Hum Mutat 34(1):42–49CrossRef Nair PS, VariBench Vihinen M (2013) A benchmark database for variations. Hum Mutat 34(1):42–49CrossRef
21.
Zurück zum Zitat Smart OS, Horský V, Gore S, Svobodová Vařeková R, Bendová V, Kleywegt GJ, Velankar S (2018) Worldwide Protein Data Bank validation information: usage and trends. Acta Crystallogr Sect D 74(3):237–244CrossRef Smart OS, Horský V, Gore S, Svobodová Vařeková R, Bendová V, Kleywegt GJ, Velankar S (2018) Worldwide Protein Data Bank validation information: usage and trends. Acta Crystallogr Sect D 74(3):237–244CrossRef
22.
Zurück zum Zitat Moro S, Cortez P, Rita P (2014) A data-driven approach to predict the success of bank telemarketing. Decis Support Syst 62:22–31CrossRef Moro S, Cortez P, Rita P (2014) A data-driven approach to predict the success of bank telemarketing. Decis Support Syst 62:22–31CrossRef
23.
Zurück zum Zitat Hong ZQ, Yang JY (1991) Optimal discriminant plane for a small number of samples and design method of classifier on the plane. Pattern Recognit 24(4):317–324MathSciNetCrossRef Hong ZQ, Yang JY (1991) Optimal discriminant plane for a small number of samples and design method of classifier on the plane. Pattern Recognit 24(4):317–324MathSciNetCrossRef
24.
Zurück zum Zitat Zhang Z, Cheng Y, Liu NC (2014) Comparison of the effect of mean-based method and z-score for field normalization of citations at the level of Web of Science subject categories. Scientometrics 101(3):1679–1693CrossRef Zhang Z, Cheng Y, Liu NC (2014) Comparison of the effect of mean-based method and z-score for field normalization of citations at the level of Web of Science subject categories. Scientometrics 101(3):1679–1693CrossRef
26.
Zurück zum Zitat Zlochin M, Birattari M, Meuleau N, Dorigo M (2014) Model-based search for combinatorial optimization: a critical survey. Ann Oper Res 131(1–4):373–395MathSciNetMATH Zlochin M, Birattari M, Meuleau N, Dorigo M (2014) Model-based search for combinatorial optimization: a critical survey. Ann Oper Res 131(1–4):373–395MathSciNetMATH
27.
Zurück zum Zitat Zhang J, Chung H, Lo WL (2007) Clustering-based adaptive crossover and mutation probabilities for genetic algorithms. IEEE Trans Evolut Comput 11(3):326–335CrossRef Zhang J, Chung H, Lo WL (2007) Clustering-based adaptive crossover and mutation probabilities for genetic algorithms. IEEE Trans Evolut Comput 11(3):326–335CrossRef
28.
Zurück zum Zitat Rashedi E, Nezamabadi-Pour H, Saryazdi S (2009) GSA: a gravitational search algorithm. Inf Sci 179(13):2232–2248CrossRefMATH Rashedi E, Nezamabadi-Pour H, Saryazdi S (2009) GSA: a gravitational search algorithm. Inf Sci 179(13):2232–2248CrossRefMATH
29.
Zurück zum Zitat Majid M, Bishop JM (2013) Swarmic paintings and colour attention. In: Machado P, McDermott J, Carballal A (eds) EvoMUSART 2013, Swarmic Paintings and Colour Attention. vol 7834. LNCS, pp 97–108 Majid M, Bishop JM (2013) Swarmic paintings and colour attention. In: Machado P, McDermott J, Carballal A (eds) EvoMUSART 2013, Swarmic Paintings and Colour Attention. vol 7834. LNCS, pp 97–108
31.
Zurück zum Zitat Pang J, Gu Y, Xu J, Yu G (2018) Semi-supervised multi-graph classification using optimal feature selection and extreme learning machine. Neurocomputing 277:89–100CrossRef Pang J, Gu Y, Xu J, Yu G (2018) Semi-supervised multi-graph classification using optimal feature selection and extreme learning machine. Neurocomputing 277:89–100CrossRef
Metadaten
Titel
Optimized feature selection algorithm based on fireflies with gravitational ant colony algorithm for big data predictive analytics
verfasst von
Osama AlFarraj
Ahmad AlZubi
Amr Tolba
Publikationsdatum
04.07.2018
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 5/2019
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3612-0

Weitere Artikel der Ausgabe 5/2019

Neural Computing and Applications 5/2019 Zur Ausgabe

S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

Classifying streaming of Twitter data based on sentiment analysis using hybridization

S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

Tuberculosis (TB) detection system using deep neural networks