Skip to main content

2017 | OriginalPaper | Buchkapitel

A Deep Learning-Cuckoo Search Method for Missing Data Estimation in High-Dimensional Datasets

verfasst von : Collins Leke, Alain Richard Ndjiongue, Bhekisipho Twala, Tshilidzi Marwala

Erschienen in: Advances in Swarm Intelligence

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This study brings together two related areas: deep learning and swarm intelligence for missing data estimation in high-dimensional datasets. The growing number of studies in the deep learning area warrants a closer look at its possible application in the aforementioned domain. Missing data being an unavoidable scenario in present day datasets results in different challenges which are nontrivial for existing techniques which constitute narrow artificial intelligence architectures and computational intelligence methods. This can be attributed to the large number of samples and high number of features. In this paper, we propose a new framework for the imputation procedure that uses a deep learning method with a swarm intelligence algorithm, called Deep Learning-Cuckoo Search (DL-CS). This technique is compared to similar approaches and other existing methods. The time required to obtain accurate estimates for the missing data entries surpasses that of existing methods, but this is considered a worthy bargain when the accuracy of the said estimates in a high dimensional setting are taken into consideration.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abdella, M., Marwala, T.: The use of genetic algorithms and neural networks to approximate missing data in database. In: 3rd International Conference on Computational Cybernetics. (ICCC), pp. 207–212. IEEE (2005) Abdella, M., Marwala, T.: The use of genetic algorithms and neural networks to approximate missing data in database. In: 3rd International Conference on Computational Cybernetics. (ICCC), pp. 207–212. IEEE (2005)
2.
Zurück zum Zitat Leke, C., Twala, B., Marwala, T.: Modeling of missing data prediction: computational intelligence and optimization algorithms. In: International Conference on Systems, Man and Cybernetics (SMC), pp. 1400–1404. IEEE (2014) Leke, C., Twala, B., Marwala, T.: Modeling of missing data prediction: computational intelligence and optimization algorithms. In: International Conference on Systems, Man and Cybernetics (SMC), pp. 1400–1404. IEEE (2014)
3.
Zurück zum Zitat Vukosi, M.N., Nelwamondo, F.V., Marwala, T.: Autoencoder, principal component analysis and support vector regression for data imputation. arXiv preprint arXiv:0709.2506 (2007) Vukosi, M.N., Nelwamondo, F.V., Marwala, T.: Autoencoder, principal component analysis and support vector regression for data imputation. arXiv preprint arXiv:​0709.​2506 (2007)
4.
Zurück zum Zitat Jerez, J.M., Molina, I., García-Laencina, P.J., Alba, E., Ribelles, N., Martín, M., Franco, L.: Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artif. intell. Med. 50(2), 105–115 (2010). ElsevierCrossRef Jerez, J.M., Molina, I., García-Laencina, P.J., Alba, E., Ribelles, N., Martín, M., Franco, L.: Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artif. intell. Med. 50(2), 105–115 (2010). ElsevierCrossRef
5.
Zurück zum Zitat Liew, A.W.-C., Law, N.-F., Yan, H.: Missing value imputation for gene expression data: computational techniques to recover missing data from available information. Brief. Bioinform. 12(5), 498–513 (2011). Oxford University PressCrossRef Liew, A.W.-C., Law, N.-F., Yan, H.: Missing value imputation for gene expression data: computational techniques to recover missing data from available information. Brief. Bioinform. 12(5), 498–513 (2011). Oxford University PressCrossRef
6.
Zurück zum Zitat Myers, T.A.: Goodbye, listwise deletion: presenting hot deck imputation as an easy and effective tool for handling missing data. Commun. Methods Meas. 5(4), 297–310 (2011). Taylor & FrancisCrossRef Myers, T.A.: Goodbye, listwise deletion: presenting hot deck imputation as an easy and effective tool for handling missing data. Commun. Methods Meas. 5(4), 297–310 (2011). Taylor & FrancisCrossRef
7.
Zurück zum Zitat Schafer, J.L., Graham, J.W.: Missing data: our view of the state of the art. Psychol. Methods 7(2), 147 (2002). American Psychological AssociationCrossRef Schafer, J.L., Graham, J.W.: Missing data: our view of the state of the art. Psychol. Methods 7(2), 147 (2002). American Psychological AssociationCrossRef
8.
9.
Zurück zum Zitat Leke, C., Marwala, T.: Missing data estimation in high-dimensional datasets: a swarm intelligence-deep neural network approach. In: Tan, Y., Shi, Y., Niu, B. (eds.) ICSI 2016. LNCS, vol. 9712, pp. 259–270. Springer, Cham (2016). doi:10.1007/978-3-319-41000-5_26 Leke, C., Marwala, T.: Missing data estimation in high-dimensional datasets: a swarm intelligence-deep neural network approach. In: Tan, Y., Shi, Y., Niu, B. (eds.) ICSI 2016. LNCS, vol. 9712, pp. 259–270. Springer, Cham (2016). doi:10.​1007/​978-3-319-41000-5_​26
10.
Zurück zum Zitat Finn C., Tan, X., Duan, Y., Darrell, T., Levine, S., Abbeel, P.: Deep spatial autoencoders for visuomotor learning. In: International Conference on Robotics and Automation (ICRA), pp. 512–519 (2016) Finn C., Tan, X., Duan, Y., Darrell, T., Levine, S., Abbeel, P.: Deep spatial autoencoders for visuomotor learning. In: International Conference on Robotics and Automation (ICRA), pp. 512–519 (2016)
11.
Zurück zum Zitat Ju, Y., Guo, J., Liu, S.: A deep learning method combined sparse autoencoder with SVM. In: 2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), pp. 257–260, September 2015 Ju, Y., Guo, J., Liu, S.: A deep learning method combined sparse autoencoder with SVM. In: 2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), pp. 257–260, September 2015
12.
Zurück zum Zitat Brain, L.B., Marwala, T., Tettet, T.: Autoencoder networks for HIV classification. Curr. Sci. 91(11), 1467–1473 (2006) Brain, L.B., Marwala, T., Tettet, T.: Autoencoder networks for HIV classification. Curr. Sci. 91(11), 1467–1473 (2006)
13.
Zurück zum Zitat Krizhevsky, A., Hinton, G.E.: Using very deep autoencoders for content-based image retrieval. In: 19th European Symposium on Artificial Neural Networks (ESANN), Bruges, Belgium, 27–29 April 2011 Krizhevsky, A., Hinton, G.E.: Using very deep autoencoders for content-based image retrieval. In: 19th European Symposium on Artificial Neural Networks (ESANN), Bruges, Belgium, 27–29 April 2011
14.
Zurück zum Zitat Yang, X.S., Debb, S.: Cuckoo search: recent advances and applications. Neural Comput. Appl. 24(1), 169–174 (2014)CrossRef Yang, X.S., Debb, S.: Cuckoo search: recent advances and applications. Neural Comput. Appl. 24(1), 169–174 (2014)CrossRef
15.
Zurück zum Zitat Vasanthakumar, S., Kumarappan, N., Arulraj, R., Vigneysh, T.: Cuckoo search algorithm based environmental economic dispatch of microgrid system with distributed generation. In: International Conference on Smart Technologies and Management for Computing, Communication, Controls, Energy and Materials (ICSTM), pp. 575–580. IEEE (2015) Vasanthakumar, S., Kumarappan, N., Arulraj, R., Vigneysh, T.: Cuckoo search algorithm based environmental economic dispatch of microgrid system with distributed generation. In: International Conference on Smart Technologies and Management for Computing, Communication, Controls, Energy and Materials (ICSTM), pp. 575–580. IEEE (2015)
16.
Zurück zum Zitat Wang, J., Zhou, B., Zhou, S.: An improved cuckoo search optimization algorithm for the problem of chaotic systems parameter estimation. Comput. Intell. Neurosci. 2016, 8 (2016) Wang, J., Zhou, B., Zhou, S.: An improved cuckoo search optimization algorithm for the problem of chaotic systems parameter estimation. Comput. Intell. Neurosci. 2016, 8 (2016)
17.
Zurück zum Zitat Ali, F.A., Mohamed, A.T.: A hybrid cuckoo search algorithm with Nelder Mead method for solving global optimization problems. SpringerPlus 5(1), 473 (2016). Springer International PublishingCrossRef Ali, F.A., Mohamed, A.T.: A hybrid cuckoo search algorithm with Nelder Mead method for solving global optimization problems. SpringerPlus 5(1), 473 (2016). Springer International PublishingCrossRef
18.
Zurück zum Zitat Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRefMATH Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRefMATH
Metadaten
Titel
A Deep Learning-Cuckoo Search Method for Missing Data Estimation in High-Dimensional Datasets
verfasst von
Collins Leke
Alain Richard Ndjiongue
Bhekisipho Twala
Tshilidzi Marwala
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-61824-1_61

Premium Partner