Skip to main content

2021 | OriginalPaper | Buchkapitel

Optimization Under Uncertainty Explains Empirical Success of Deep Learning Heuristics

verfasst von : Vladik Kreinovich, Olga Kosheleva

Erschienen in: Black Box Optimization, Machine Learning, and No-Free Lunch Theorems

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

One of the main objectives of science and engineering is to predict the future state of the world and come up with devices and strategies that would make this future state better. In some practical situations, we know how the state changes with time—e.g., in meteorology, we know the partial differential equations that describe the atmospheric processes. In such situations, prediction becomes a purely computational problem. In many other situations, however, we do not know the equation describing the system’s dynamics. In such situations, we need to learn this dynamics from data. At present, the most efficient way of such learning is to use deep learning—training a neural network with a large number of layers. To make this idea truly efficient, several trial-and-error-based heuristics were discovered, such as the use of rectified linear neurons, softmax, etc. In this chapter, we show that the empirical success of many of these heuristics can be explained by optimization-under-uncertainty techniques.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aczel, J., Dhombres, J.: Functional Equations in Several Variables. Cambridge University Press, New York (2008)MATH Aczel, J., Dhombres, J.: Functional Equations in Several Variables. Cambridge University Press, New York (2008)MATH
2.
Zurück zum Zitat Autchariyapanitkul, K., Kosheleva, O., Kreinovich, V., Sriboonchitta, S.: Quantum econometrics: how to explain its quantitative successes and how the resulting formulas are related to scale invariance, entropy, and fuzziness. In: Huynh, V.-N., Inuiguchi, M., Tran, D.-H., Denoeux, Th. (eds.) Proceedings of the International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making IUKM’2018, Hanoi, Vietnam, March 13–15, 2018 Autchariyapanitkul, K., Kosheleva, O., Kreinovich, V., Sriboonchitta, S.: Quantum econometrics: how to explain its quantitative successes and how the resulting formulas are related to scale invariance, entropy, and fuzziness. In: Huynh, V.-N., Inuiguchi, M., Tran, D.-H., Denoeux, Th. (eds.) Proceedings of the International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making IUKM’2018, Hanoi, Vietnam, March 13–15, 2018
3.
Zurück zum Zitat Baral, C., Fuentes, O., Kreinovich, V.: Why deep neural networks: a possible theoretical explanation. In: Ceberio, M., Kreinovich, V. (eds.) Constraint Programming and Decision Making: Theory and Applications, pp. 1–6. Springer Verlag, Berlin (2018) Baral, C., Fuentes, O., Kreinovich, V.: Why deep neural networks: a possible theoretical explanation. In: Ceberio, M., Kreinovich, V. (eds.) Constraint Programming and Decision Making: Theory and Applications, pp. 1–6. Springer Verlag, Berlin (2018)
4.
Zurück zum Zitat Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)MATH Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)MATH
5.
Zurück zum Zitat Farhan, A., Kosheleva, O., Kreinovich, V.: Why max and average poolings are optimal in convolutional neural networks. In: Proceedings of the Seventh International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making IUKM’2019, Nara, Japan, March 27–29 (2019) Farhan, A., Kosheleva, O., Kreinovich, V.: Why max and average poolings are optimal in convolutional neural networks. In: Proceedings of the Seventh International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making IUKM’2019, Nara, Japan, March 27–29 (2019)
6.
Zurück zum Zitat Fuentes, O., Parra, J., Anthony, E., Kreinovich, V.: Why rectified linear neurons are efficient: a possible theoretical explanations. In: Kosheleva, O., Shary, S., Xiang, G., Zapatrin, R. (eds.) Beyond Traditional Probabilistic Data Processing Techniques: Interval, Fuzzy, etc. Methods and Their Applications, Springer, Cham (2019) Fuentes, O., Parra, J., Anthony, E., Kreinovich, V.: Why rectified linear neurons are efficient: a possible theoretical explanations. In: Kosheleva, O., Shary, S., Xiang, G., Zapatrin, R. (eds.) Beyond Traditional Probabilistic Data Processing Techniques: Interval, Fuzzy, etc. Methods and Their Applications, Springer, Cham (2019)
7.
Zurück zum Zitat Gholamy, A., Parra, J., Kreinovich, V., Fuentes, O., Anthony, E.: How to best apply deep neural networks in geosciences: towards optimal ‘averaging’ in dropout training. In: Watada, J., Tan, S.C., Vasant, P., Padmanabhan, E., Jain, L.C. (eds.). Smart Unconventional Modelling, Simulation and Optimization for Geosciences and Petroleum Engineering, pp. 15–26. Springer, Berlin (2019) Gholamy, A., Parra, J., Kreinovich, V., Fuentes, O., Anthony, E.: How to best apply deep neural networks in geosciences: towards optimal ‘averaging’ in dropout training. In: Watada, J., Tan, S.C., Vasant, P., Padmanabhan, E., Jain, L.C. (eds.). Smart Unconventional Modelling, Simulation and Optimization for Geosciences and Petroleum Engineering, pp. 15–26. Springer, Berlin (2019)
8.
Zurück zum Zitat Goodfellow, I., Bengio, Y., Courville, A.: Deep Leaning. MIT Press, Cambridge (2016)MATH Goodfellow, I., Bengio, Y., Courville, A.: Deep Leaning. MIT Press, Cambridge (2016)MATH
9.
Zurück zum Zitat Kainen, P.C., Kurkova, V., Kreinovich, V., Sirisaengtaksin, O.: Uniqueness of network parameterization and faster learning. Neural Parallel Sci. Comput. 2, 459–466 (1994)MathSciNetMATH Kainen, P.C., Kurkova, V., Kreinovich, V., Sirisaengtaksin, O.: Uniqueness of network parameterization and faster learning. Neural Parallel Sci. Comput. 2, 459–466 (1994)MathSciNetMATH
10.
Zurück zum Zitat Kosheleva, O., Kreinovich, V.: Why deep learning methods use KL divergence instead of least squares: a possible pedagogical explanation. Math. Struct. Model. 46, 102–106 (2018) Kosheleva, O., Kreinovich, V.: Why deep learning methods use KL divergence instead of least squares: a possible pedagogical explanation. Math. Struct. Model. 46, 102–106 (2018)
11.
Zurück zum Zitat Kreinovich, V.: Group-theoretic approach to intractable problems. Lecture Notes in Computer Science. Springer, Berlin, vol. 417, pp. 112–121 (1990) Kreinovich, V.: Group-theoretic approach to intractable problems. Lecture Notes in Computer Science. Springer, Berlin, vol. 417, pp. 112–121 (1990)
12.
Zurück zum Zitat Kreinovich, V.: From traditional neural networks to deep learning: towards mathematical foundations of empirical successes. In: Shahbazova, S.N., Kacprzyk, J., Balas, V.E., Kreinovich, V. (eds.) Proceedings of the World Conference on Soft Computing, Baku, Azerbaijan, May 29–31 (2018) Kreinovich, V.: From traditional neural networks to deep learning: towards mathematical foundations of empirical successes. In: Shahbazova, S.N., Kacprzyk, J., Balas, V.E., Kreinovich, V. (eds.) Proceedings of the World Conference on Soft Computing, Baku, Azerbaijan, May 29–31 (2018)
13.
Zurück zum Zitat Kreinovich, V., Quintana, C.: Neural networks: what non-linearity to choose? In: Proceedings of the Fourth University of New Brunswick Artificial Intelligence Workshop, pp. 627–637. Fredericton, New Brunswick (1991) Kreinovich, V., Quintana, C.: Neural networks: what non-linearity to choose? In: Proceedings of the Fourth University of New Brunswick Artificial Intelligence Workshop, pp. 627–637. Fredericton, New Brunswick (1991)
14.
Zurück zum Zitat Muela, G., Servin, C., Kreinovich, V.: How to make machine learning robust against adversarial inputs. Math. Struct. Model. 42, 127–130 (2017) Muela, G., Servin, C., Kreinovich, V.: How to make machine learning robust against adversarial inputs. Math. Struct. Model. 42, 127–130 (2017)
15.
Zurück zum Zitat Nguyen, H.T., Kreinovich, V.: Applications of Continuous Mathematics to Computer Science. Kluwer, Dordrecht (1997)CrossRef Nguyen, H.T., Kreinovich, V.: Applications of Continuous Mathematics to Computer Science. Kluwer, Dordrecht (1997)CrossRef
16.
Zurück zum Zitat Parra, J., Fuentes, O., Anthony, E., Kreinovich, V.: Prediction of volcanic eruptions: case study of rare events in chaotic systems with delay. In: Proceedings of the IEEE Conference on Systems, Man, and Cybernetics SMC’2017, Banff, Canada, October 5–8, pp. 351–356 (2017) Parra, J., Fuentes, O., Anthony, E., Kreinovich, V.: Prediction of volcanic eruptions: case study of rare events in chaotic systems with delay. In: Proceedings of the IEEE Conference on Systems, Man, and Cybernetics SMC’2017, Banff, Canada, October 5–8, pp. 351–356 (2017)
17.
Zurück zum Zitat Parra, J., Fuentes, O., Anthony, E., Kreinovich, V.: Use of machine learning to analyze and—hopefully—predict volcano activity. Acta Politech. Hung. 14(3), 209–221 (2017) Parra, J., Fuentes, O., Anthony, E., Kreinovich, V.: Use of machine learning to analyze and—hopefully—predict volcano activity. Acta Politech. Hung. 14(3), 209–221 (2017)
18.
Zurück zum Zitat Sirisaengtaksin, O., Kreinovich, V., Nguyen, H.T.: Sigmoid neurons are the safest against additive errors. In: Proceedings of the First International Conference on Neural, Parallel, and Scientific Computations, Atlanta, GA, May 28–31, vol. 1, pp. 419–423 (1995) Sirisaengtaksin, O., Kreinovich, V., Nguyen, H.T.: Sigmoid neurons are the safest against additive errors. In: Proceedings of the First International Conference on Neural, Parallel, and Scientific Computations, Atlanta, GA, May 28–31, vol. 1, pp. 419–423 (1995)
19.
Zurück zum Zitat Wiener, N.: Cybernetics: Or Control and Communication in the Animal and the Machine. MIT Press, Cambridge (1948) Wiener, N.: Cybernetics: Or Control and Communication in the Animal and the Machine. MIT Press, Cambridge (1948)
Metadaten
Titel
Optimization Under Uncertainty Explains Empirical Success of Deep Learning Heuristics
verfasst von
Vladik Kreinovich
Olga Kosheleva
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-66515-9_8

Premium Partner