2021 | OriginalPaper | Book chapter

Optimization Under Uncertainty Explains Empirical Success of Deep Learning Heuristics

Written by: Vladik Kreinovich, Olga Kosheleva

Published in: Black Box Optimization, Machine Learning, and No-Free Lunch Theorems

Publisher: Springer International Publishing

Abstract

One of the main objectives of science and engineering is to predict the future state of the world and to come up with devices and strategies that would make this future state better. In some practical situations, we know how the state changes with time; e.g., in meteorology, we know the partial differential equations that describe atmospheric processes. In such situations, prediction becomes a purely computational problem. In many other situations, however, we do not know the equations describing the system's dynamics, and we need to learn these dynamics from data. At present, the most efficient way to do this is deep learning, i.e., training a neural network with a large number of layers. To make this idea truly efficient, several heuristics were discovered by trial and error, such as the use of rectified linear neurons, softmax, etc. In this chapter, we show that the empirical success of many of these heuristics can be explained by optimization-under-uncertainty techniques.
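For readers unfamiliar with the two heuristics named in the abstract, the following minimal sketch shows their standard textbook definitions. It is not taken from the chapter; the function names `relu` and `softmax` are illustrative assumptions, and the max-subtraction step is a common numerical-stability convention rather than part of the mathematical definition.

```python
import math


def relu(x):
    """Rectified linear activation: returns x for positive x, else 0."""
    return max(0.0, x)


def softmax(scores):
    """Map a list of real-valued scores to probabilities that sum to 1."""
    # Subtracting the maximum does not change the result mathematically,
    # but it avoids overflow in exp() for large scores (assumed convention).
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    print(relu(-2.0), relu(3.0))
    print(softmax([1.0, 2.0, 3.0]))
```

Larger scores receive larger probabilities under `softmax`, and equal scores receive equal probabilities.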

Title: Optimization Under Uncertainty Explains Empirical Success of Deep Learning Heuristics
Written by: Vladik Kreinovich, Olga Kosheleva
Publisher: Springer International Publishing
Book: Black Box Optimization, Machine Learning, and No-Free Lunch Theorems
Print ISBN: 978-3-030-66514-2
Electronic ISBN: 978-3-030-66515-9
Copyright year: 2021
DOI: https://doi.org/10.1007/978-3-030-66515-9_8
