Published in: Neural Processing Letters 1/2023

01.06.2022

Knowledge Reverse Distillation Based Confidence Calibration for Deep Neural Networks

Authors: Xianhui Jiang, Xiaogang Deng


Abstract

Deep neural networks, as a key technical breakthrough in the field of machine learning, have been widely used in practical scenarios. However, existing deep neural networks often produce predictions with poorly calibrated, overly high confidence, which can mislead practitioners and limits the deployment of deep neural networks in high-risk decision-making fields. To address this issue, this paper proposes a confidence calibration method for deep neural networks based on a novel knowledge reverse distillation strategy. Traditional knowledge distillation treats accuracy as the knowledge and transfers it from a teacher network (usually a complex deep network) to a student network (usually a simple network). In contrast, the proposed knowledge reverse distillation strategy regards confidence as the knowledge and constructs a reverse knowledge transfer pathway, applying the confidence knowledge of the simple network to calibrate the complex deep network. Experimental results on three benchmark image datasets show that the knowledge reverse distillation strategy effectively improves the calibration capability of complex networks, so that the complex deep neural network achieves well-calibrated confidence along with high prediction accuracy.
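The abstract describes the reverse transfer only at a high level. As a rough sketch of how such a confidence transfer could look in practice, the PyTorch snippet below combines a standard cross-entropy term with a KL-divergence term that pulls the complex network's softened output distribution toward that of the simple network. The function name, the mixing weight `alpha`, and the `temperature` are illustrative assumptions for this sketch, not the paper's actual formulation.

```python
import torch
import torch.nn.functional as F

def reverse_distillation_loss(complex_logits, simple_logits, targets,
                              alpha=0.5, temperature=2.0):
    """Hypothetical calibration loss (not the paper's exact method):
    the complex network is trained on ground-truth labels while being
    pulled toward the softened confidence distribution of the simple
    network, reversing the usual teacher-to-student direction."""
    # Cross-entropy keeps the complex network accurate.
    ce = F.cross_entropy(complex_logits, targets)

    # Softened distributions; the simple network acts as the confidence
    # "teacher", so its logits are detached from the gradient graph.
    p_simple = F.softmax(simple_logits.detach() / temperature, dim=1)
    log_p_complex = F.log_softmax(complex_logits / temperature, dim=1)

    # KL(simple || complex) nudges the complex network's confidence
    # toward the better-calibrated simple network; the T^2 factor is
    # the conventional scaling for soft targets in distillation.
    kl = F.kl_div(log_p_complex, p_simple, reduction="batchmean")
    kl = kl * temperature ** 2

    return (1.0 - alpha) * ce + alpha * kl

# Minimal usage example with random stand-in logits.
complex_logits = torch.randn(8, 10, requires_grad=True)
simple_logits = torch.randn(8, 10)
targets = torch.randint(0, 10, (8,))
loss = reverse_distillation_loss(complex_logits, simple_logits, targets)
loss.backward()
```

Detaching the simple network's logits makes it play the role of a fixed confidence teacher, which mirrors the reversed direction of knowledge flow described in the abstract.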


Metadata
Title
Knowledge Reverse Distillation Based Confidence Calibration for Deep Neural Networks
Authors
Xianhui Jiang
Xiaogang Deng
Publication date
01.06.2022
Publisher
Springer US
Published in
Neural Processing Letters / Issue 1/2023
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-022-10885-8
