Skip to main content
Erschienen in: Neural Computing and Applications 7/2020

19.11.2018 | Original Article

Robust spike-and-slab deep Boltzmann machines for face denoising

verfasst von: Nan Zhang, Shifei Ding, Jian Zhang, Xingyu Zhao

Erschienen in: Neural Computing and Applications | Ausgabe 7/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The robust Gaussian restricted Boltzmann machine can effectively learn the structure of noise to achieve better results in the face denoising task. The robust Gaussian restricted Boltzmann machine model contains two types of the restricted Boltzmann machine (RBM) model, where a general RBM is used to model the structure of the noise and a Gaussian RBM is used to model the clean data. The spike-and-slab RBM shows better learning abilities than the Gaussian RBM in real images modeling. In addition, the deep Boltzmann machine (DBM) shows powerful image reconstruction ability. To model the real images better, we first stack the spike-and-slab RBM and the RBM to create the spike-and-slab DBM. And then, we utilize the spike-and-slab DBM instead of the Gaussian RBM to model the density of the clean data in the Robust Gaussian RBM, and the proposed method is named as the robust spike-and-slab DBM which can obtain clearer denoising images. Finally, in order to obtain better denoising results, we make use of the learned spike-and-slab DBM model and the mean field method to multi-inference the denoising data learned from the robust spike-and-slab DBM. Experimental results show that the robust spike-and-slab DBM is an effective neural network denoising method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a gaussian denoiser: residual learning of deep cnn for image denoising. IEEE Trans Image Process 26(7):3142–3155MathSciNetCrossRef Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a gaussian denoiser: residual learning of deep cnn for image denoising. IEEE Trans Image Process 26(7):3142–3155MathSciNetCrossRef
2.
Zurück zum Zitat Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2018) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848CrossRef Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2018) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848CrossRef
3.
Zurück zum Zitat Li Z, Tang J, He X (2018) Robust structured nonnegative matrix factorization for image representation. IEEE Trans Neural Netw Learn Syst 29(5):1947–1960MathSciNetCrossRef Li Z, Tang J, He X (2018) Robust structured nonnegative matrix factorization for image representation. IEEE Trans Neural Netw Learn Syst 29(5):1947–1960MathSciNetCrossRef
4.
Zurück zum Zitat Yang J, Luo L, Qian J, Tai Y, Zhang F, Xu Y (2017) Nuclear norm based matrix regression with applications to face recognition with occlusion and illumination changes. IEEE Trans Pattern Anal Mach Intell 39(1):156–171CrossRef Yang J, Luo L, Qian J, Tai Y, Zhang F, Xu Y (2017) Nuclear norm based matrix regression with applications to face recognition with occlusion and illumination changes. IEEE Trans Pattern Anal Mach Intell 39(1):156–171CrossRef
5.
Zurück zum Zitat Ouyang W, Zeng X, Wang X (2016) Partial occlusion handling in pedestrian detection with a deep model. IEEE Trans Circuits Syst Video Technol 26(11):2123–2137CrossRef Ouyang W, Zeng X, Wang X (2016) Partial occlusion handling in pedestrian detection with a deep model. IEEE Trans Circuits Syst Video Technol 26(11):2123–2137CrossRef
6.
Zurück zum Zitat Hinton G, Salakhutdinov R, Tang Y (2012) Robust Boltzmann machines for recognition and denoising. In: IEEE conference on computer vision and pattern recognition, pp 2264–2271 Hinton G, Salakhutdinov R, Tang Y (2012) Robust Boltzmann machines for recognition and denoising. In: IEEE conference on computer vision and pattern recognition, pp 2264–2271
7.
Zurück zum Zitat Shaham U, Cheng X, Dror O, Jaffe A, Nadler B, Chang J, Kluger Y (2016) A deep learning approach to unsupervised ensemble learning. In: International conference on machine learning Shaham U, Cheng X, Dror O, Jaffe A, Nadler B, Chang J, Kluger Y (2016) A deep learning approach to unsupervised ensemble learning. In: International conference on machine learning
8.
Zurück zum Zitat Zhang N, Ding S, Zhang J, Xue Y (2018) An overview on restricted Boltzmann machines. Neurocomputing 275:1186–1199CrossRef Zhang N, Ding S, Zhang J, Xue Y (2018) An overview on restricted Boltzmann machines. Neurocomputing 275:1186–1199CrossRef
9.
Zurück zum Zitat Zhang J, Ding S, Zhang N (2018) An overview on probability undirected graphs and their applications in image processing. Neurocomputing 321:156–168CrossRef Zhang J, Ding S, Zhang N (2018) An overview on probability undirected graphs and their applications in image processing. Neurocomputing 321:156–168CrossRef
10.
Zurück zum Zitat Cho KH, Ilin A, Raiko T (2011) Improved learning of Gaussian–Bernoulli restricted Boltzmann machines. In: International conference on artificial neural networks. Springer, Berlin, pp 10–17 Cho KH, Ilin A, Raiko T (2011) Improved learning of Gaussian–Bernoulli restricted Boltzmann machines. In: International conference on artificial neural networks. Springer, Berlin, pp 10–17
11.
Zurück zum Zitat Ranzato M, Hinton GE (2010) Modeling pixel means and covariances using factorized third-order Boltzmann machines. In: IEEE computer society conference on computer vision and pattern recognition, pp 2551–2558 Ranzato M, Hinton GE (2010) Modeling pixel means and covariances using factorized third-order Boltzmann machines. In: IEEE computer society conference on computer vision and pattern recognition, pp 2551–2558
12.
Zurück zum Zitat Courville A, Bergstra J, Bengio Y (2011) A spike and slab restricted Boltzmann machine. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 233–241 Courville A, Bergstra J, Bengio Y (2011) A spike and slab restricted Boltzmann machine. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 233–241
13.
Zurück zum Zitat Courville A, Desjardins G, Bergstra J, Bengio Y (2014) The Spike-and-Slab RBM and extensions to discrete and sparse data distributions. IEEE Trans Pattern Anal Mach Intell 36(9):1874–1887CrossRef Courville A, Desjardins G, Bergstra J, Bengio Y (2014) The Spike-and-Slab RBM and extensions to discrete and sparse data distributions. IEEE Trans Pattern Anal Mach Intell 36(9):1874–1887CrossRef
14.
Zurück zum Zitat Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507MathSciNetCrossRef Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507MathSciNetCrossRef
15.
Zurück zum Zitat Salakhutdinov RR, Hinton GE (2012) An efficient learning procedure for deep Boltzmann machines. Neural Comput 24(8):1967–2006MathSciNetCrossRef Salakhutdinov RR, Hinton GE (2012) An efficient learning procedure for deep Boltzmann machines. Neural Comput 24(8):1967–2006MathSciNetCrossRef
16.
Zurück zum Zitat Zhang N, Ding SF, Zhang J, Xue Y (2017) Research on point-wise gated deep networks. Appl Soft Comput 52:1210–1221CrossRef Zhang N, Ding SF, Zhang J, Xue Y (2017) Research on point-wise gated deep networks. Appl Soft Comput 52:1210–1221CrossRef
17.
Zurück zum Zitat Hinton GE (2002) Training products of experts by minimizing contrastive divergence. Neural Comput 14(8):1711–1800CrossRef Hinton GE (2002) Training products of experts by minimizing contrastive divergence. Neural Comput 14(8):1711–1800CrossRef
18.
Zurück zum Zitat Tieleman T, Hinton GE (2009) Using fast weights to improve persistent contrastive divergence. In: Annual international conference on machine learning, Montreal, pp 1033–1040 Tieleman T, Hinton GE (2009) Using fast weights to improve persistent contrastive divergence. In: Annual international conference on machine learning, Montreal, pp 1033–1040
19.
Zurück zum Zitat Phillips PJ, Wechsler H, Huang J, Rauss PJ (1998) The FERET database and evaluation procedure for face-recognition algorithms. Image Vis Comput 16(5):295–306CrossRef Phillips PJ, Wechsler H, Huang J, Rauss PJ (1998) The FERET database and evaluation procedure for face-recognition algorithms. Image Vis Comput 16(5):295–306CrossRef
20.
Zurück zum Zitat Sim T, Baker S, Bsat M (2002) The CMU pose, illumination, and expression (PIE) database. In: IEEE international conference on automatic face and gesture recognition, pp 53–58 Sim T, Baker S, Bsat M (2002) The CMU pose, illumination, and expression (PIE) database. In: IEEE international conference on automatic face and gesture recognition, pp 53–58
21.
Zurück zum Zitat Channappayya SS, Bovik AC, Jr RWH (2008) Rate bounds on SSIM index of quantized images. IEEE Trans Image Process A Publ IEEE Signal Process Soc 17(9):1624–1639MathSciNetCrossRef Channappayya SS, Bovik AC, Jr RWH (2008) Rate bounds on SSIM index of quantized images. IEEE Trans Image Process A Publ IEEE Signal Process Soc 17(9):1624–1639MathSciNetCrossRef
22.
Zurück zum Zitat Coates A, Ng A, Lee H (2011) An analysis of single-layer networks in unsupervised feature learning. In: International conference on artificial intelligence and statistics, pp 215–223 Coates A, Ng A, Lee H (2011) An analysis of single-layer networks in unsupervised feature learning. In: International conference on artificial intelligence and statistics, pp 215–223
Metadaten
Titel
Robust spike-and-slab deep Boltzmann machines for face denoising
verfasst von
Nan Zhang
Shifei Ding
Jian Zhang
Xingyu Zhao
Publikationsdatum
19.11.2018
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 7/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3866-6

Weitere Artikel der Ausgabe 7/2020

Neural Computing and Applications 7/2020 Zur Ausgabe

Deep Learning & Neural Computing for Intelligent Sensing and Control

A Q-learning-based approach for virtual network embedding in data center

Deep Learning & Neural Computing for Intelligent Sensing and Control

Prediction of air quality in Shenzhen based on neural network algorithm