Skip to main content
Top

2016 | OriginalPaper | Chapter

Improving Visual Feature Representations by Biasing Restricted Boltzmann Machines with Gaussian Filters

Authors : Arjun Yogeswaran, Pierre Payeur

Published in: Advances in Visual Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Advances in unsupervised learning have allowed the efficient learning of feature representations from large sets of unlabeled data. This paper evaluates visual features learned through unsupervised learning, specifically comparing biasing methods using Gaussian filters on a single-layer network. Using the restricted Boltzmann machine, features emerging through training on image data are compared by classification performance on standard datasets. When Gaussian filters are convolved with adjacent hidden layer activations from a single example during training, topographies emerge where adjacent features become tuned to slightly varying stimuli. When Gaussian filters are applied to the visible nodes, images become blurrier; training on these images leads to less localized features being learned. The networks are trained and tested on the CIFAR-10, STL-10, COIL-100, and MNIST datasets. It is found that the induction of topography or simple image blurring during training produce better features as evidenced by the consistent and notable increase in classification results.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Coates, A., Lee, H., Ng, A.: An analysis of single-layer networks in unsupervised feature learning. In: Proceedings of International Conference on Artificial Intelligence and Statistics, pp. 215–223 (2011) Coates, A., Lee, H., Ng, A.: An analysis of single-layer networks in unsupervised feature learning. In: Proceedings of International Conference on Artificial Intelligence and Statistics, pp. 215–223 (2011)
2.
go back to reference Goh, H., Kusmierz, L., Lim, J.-H., Thome, N., Cord, M.: Learning invariant color features with sparse topographic restricted Boltzmann machines. In: Proceedings of IEEE Conference on Image Processing, pp. 1241–1244 (2011) Goh, H., Kusmierz, L., Lim, J.-H., Thome, N., Cord, M.: Learning invariant color features with sparse topographic restricted Boltzmann machines. In: Proceedings of IEEE Conference on Image Processing, pp. 1241–1244 (2011)
3.
go back to reference Hyvärinen, A., Hoyer, P., Inki, M.: Topographic independent component analysis. Neural Comput. 13, 1527–1558 (2001)CrossRefMATH Hyvärinen, A., Hoyer, P., Inki, M.: Topographic independent component analysis. Neural Comput. 13, 1527–1558 (2001)CrossRefMATH
4.
go back to reference Kavukcuoglu, K., Ranzato, M., Fergus, R., LeCun, Y.: Learning invariant features through topographic filter maps. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1605–1612 (2009) Kavukcuoglu, K., Ranzato, M., Fergus, R., LeCun, Y.: Learning invariant features through topographic filter maps. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1605–1612 (2009)
5.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015)CrossRef He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015)CrossRef
6.
go back to reference Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 958–962 (2003) Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 958–962 (2003)
7.
go back to reference Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J., Ng, A.: Building high-level features with large scale unsupervised learning. In: Proceedings of International Conference on Machine Learning, pp. 81–88 (2012) Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., Dean, J., Ng, A.: Building high-level features with large scale unsupervised learning. In: Proceedings of International Conference on Machine Learning, pp. 81–88 (2012)
8.
go back to reference Sermanet, P., Kavukcuoglu, K., Chintala, S., LeCun, Y.: Pedestrian detection with unsupervised multi-stage feature learning. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626–3633 (2013) Sermanet, P., Kavukcuoglu, K., Chintala, S., LeCun, Y.: Pedestrian detection with unsupervised multi-stage feature learning. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626–3633 (2013)
9.
go back to reference Krizhevsky, A.: Learning multiple layers of features from tiny images. Technical report, University of Toronto (2009) Krizhevsky, A.: Learning multiple layers of features from tiny images. Technical report, University of Toronto (2009)
10.
go back to reference Nayar, S., Nene, S.A., Murase, H.: Real-time 100 object recognition system. IEEE Trans. Pattern Anal. Mach. Intell. 18, 1186–1198 (1996)CrossRef Nayar, S., Nene, S.A., Murase, H.: Real-time 100 object recognition system. IEEE Trans. Pattern Anal. Mach. Intell. 18, 1186–1198 (1996)CrossRef
12.
go back to reference Lee, H., Ekanadham, C., Ng, A.: Sparse deep belief net model for visual area V2. In: Proceedings of Advances in Neural Information Processing Systems, pp. 873–880 (2008) Lee, H., Ekanadham, C., Ng, A.: Sparse deep belief net model for visual area V2. In: Proceedings of Advances in Neural Information Processing Systems, pp. 873–880 (2008)
13.
go back to reference LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)CrossRef LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)CrossRef
14.
go back to reference Mobahi, H., Collobert, R., Weston, J.: Deep learning from temporal coherence in video. In: Proceedings of International Conference on Machine Learning, pp. 737–744 (2009) Mobahi, H., Collobert, R., Weston, J.: Deep learning from temporal coherence in video. In: Proceedings of International Conference on Machine Learning, pp. 737–744 (2009)
15.
go back to reference Larochelle, H., Bengio, S.: Classification using discriminative restricted Boltzmann machines. In: Proceedings of International Conference on Machine Learning, pp. 536–543 (2008) Larochelle, H., Bengio, S.: Classification using discriminative restricted Boltzmann machines. In: Proceedings of International Conference on Machine Learning, pp. 536–543 (2008)
Metadata
Title
Improving Visual Feature Representations by Biasing Restricted Boltzmann Machines with Gaussian Filters
Authors
Arjun Yogeswaran
Pierre Payeur
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-50835-1_74

Premium Partner