Skip to main content

2017 | OriginalPaper | Buchkapitel

Markov Random Field Based Convolutional Neural Networks for Image Classification

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In image classification, deriving efficient image representations from raw data is a key focus as it can largely determine the performance of a vision system. Conventional methods extract low-level features based on experiments or certain theories, whilst deep learning approaches learn image representations hierarchically with multiple layers of abstraction from vast number of sample images. Markov random fields are generative, flexible and stochastic image texture models, in which global image representations can be obtained by means of local conditional probabilities. Texture has been strongly linked to human visual perception. The ability of deriving global description from local structure shares compatibility with convolutional neural networks. Inspired by this property, we investigate the combination of Markov random field models with deep convolutional neural networks for image classification. Various filters from Markov random field models are first derived to form the features maps. Then convolutional neural networks are trained with prefixed filter banks. Comprehensive experiments conducted on the MNIST dataset, EMNIST database and CIFAR-10 object database are reported.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)CrossRef Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)CrossRef
2.
Zurück zum Zitat Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRefMATH Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRefMATH
3.
Zurück zum Zitat LeCun, Y., Bengio, Y., Hinton, G.E.: Deep learning. Nature 521(7553), 436–444 (2015)CrossRef LeCun, Y., Bengio, Y., Hinton, G.E.: Deep learning. Nature 521(7553), 436–444 (2015)CrossRef
4.
Zurück zum Zitat LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional networks and applications in vision. In: Proceedings of IEEE International Symposium on Circuits and Systems, pp. 253–256 (2010) LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional networks and applications in vision. In: Proceedings of IEEE International Symposium on Circuits and Systems, pp. 253–256 (2010)
5.
Zurück zum Zitat Julesz, B.: Visual pattern discrimination. IRE Trans. Inf. Theor. 8(2), 84–92 (1962)CrossRef Julesz, B.: Visual pattern discrimination. IRE Trans. Inf. Theor. 8(2), 84–92 (1962)CrossRef
6.
Zurück zum Zitat Julesz, B.: Textons, the elements of texture perception, and their interactions. Nature 290(5802), 91–97 (1981)CrossRef Julesz, B.: Textons, the elements of texture perception, and their interactions. Nature 290(5802), 91–97 (1981)CrossRef
7.
Zurück zum Zitat Li, S.Z.: A Markov random field model for object matching under contextual constraints. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 866 (1994) Li, S.Z.: A Markov random field model for object matching under contextual constraints. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 866 (1994)
8.
Zurück zum Zitat Li, S.Z.: Modeling image analysis problems using Markov random fields. Stochast. Processes Model. Simul. 20(5), 1–43 (2000) Li, S.Z.: Modeling image analysis problems using Markov random fields. Stochast. Processes Model. Simul. 20(5), 1–43 (2000)
9.
Zurück zum Zitat Geman, S., Geman, D.: Stochastic relaxation, gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6(6), 721–741 (1984)CrossRefMATH Geman, S., Geman, D.: Stochastic relaxation, gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6(6), 721–741 (1984)CrossRefMATH
10.
Zurück zum Zitat LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
11.
Zurück zum Zitat Cohen, G., Afshar, S., Tapson, J., van Schaik, A.: Emnist: an extension of mnist to handwritten letters (2017). arXiv preprint: arXiv:1702.05373 Cohen, G., Afshar, S., Tapson, J., van Schaik, A.: Emnist: an extension of mnist to handwritten letters (2017). arXiv preprint: arXiv:​1702.​05373
12.
Zurück zum Zitat Krizhevsky, A., Hinton, G.E.: Learning multiple layers of features from tiny images. Master Thesis, the University of Toronto (2009) Krizhevsky, A., Hinton, G.E.: Learning multiple layers of features from tiny images. Master Thesis, the University of Toronto (2009)
13.
Zurück zum Zitat Cross, G.R., Jain, A.K.: Markov random field texture models. IEEE Trans. Pattern Anal. Mach. Intell. 5(1), 25–39 (1983)CrossRef Cross, G.R., Jain, A.K.: Markov random field texture models. IEEE Trans. Pattern Anal. Mach. Intell. 5(1), 25–39 (1983)CrossRef
14.
Zurück zum Zitat Ising, E.: Beitrag zur theorie des ferromagnetismus. Zeitschrift für Physik 31(1), 253–258 (1925)CrossRef Ising, E.: Beitrag zur theorie des ferromagnetismus. Zeitschrift für Physik 31(1), 253–258 (1925)CrossRef
15.
Zurück zum Zitat Yin, H., Allinson, N.M.: Unsupervised segmentation of textured images using a hierarchical neural structure. Electron. Lett. 30(22), 1842–1843 (1994)CrossRef Yin, H., Allinson, N.M.: Unsupervised segmentation of textured images using a hierarchical neural structure. Electron. Lett. 30(22), 1842–1843 (1994)CrossRef
16.
Zurück zum Zitat Nishii, R., Eguchi, S.: Image classification based on Markov random field models with jeffreys divergence. J. Multivar. Anal. 97(9), 1997–2008 (2006)MathSciNetCrossRefMATH Nishii, R., Eguchi, S.: Image classification based on Markov random field models with jeffreys divergence. J. Multivar. Anal. 97(9), 1997–2008 (2006)MathSciNetCrossRefMATH
17.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
18.
Zurück zum Zitat Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.1007/978-3-319-10590-1_53 Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.​1007/​978-3-319-10590-1_​53
19.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). arXiv preprint: arXiv:1409.1556 Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). arXiv preprint: arXiv:​1409.​1556
20.
Zurück zum Zitat Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
21.
Zurück zum Zitat Cireşan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3642–3649. IEEE (2012) Cireşan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3642–3649. IEEE (2012)
22.
Zurück zum Zitat Jarrett, K., Kavukcuoglu, K., LeCun, Y., et al.: What is the best multi-stage architecture for object recognition? In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2146–2153 (2009) Jarrett, K., Kavukcuoglu, K., LeCun, Y., et al.: What is the best multi-stage architecture for object recognition? In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2146–2153 (2009)
24.
Zurück zum Zitat Zeiler, M.D., Fergus, R.: Stochastic pooling for regularization of deep convolutional neural networks (2013). arXiv preprint: arXiv:1301.3557 Zeiler, M.D., Fergus, R.: Stochastic pooling for regularization of deep convolutional neural networks (2013). arXiv preprint: arXiv:​1301.​3557
25.
Zurück zum Zitat Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: the all convolutional net (2014). arXiv preprint: arXiv:1412.6806 Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: the all convolutional net (2014). arXiv preprint: arXiv:​1412.​6806
26.
Zurück zum Zitat Bruna, J., Mallat, S.: Invariant scattering convolution networks. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1872–1886 (2013)CrossRef Bruna, J., Mallat, S.: Invariant scattering convolution networks. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1872–1886 (2013)CrossRef
27.
Zurück zum Zitat Chan, T.H., Jia, K., Gao, S., Lu, J., Zeng, Z., Ma, Y.: PCANet: a simple deep learning baseline for image classification? IEEE Trans. Image Process. 24(12), 5017–5032 (2015)MathSciNetCrossRef Chan, T.H., Jia, K., Gao, S., Lu, J., Zeng, Z., Ma, Y.: PCANet: a simple deep learning baseline for image classification? IEEE Trans. Image Process. 24(12), 5017–5032 (2015)MathSciNetCrossRef
28.
Zurück zum Zitat Ng, C.J., Teoh, A.B.J.: Dctnet: a simple learning-free approach for face recognition. In: Proceedings of Asia-Pacific Conference on Signal and Information Processing Association Annual Summit, pp. 761–768 (2015) Ng, C.J., Teoh, A.B.J.: Dctnet: a simple learning-free approach for face recognition. In: Proceedings of Asia-Pacific Conference on Signal and Information Processing Association Annual Summit, pp. 761–768 (2015)
29.
Zurück zum Zitat Besag, J.: Spatial interaction and the statistical analysis of lattice systems. J. R. Stat. Soc. Ser. B 36, 192–236 (1974)MathSciNetMATH Besag, J.: Spatial interaction and the statistical analysis of lattice systems. J. R. Stat. Soc. Ser. B 36, 192–236 (1974)MathSciNetMATH
30.
Zurück zum Zitat Hammersley, J.M., Clifford, P.E.: Markov random fields on finite graphs and lattices. Unpublished manuscript (1971) Hammersley, J.M., Clifford, P.E.: Markov random fields on finite graphs and lattices. Unpublished manuscript (1971)
31.
Zurück zum Zitat Elliott, H., Derin, H., Cristi, R., Geman, D.: Application of the gibbs distribution to image segmentation. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 9, pp. 678–681 (1984) Elliott, H., Derin, H., Cristi, R., Geman, D.: Application of the gibbs distribution to image segmentation. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 9, pp. 678–681 (1984)
32.
Zurück zum Zitat Derin, H., Elliott, H.: Modeling and segmentation of noisy and textured images using gibbs random fields. IEEE Trans. Pattern Anal. Mach. Intell. 9(1), 39–55 (1987)CrossRef Derin, H., Elliott, H.: Modeling and segmentation of noisy and textured images using gibbs random fields. IEEE Trans. Pattern Anal. Mach. Intell. 9(1), 39–55 (1987)CrossRef
33.
Zurück zum Zitat Kashyap, R., Chellappa, R.: Estimation and choice of neighbors in spatial-interaction models of images. IEEE Trans. Inf. Theor. 29(1), 60–72 (1983)CrossRefMATH Kashyap, R., Chellappa, R.: Estimation and choice of neighbors in spatial-interaction models of images. IEEE Trans. Inf. Theor. 29(1), 60–72 (1983)CrossRefMATH
34.
Zurück zum Zitat Dass, S.C.: Markov random field models for directional field and singularity extraction in fingerprint images. IEEE Trans. Image Process. 13(10), 1358–1367 (2004)CrossRef Dass, S.C.: Markov random field models for directional field and singularity extraction in fingerprint images. IEEE Trans. Image Process. 13(10), 1358–1367 (2004)CrossRef
35.
Zurück zum Zitat Goodfellow, I.J., Warde-Farley, D., Mirza, M., Courville, A., Bengio, Y.: Maxout networks (2013). arXiv preprint: arXiv:1302.4389 Goodfellow, I.J., Warde-Farley, D., Mirza, M., Courville, A., Bengio, Y.: Maxout networks (2013). arXiv preprint: arXiv:​1302.​4389
36.
Zurück zum Zitat Wan, L., Zeiler, M., Zhang, S., Cun, Y.L., Fergus, R.: Regularization of neural networks using dropconnect. In: Proceedings of the International Conference on Machine Learning, pp. 1058–1066 (2013) Wan, L., Zeiler, M., Zhang, S., Cun, Y.L., Fergus, R.: Regularization of neural networks using dropconnect. In: Proceedings of the International Conference on Machine Learning, pp. 1058–1066 (2013)
Metadaten
Titel
Markov Random Field Based Convolutional Neural Networks for Image Classification
verfasst von
Yao Peng
Hujun Yin
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-68935-7_42

Premium Partner