nach oben

Erschienen in:

2017 | OriginalPaper | Buchkapitel

Markov Random Field Based Convolutional Neural Networks for Image Classification

verfasst von : Yao Peng, Hujun Yin

Erschienen in: Intelligent Data Engineering and Automated Learning – IDEAL 2017

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

In image classification, deriving efficient image representations from raw data is a key focus as it can largely determine the performance of a vision system. Conventional methods extract low-level features based on experiments or certain theories, whilst deep learning approaches learn image representations hierarchically with multiple layers of abstraction from vast number of sample images. Markov random fields are generative, flexible and stochastic image texture models, in which global image representations can be obtained by means of local conditional probabilities. Texture has been strongly linked to human visual perception. The ability of deriving global description from local structure shares compatibility with convolutional neural networks. Inspired by this property, we investigate the combination of Markov random field models with deep convolutional neural networks for image classification. Various filters from Markov random field models are first derived to form the features maps. Then convolutional neural networks are trained with prefixed filter banks. Comprehensive experiments conducted on the MNIST dataset, EMNIST database and CIFAR-10 object database are reported.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel The Theory of Modified Rings Game

Nächstes Kapitel Using the Multivariate Normal to Improve Random Projections

Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)CrossRef

Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)MathSciNetCrossRefMATH

LeCun, Y., Bengio, Y., Hinton, G.E.: Deep learning. Nature 521(7553), 436–444 (2015)CrossRef

LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional networks and applications in vision. In: Proceedings of IEEE International Symposium on Circuits and Systems, pp. 253–256 (2010)

Julesz, B.: Visual pattern discrimination. IRE Trans. Inf. Theor. 8(2), 84–92 (1962)CrossRef

Julesz, B.: Textons, the elements of texture perception, and their interactions. Nature 290(5802), 91–97 (1981)CrossRef

Li, S.Z.: A Markov random field model for object matching under contextual constraints. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 866 (1994)

Li, S.Z.: Modeling image analysis problems using Markov random fields. Stochast. Processes Model. Simul. 20(5), 1–43 (2000)

Geman, S., Geman, D.: Stochastic relaxation, gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6(6), 721–741 (1984)CrossRefMATH

10.

LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef

11.

Cohen, G., Afshar, S., Tapson, J., van Schaik, A.: Emnist: an extension of mnist to handwritten letters (2017). arXiv preprint: arXiv:1702.05373

12.

Krizhevsky, A., Hinton, G.E.: Learning multiple layers of features from tiny images. Master Thesis, the University of Toronto (2009)

13.

Cross, G.R., Jain, A.K.: Markov random field texture models. IEEE Trans. Pattern Anal. Mach. Intell. 5(1), 25–39 (1983)CrossRef

14.

Ising, E.: Beitrag zur theorie des ferromagnetismus. Zeitschrift für Physik 31(1), 253–258 (1925)CrossRef

15.

Yin, H., Allinson, N.M.: Unsupervised segmentation of textured images using a hierarchical neural structure. Electron. Lett. 30(22), 1842–1843 (1994)CrossRef

16.

Nishii, R., Eguchi, S.: Image classification based on Markov random field models with jeffreys divergence. J. Multivar. Anal. 97(9), 1997–2008 (2006)MathSciNetCrossRefMATH

17.

Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

18.

Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.1007/978-3-319-10590-1_53

19.

Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). arXiv preprint: arXiv:1409.1556

20.

Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)

21.

Cireşan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3642–3649. IEEE (2012)

22.

Jarrett, K., Kavukcuoglu, K., LeCun, Y., et al.: What is the best multi-stage architecture for object recognition? In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2146–2153 (2009)

23.

Lin, M., Chen, Q., Yan, S.: Network in network (2013) arXiv preprint: arXiv:1312.4400

24.

Zeiler, M.D., Fergus, R.: Stochastic pooling for regularization of deep convolutional neural networks (2013). arXiv preprint: arXiv:1301.3557

25.

Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: the all convolutional net (2014). arXiv preprint: arXiv:1412.6806

26.

Bruna, J., Mallat, S.: Invariant scattering convolution networks. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1872–1886 (2013)CrossRef

27.

Chan, T.H., Jia, K., Gao, S., Lu, J., Zeng, Z., Ma, Y.: PCANet: a simple deep learning baseline for image classification? IEEE Trans. Image Process. 24(12), 5017–5032 (2015)MathSciNetCrossRef

28.

Ng, C.J., Teoh, A.B.J.: Dctnet: a simple learning-free approach for face recognition. In: Proceedings of Asia-Pacific Conference on Signal and Information Processing Association Annual Summit, pp. 761–768 (2015)

29.

Besag, J.: Spatial interaction and the statistical analysis of lattice systems. J. R. Stat. Soc. Ser. B 36, 192–236 (1974)MathSciNetMATH

30.

Hammersley, J.M., Clifford, P.E.: Markov random fields on finite graphs and lattices. Unpublished manuscript (1971)

31.

Elliott, H., Derin, H., Cristi, R., Geman, D.: Application of the gibbs distribution to image segmentation. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 9, pp. 678–681 (1984)

32.

Derin, H., Elliott, H.: Modeling and segmentation of noisy and textured images using gibbs random fields. IEEE Trans. Pattern Anal. Mach. Intell. 9(1), 39–55 (1987)CrossRef

33.

Kashyap, R., Chellappa, R.: Estimation and choice of neighbors in spatial-interaction models of images. IEEE Trans. Inf. Theor. 29(1), 60–72 (1983)CrossRefMATH

34.

Dass, S.C.: Markov random field models for directional field and singularity extraction in fingerprint images. IEEE Trans. Image Process. 13(10), 1358–1367 (2004)CrossRef

35.

Goodfellow, I.J., Warde-Farley, D., Mirza, M., Courville, A., Bengio, Y.: Maxout networks (2013). arXiv preprint: arXiv:1302.4389

36.

Wan, L., Zeiler, M., Zhang, S., Cun, Y.L., Fergus, R.: Regularization of neural networks using dropconnect. In: Proceedings of the International Conference on Machine Learning, pp. 1058–1066 (2013)

Titel: Markov Random Field Based Convolutional Neural Networks for Image Classification
verfasst von: Yao Peng
Hujun Yin
Verlag: Springer International Publishing
Buch: Intelligent Data Engineering and Automated Learning – IDEAL 2017
Print ISBN: 978-3-319-68934-0

Electronic ISBN: 978-3-319-68935-7

Copyright-Jahr: 2017
DOI: https://doi.org/10.1007/978-3-319-68935-7_42

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner