Skip to main content

2017 | OriginalPaper | Buchkapitel

2. Learning and Recognition Methods for Image Search and Video Retrieval

verfasst von : Ajit Puthenputhussery, Shuo Chen, Joyoung Lee, Lazar Spasovic, Chengjun Liu

Erschienen in: Recent Advances in Intelligent Image Search and Video Retrieval

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Effective learning and recognition methods play an important role in intelligent image search and video retrieval. This chapter therefore reviews some popular learning and recognition methods that are broadly applied for image search and video retrieval. First some popular deep learning methods are discussed, such as the feedforward deep neural networks, the deep autoencoders, the convolutional neural networks, and the Deep Boltzmann Machine (DBM). Second, Support Vector Machine (SVM), which is one of the popular machine learning methods, is reviewed. In particular, the linear support vector machine, the soft-margin support vector machine, the non-linear support vector machine, the simplified support vector machine, the efficient Support Vector Machine (eSVM), and the applications of SVM to image search and video retrieval are discussed. Finally, other popular kernel methods and new similarity measures are briefly reviewed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alpaydin, E.: Introduction to machine learning. The MIT Press, Cambridge (2010) Alpaydin, E.: Introduction to machine learning. The MIT Press, Cambridge (2010)
2.
Zurück zum Zitat Baldi, P.: Autoencoders, unsupervised learning, and deep architectures. J. Mach. Learn. Res. (Proceedings of ICML unsupervised and transfer learning) 27, 37–50 (2011) Baldi, P.: Autoencoders, unsupervised learning, and deep architectures. J. Mach. Learn. Res. (Proceedings of ICML unsupervised and transfer learning) 27, 37–50 (2011)
3.
Zurück zum Zitat Baudat, G., Anouar, F.: Generalized discriminant analysis using a kernel approach. Neural Comput. 12(10), 2385–2404 (2000)CrossRef Baudat, G., Anouar, F.: Generalized discriminant analysis using a kernel approach. Neural Comput. 12(10), 2385–2404 (2000)CrossRef
4.
Zurück zum Zitat Belhumeur, P.N., Hespanha, J.P., Kriegman, D.J.: Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 711–720 (1997) Belhumeur, P.N., Hespanha, J.P., Kriegman, D.J.: Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 711–720 (1997)
5.
Zurück zum Zitat Bengio, Y., Yao, L., Alain, G., Vincent, P.: Generalized denoising auto-encoders as generative models. Adv, Neural Inf. Process. Syst. 26, 899–907 (2013) Bengio, Y., Yao, L., Alain, G., Vincent, P.: Generalized denoising auto-encoders as generative models. Adv, Neural Inf. Process. Syst. 26, 899–907 (2013)
6.
Zurück zum Zitat Beveridge, J., Givens, G., Phillips, P., Draper, B.: Factors that influence algorithm performance in the face recognition grand challenge. Comput. Vis. Image Underst. 113(6), 750–762 (2009)CrossRef Beveridge, J., Givens, G., Phillips, P., Draper, B.: Factors that influence algorithm performance in the face recognition grand challenge. Comput. Vis. Image Underst. 113(6), 750–762 (2009)CrossRef
7.
Zurück zum Zitat Brodatz, P.: Textures: a photographic album for artists and designers. Dover, New York (1966) Brodatz, P.: Textures: a photographic album for artists and designers. Dover, New York (1966)
8.
Zurück zum Zitat Burges, C.: Simplified support vector decision rule. In: Proceedings of the Thirteenth International Conference on Machine Learning (ICML’96), Bari, Italy, July 3–6, 1996 (1996) Burges, C.: Simplified support vector decision rule. In: Proceedings of the Thirteenth International Conference on Machine Learning (ICML’96), Bari, Italy, July 3–6, 1996 (1996)
9.
Zurück zum Zitat Chambon, S., Crouzil, A.: Similarity measures for image matching despite occlusions in stereo vision. Pattern Recognit. 44(9), 2063–2075 (2011)CrossRef Chambon, S., Crouzil, A.: Similarity measures for image matching despite occlusions in stereo vision. Pattern Recognit. 44(9), 2063–2075 (2011)CrossRef
10.
Zurück zum Zitat Chapelle, O., Haffner, P., Vapnik, V.N.: Support vector machines for histogram-based image classification. IEEE Trans. Neural Netw. 10(5), 1055–1064 (1999)CrossRef Chapelle, O., Haffner, P., Vapnik, V.N.: Support vector machines for histogram-based image classification. IEEE Trans. Neural Netw. 10(5), 1055–1064 (1999)CrossRef
11.
Zurück zum Zitat Chen, J., Chen, C.: Reducing SVM classification time using multiple mirror classifers. IEEE Trans. Syst. Man Cybern. 34(2), 1173–1183 (2004)CrossRef Chen, J., Chen, C.: Reducing SVM classification time using multiple mirror classifers. IEEE Trans. Syst. Man Cybern. 34(2), 1173–1183 (2004)CrossRef
12.
Zurück zum Zitat Chen, S., Liu, C.: Eye detection using color information and a new efficient SVM. In: IEEE Fourth International Conference on Biometrics: Theory, Applications, and Systems (BATS’10), Washington DC, USA (2010) Chen, S., Liu, C.: Eye detection using color information and a new efficient SVM. In: IEEE Fourth International Conference on Biometrics: Theory, Applications, and Systems (BATS’10), Washington DC, USA (2010)
13.
Zurück zum Zitat Chen, S., Liu, C.: A new efficient SVM and its application to a real-time accurate eye localization system. In: International Joint Conference on Neural Networks, San Jose, California, USA (2011) Chen, S., Liu, C.: A new efficient SVM and its application to a real-time accurate eye localization system. In: International Joint Conference on Neural Networks, San Jose, California, USA (2011)
14.
Zurück zum Zitat Chen, S., Liu, C.: Eye detection using discriminatory haar features and a new efficient SVM. Image Vis. Comput. 33(c), 68–77 (2015) Chen, S., Liu, C.: Eye detection using discriminatory haar features and a new efficient SVM. Image Vis. Comput. 33(c), 68–77 (2015)
15.
Zurück zum Zitat Chen, P., Lin, C., Scholkopf, B.: A tutorial on \(\upsilon \)-support vector machines. Appl. Stoch. Models Bus. Ind. 21, 111–136 (2005)MathSciNetCrossRefMATH Chen, P., Lin, C., Scholkopf, B.: A tutorial on \(\upsilon \)-support vector machines. Appl. Stoch. Models Bus. Ind. 21, 111–136 (2005)MathSciNetCrossRefMATH
16.
Zurück zum Zitat Cooke, T.: Two variations on Fisher’s linear discriminant for pattern recognition. IEEE Trans. Pattern Anal. Mach. Intell. 24(2), 268–273 (2002) Cooke, T.: Two variations on Fisher’s linear discriminant for pattern recognition. IEEE Trans. Pattern Anal. Mach. Intell. 24(2), 268–273 (2002)
17.
Zurück zum Zitat Corso, J.J., Alahi, A., Grauman, K., Hager, G.D., Morency, L.P., Sawhney, H., Sheikh, Y.: Video Analysis for Bodyworn Cameras in Law Enforcement. The Computing Community Consortium whitepaper (2015) Corso, J.J., Alahi, A., Grauman, K., Hager, G.D., Morency, L.P., Sawhney, H., Sheikh, Y.: Video Analysis for Bodyworn Cameras in Law Enforcement. The Computing Community Consortium whitepaper (2015)
18.
Zurück zum Zitat Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000) Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)
19.
Zurück zum Zitat Dhanjal, C., Gunn, S., Shawe-Taylor, J.: Efficient sparse kernel feature extraction based on partial least squares. IEEE Trans. Pattern Anal. Mach. Intell. 31(8), 1347–1361 (2009) Dhanjal, C., Gunn, S., Shawe-Taylor, J.: Efficient sparse kernel feature extraction based on partial least squares. IEEE Trans. Pattern Anal. Mach. Intell. 31(8), 1347–1361 (2009)
20.
Zurück zum Zitat Etemad, K., Chellappa, R.: Discriminant analysis for recognition of human face images. J. Opt. Soc. Am. A 14, 1724–1733 (1997)CrossRef Etemad, K., Chellappa, R.: Discriminant analysis for recognition of human face images. J. Opt. Soc. Am. A 14, 1724–1733 (1997)CrossRef
21.
Zurück zum Zitat Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Aistats, vol. 15, p. 275 (2011) Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Aistats, vol. 15, p. 275 (2011)
22.
Zurück zum Zitat Goodfellow, I., Mirza, M., Courville, A., Bengio, Y.: Multi-prediction deep boltzmann machines. In: Advances in Neural Information Processing Systems, pp. 548–556 (2013) Goodfellow, I., Mirza, M., Courville, A., Bengio, Y.: Multi-prediction deep boltzmann machines. In: Advances in Neural Information Processing Systems, pp. 548–556 (2013)
24.
Zurück zum Zitat Gundimada, S., Asari, V.: Facial recognition using multisensor images based on localized kernel eigen spaces. IEEE Trans. Image Process. 18(6), 1314–1325 (2009)MathSciNetCrossRef Gundimada, S., Asari, V.: Facial recognition using multisensor images based on localized kernel eigen spaces. IEEE Trans. Image Process. 18(6), 1314–1325 (2009)MathSciNetCrossRef
25.
Zurück zum Zitat Guo, G., Zhang, H.J., Li, S.Z.: Distance-from-boundary as a metric for texture image retrieval. In: IEEE International Conference on Acoustics. Speech, and Signal Processing, vol. 3, pp. 1629–1632, Washington DC, USA (2001) Guo, G., Zhang, H.J., Li, S.Z.: Distance-from-boundary as a metric for texture image retrieval. In: IEEE International Conference on Acoustics. Speech, and Signal Processing, vol. 3, pp. 1629–1632, Washington DC, USA (2001)
26.
Zurück zum Zitat Haykin, S.: Neural Networks — A Comprehensive Foundation. Macmillan College Publishing Company, Inc., New York (1994) Haykin, S.: Neural Networks — A Comprehensive Foundation. Macmillan College Publishing Company, Inc., New York (1994)
27.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1026–1034 (2015) He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1026–1034 (2015)
28.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
29.
Zurück zum Zitat Herbrich, R.: Learning Kernel Classifiers: Theory and Algorithms. MIT Press, Cambridge (2002) Herbrich, R.: Learning Kernel Classifiers: Theory and Algorithms. MIT Press, Cambridge (2002)
30.
Zurück zum Zitat Hoi, C.H., Chan, C.H., Huang, K., Lyu, M.R., King, I.: Biased support vector machine for relevance feedback in image retrieval. In: 2004 IEEE International Joint Conference on Neural Networks, vol. 4, pp. 3189–3194 (2004) Hoi, C.H., Chan, C.H., Huang, K., Lyu, M.R., King, I.: Biased support vector machine for relevance feedback in image retrieval. In: 2004 IEEE International Joint Conference on Neural Networks, vol. 4, pp. 3189–3194 (2004)
31.
Zurück zum Zitat Hong, P., Tian, Q., Huang, T.S.: Incorporate support vector machines to content-based image retrieval with relevance feedback. In: 2000 International Conference on Image Processing, vol. 3, pp. 750–753 (2000) Hong, P., Tian, Q., Huang, T.S.: Incorporate support vector machines to content-based image retrieval with relevance feedback. In: 2000 International Conference on Image Processing, vol. 3, pp. 750–753 (2000)
32.
Zurück zum Zitat Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: 10th European Conference on Machine Learning (1999) Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: 10th European Conference on Machine Learning (1999)
33.
Zurück zum Zitat Kirby, M., Sirovich, L.: Application of the Karhunen-Loeve procedure for the characterization of human faces. IEEE Trans. Pattern Anal. Mach. Intell. 12(1), 103–108 (1990) Kirby, M., Sirovich, L.: Application of the Karhunen-Loeve procedure for the characterization of human faces. IEEE Trans. Pattern Anal. Mach. Intell. 12(1), 103–108 (1990)
34.
Zurück zum Zitat Kittler, J., Hatef, M., Duin, R., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)CrossRef Kittler, J., Hatef, M., Duin, R., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)CrossRef
35.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105 (2012)
36.
Zurück zum Zitat Lee, Y., Mangasarian, O.: RSVM: Reduced support vector machines. In: The First SIAM International Conference on Data Mining (2001) Lee, Y., Mangasarian, O.: RSVM: Reduced support vector machines. In: The First SIAM International Conference on Data Mining (2001)
37.
Zurück zum Zitat Li, J., Allinson, N., Tao, D., Li, X.: Multitraining support vector machine for image retrieval. IEEE Trans. Image Process. 15(11), 3597–3601 (2006)CrossRef Li, J., Allinson, N., Tao, D., Li, X.: Multitraining support vector machine for image retrieval. IEEE Trans. Image Process. 15(11), 3597–3601 (2006)CrossRef
38.
Zurück zum Zitat Lin, K., Lin, C.: A study on reduced support vector machine. IEEE Trans. Neural Netw. 14(6), 1449–1559 (2003)CrossRef Lin, K., Lin, C.: A study on reduced support vector machine. IEEE Trans. Neural Netw. 14(6), 1449–1559 (2003)CrossRef
39.
Zurück zum Zitat Liu, C.: Gabor-based kernel PCA with fractional power polynomial models for face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 572–581 (2004) Liu, C.: Gabor-based kernel PCA with fractional power polynomial models for face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 572–581 (2004)
40.
Zurück zum Zitat Liu, C.: Capitalize on dimensionality increasing techniques for improving face recognition grand challenge performance. IEEE Trans. Pattern Anal. Mach. Intell. 28(5), 725–737 (2006) Liu, C.: Capitalize on dimensionality increasing techniques for improving face recognition grand challenge performance. IEEE Trans. Pattern Anal. Mach. Intell. 28(5), 725–737 (2006)
41.
Zurück zum Zitat Liu, C.: The Bayes decision rule induced similarity measures. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1086–1090 (2007) Liu, C.: The Bayes decision rule induced similarity measures. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1086–1090 (2007)
42.
Zurück zum Zitat Liu, C.: Clarification of assumptions in the relationship between the bayes decision rule and the whitened cosine similarity measure. IEEE Trans. Pattern Anal. Mach. Intell. 30(6), 1116–1117 (2008) Liu, C.: Clarification of assumptions in the relationship between the bayes decision rule and the whitened cosine similarity measure. IEEE Trans. Pattern Anal. Mach. Intell. 30(6), 1116–1117 (2008)
43.
Zurück zum Zitat Liu, C.: Effective use of color information for large scale face verification. Neurocomputing 101, 43–51 (2013)CrossRef Liu, C.: Effective use of color information for large scale face verification. Neurocomputing 101, 43–51 (2013)CrossRef
44.
Zurück zum Zitat Liu, C.: Discriminant analysis and similarity measure. Pattern Recognit. 47(1), 359–367 (2014)CrossRef Liu, C.: Discriminant analysis and similarity measure. Pattern Recognit. 47(1), 359–367 (2014)CrossRef
45.
Zurück zum Zitat Liu, Z., Liu, C.: Fusion of color, local spatial and global frequency information for face recognition. Pattern Recognit. 43(8), 2882–2890 (2010)CrossRefMATH Liu, Z., Liu, C.: Fusion of color, local spatial and global frequency information for face recognition. Pattern Recognit. 43(8), 2882–2890 (2010)CrossRefMATH
46.
Zurück zum Zitat Liu, C., Mago, V. (eds.): Cross Disciplinary Biometric Systems. Springer, New York (2012) Liu, C., Mago, V. (eds.): Cross Disciplinary Biometric Systems. Springer, New York (2012)
47.
Zurück zum Zitat Liu, X., Chen, W., Yuen, P., Feng, G.: Learning kernel in kernel-based LDA for face recognition under illumination variations. IEEE Signal Process. Lett. 16(12), 1019–1022 (2009)CrossRef Liu, X., Chen, W., Yuen, P., Feng, G.: Learning kernel in kernel-based LDA for face recognition under illumination variations. IEEE Signal Process. Lett. 16(12), 1019–1022 (2009)CrossRef
48.
Zurück zum Zitat Ma, C., Randolph, M., Drish, J.: A support vector machines-based rejection technique for speech recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 381–384 (2001) Ma, C., Randolph, M., Drish, J.: A support vector machines-based rejection technique for speech recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 381–384 (2001)
49.
Zurück zum Zitat Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30 (2013) Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30 (2013)
50.
Zurück zum Zitat Mika, S., Ratsch, G., Weston, J., Scholkopf, B., Mller, K.R.: Fisher discriminant analysis with kernels. In: Hu, Y.H., Larsen, J., Wilson, E., Douglas, S. (eds.) Neural Networks for Signal Processing IX, pp. 41–48. IEEE (1999) Mika, S., Ratsch, G., Weston, J., Scholkopf, B., Mller, K.R.: Fisher discriminant analysis with kernels. In: Hu, Y.H., Larsen, J., Wilson, E., Douglas, S. (eds.) Neural Networks for Signal Processing IX, pp. 41–48. IEEE (1999)
51.
Zurück zum Zitat Moghaddam, B.: Principal manifolds and probabilistic subspaces for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 24(6), 780–788 (2002) Moghaddam, B.: Principal manifolds and probabilistic subspaces for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 24(6), 780–788 (2002)
52.
Zurück zum Zitat Nagaraja, G., Murthy, S.R., Deepak, T.: Content based video retrieval using support vector machine classification. In: 2015 IEEE International Conference on Applied and Theoretical Computing and Communication Technology, pp. 821–827 (2015) Nagaraja, G., Murthy, S.R., Deepak, T.: Content based video retrieval using support vector machine classification. In: 2015 IEEE International Conference on Applied and Theoretical Computing and Communication Technology, pp. 821–827 (2015)
53.
Zurück zum Zitat Nakajima, C., Pontil, M., Poggio, T.: People recognition and pose estimation in image sequences. In: IEEE International Joint Conference on Neural Networks, vol. 4, pp. 189–194 (2000) Nakajima, C., Pontil, M., Poggio, T.: People recognition and pose estimation in image sequences. In: IEEE International Joint Conference on Neural Networks, vol. 4, pp. 189–194 (2000)
54.
Zurück zum Zitat Nguyen, D., Ho, T.: An efficient method for simplifying support vector machines. In: International Conference on Machine Learning, Bonn, Germany (2005) Nguyen, D., Ho, T.: An efficient method for simplifying support vector machines. In: International Conference on Machine Learning, Bonn, Germany (2005)
55.
Zurück zum Zitat OToole, A.J., Phillips, P.J., Jiang, F., Ayyad, J., Penard, N., Abdi, H.: Face recognition algorithms surpass humans matching faces across changes in illumination. IEEE Trans. Pattern Anal. Mach. Intell. 29(9), 1642–1646 (2007) OToole, A.J., Phillips, P.J., Jiang, F., Ayyad, J., Penard, N., Abdi, H.: Face recognition algorithms surpass humans matching faces across changes in illumination. IEEE Trans. Pattern Anal. Mach. Intell. 29(9), 1642–1646 (2007)
56.
Zurück zum Zitat Pekalska, E., Haasdonk, B.: Kernel discriminant analysis for positive definite and indefinite kernels. IEEE Trans. Pattern Anal. Mach. Intell. 31(6), 1017–1032 (2009) Pekalska, E., Haasdonk, B.: Kernel discriminant analysis for positive definite and indefinite kernels. IEEE Trans. Pattern Anal. Mach. Intell. 31(6), 1017–1032 (2009)
57.
Zurück zum Zitat Ranzato, M.A., Szummer, M.: Semi-supervised learning of compact document representations with deep networks. In: Proceedings of the 25th International Conference on Machine Learning, ICML ’08, pp. 792–799 (2008) Ranzato, M.A., Szummer, M.: Semi-supervised learning of compact document representations with deep networks. In: Proceedings of the 25th International Conference on Machine Learning, ICML ’08, pp. 792–799 (2008)
58.
Zurück zum Zitat Romdhani, S., Torr, B., Scholkopf, B., Blake, A.: Computationally efficient face detection. In: IEEE International Conference on Computer Vision (2001) Romdhani, S., Torr, B., Scholkopf, B., Blake, A.: Computationally efficient face detection. In: IEEE International Conference on Computer Vision (2001)
59.
Zurück zum Zitat Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)CrossRef Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)CrossRef
60.
Zurück zum Zitat Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. (IJCV) 115(3), 211–252 (2015). doi:10.1007/s11263-015-0816-y MathSciNetCrossRef Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. (IJCV) 115(3), 211–252 (2015). doi:10.​1007/​s11263-015-0816-y MathSciNetCrossRef
61.
Zurück zum Zitat Salakhutdinov, R., Hinton, G.: An efficient learning procedure for deep boltzmann machines. Neural Comput. 24(8), 1967–2006 (2012)MathSciNetCrossRefMATH Salakhutdinov, R., Hinton, G.: An efficient learning procedure for deep boltzmann machines. Neural Comput. 24(8), 1967–2006 (2012)MathSciNetCrossRefMATH
62.
Zurück zum Zitat Santhiya, G., Singaravelan, S.: Multi-SVM for enhancing image search. Int. J. Sci. Eng. Res. 4(6) (2013) Santhiya, G., Singaravelan, S.: Multi-SVM for enhancing image search. Int. J. Sci. Eng. Res. 4(6) (2013)
63.
Zurück zum Zitat Scholkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridege (2002) Scholkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridege (2002)
64.
Zurück zum Zitat Scholkopf, B., Smola, A., Muller, K.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput. 10, 1299–1319 (1998)CrossRef Scholkopf, B., Smola, A., Muller, K.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput. 10, 1299–1319 (1998)CrossRef
65.
Zurück zum Zitat Scholkopf, B., Mika, S., Burges, C., Knirsch, P., Muller, K., Ratsch, G., Smola, A.: Input space versus feature space in kernel-based methods. IEEE Trans. Neural Netw. 10(5), 1000–1017 (1999)CrossRef Scholkopf, B., Mika, S., Burges, C., Knirsch, P., Muller, K., Ratsch, G., Smola, A.: Input space versus feature space in kernel-based methods. IEEE Trans. Neural Netw. 10(5), 1000–1017 (1999)CrossRef
66.
Zurück zum Zitat Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004) Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
67.
Zurück zum Zitat Shin, C., Kim, K., Park, M., Kim, H.: Support vector machine-based text detection in digital video. In: IEEE Workshop on Neural Networks for Signal Processing (2000) Shin, C., Kim, K., Park, M., Kim, H.: Support vector machine-based text detection in digital video. In: IEEE Workshop on Neural Networks for Signal Processing (2000)
68.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
69.
Zurück zum Zitat Swets, D.L., Weng, J.: Using discriminant eigenfeatures for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 831–836 (1996) Swets, D.L., Weng, J.: Using discriminant eigenfeatures for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 831–836 (1996)
70.
Zurück zum Zitat Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
71.
Zurück zum Zitat Tao, D., Tang, X., Li, X., Wu, X.: Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Trans. Pattern Anal. Mach. intell. 28(7), 1088–1099 (2006)CrossRef Tao, D., Tang, X., Li, X., Wu, X.: Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Trans. Pattern Anal. Mach. intell. 28(7), 1088–1099 (2006)CrossRef
72.
Zurück zum Zitat Tefas, A., Kotropoulos, C., Pitas, I.: Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication. IEEE Trans. Pattern Anal. Mach. Intell. 23(7), 735–746 (2001)CrossRef Tefas, A., Kotropoulos, C., Pitas, I.: Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication. IEEE Trans. Pattern Anal. Mach. Intell. 23(7), 735–746 (2001)CrossRef
73.
Zurück zum Zitat Teow, L., Loe, K.: Robust vision-based features and classification schemes for off-line handwritten digit recognition. Pattern Recognit. (2002) Teow, L., Loe, K.: Robust vision-based features and classification schemes for off-line handwritten digit recognition. Pattern Recognit. (2002)
74.
Zurück zum Zitat Thung, K., Paramesran, R., Lim, C.: Content-based image quality metric using similarity measure of moment vectors. Pattern Recognit. 45(6), 2193–2204 (2012)CrossRefMATH Thung, K., Paramesran, R., Lim, C.: Content-based image quality metric using similarity measure of moment vectors. Pattern Recognit. 45(6), 2193–2204 (2012)CrossRefMATH
75.
Zurück zum Zitat Tong, S., Chang, E.: Support vector machine active learning for image retrieval. In: the Ninth ACM International Conference on Multimedia, pp. 107–118 (2001) Tong, S., Chang, E.: Support vector machine active learning for image retrieval. In: the Ninth ACM International Conference on Multimedia, pp. 107–118 (2001)
76.
Zurück zum Zitat Turk, M., Pentland, A.: Eigenfaces for recognition. J. Cogn. Neurosci. 13(1), 71–86 (1991)CrossRef Turk, M., Pentland, A.: Eigenfaces for recognition. J. Cogn. Neurosci. 13(1), 71–86 (1991)CrossRef
77.
Zurück zum Zitat Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York, NY (1995) Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York, NY (1995)
78.
Zurück zum Zitat Vapnik, Y.N.: The Nature of Statistical Learning Theory, second edn. Springer, New York (2000) Vapnik, Y.N.: The Nature of Statistical Learning Theory, second edn. Springer, New York (2000)
79.
Zurück zum Zitat Varma, M., Babu, B.: More generality in efficient multiple kernel learning. In: Proceedings of the International Conference on Machine Learning, Montreal, Canada (2009) Varma, M., Babu, B.: More generality in efficient multiple kernel learning. In: Proceedings of the International Conference on Machine Learning, Montreal, Canada (2009)
80.
Zurück zum Zitat Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proceedings of the International Conference on Computer Vision, Kyoto, Japan (2009) Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proceedings of the International Conference on Computer Vision, Kyoto, Japan (2009)
81.
Zurück zum Zitat Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.A.: Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11, 3371–3408 (2010)MathSciNetMATH Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.A.: Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11, 3371–3408 (2010)MathSciNetMATH
82.
Zurück zum Zitat Wang, Z., Chen, S., Sun, T.: Multik-MHKS: a novel multiple kernel learning algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 30(2), 348–353 (2008) Wang, Z., Chen, S., Sun, T.: Multik-MHKS: a novel multiple kernel learning algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 30(2), 348–353 (2008)
84.
Zurück zum Zitat Xie, C., Kumar, V.: Comparison of kernel class-dependence feature analysis (KCFA) with kernel discriminant analysis (KDA) for face recognition. In: Proceedings of IEEE on Biometrics: Theory, Application and Systems (2007) Xie, C., Kumar, V.: Comparison of kernel class-dependence feature analysis (KCFA) with kernel discriminant analysis (KDA) for face recognition. In: Proceedings of IEEE on Biometrics: Theory, Application and Systems (2007)
85.
Zurück zum Zitat Yang, M.H., Ahuja, N., Kriegman, D.: Face recognition using kernel Eigenfaces. In: Proc. IEEE International Conference on Image Processing, Vancouver, Canada (2000) Yang, M.H., Ahuja, N., Kriegman, D.: Face recognition using kernel Eigenfaces. In: Proc. IEEE International Conference on Image Processing, Vancouver, Canada (2000)
86.
Zurück zum Zitat Yang, J., Yan, R., Hauptmann, A.G.: Cross-domain video concept detection using adaptive svms. In: 15th ACM International Conference on Multimedia, pp. 188–197 (2007) Yang, J., Yan, R., Hauptmann, A.G.: Cross-domain video concept detection using adaptive svms. In: 15th ACM International Conference on Multimedia, pp. 188–197 (2007)
87.
Zurück zum Zitat Yazdi, H.S., Javidi, M., Pourreza, H.R.: SVM-based relevance feedback for semantic video retrieval. Int. J. Signal Imaging Syst. Eng. 2(3), 99–108 (2009)CrossRef Yazdi, H.S., Javidi, M., Pourreza, H.R.: SVM-based relevance feedback for semantic video retrieval. Int. J. Signal Imaging Syst. Eng. 2(3), 99–108 (2009)CrossRef
88.
Zurück zum Zitat Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818–833. Springer, New York (2014) Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818–833. Springer, New York (2014)
89.
Zurück zum Zitat Zhang, L., Lin, F., Zhang, B.: Support vector machine learning for image retrieval. In: 2001 International Conference on Image Processing, vol. 2, pp. 721–724 (2001) Zhang, L., Lin, F., Zhang, B.: Support vector machine learning for image retrieval. In: 2001 International Conference on Image Processing, vol. 2, pp. 721–724 (2001)
Metadaten
Titel
Learning and Recognition Methods for Image Search and Video Retrieval
verfasst von
Ajit Puthenputhussery
Shuo Chen
Joyoung Lee
Lazar Spasovic
Chengjun Liu
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-52081-0_2