Skip to main content
Erschienen in: World Wide Web 3/2019

24.05.2018

A novel image retrieval algorithm based on transfer learning and fusion features

verfasst von: Ying Liu, Yanan Peng, Kengpang Lim, Nam Ling

Erschienen in: World Wide Web | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With proliferation of social media, image has become ubiquitous giving rise to the demand and importance of image semantic analysis and retrieval to access information quickly on social media. However, even with humongous information available, there are certain categories of images which are important for certain applications but are very scarce. Convolutional neural network is an effective method to extract high-level semantic features for image database retrieval. To overcome the problem of over-fitting when the number of training samples in dataset is limited, this paper proposes an image database retrieval algorithm based on the framework of transfer learning and feature fusion. Based on the fine-tuning of the pre-trained Convolutional Neural Network (CNN), the proposed algorithm first extracts the semantic features of the images. Principal Component Analysis (PCA) is then applied for dimension reduction and to reduce the computational complexity. Last, the semantic feature extracted from the CNN is fused with traditional low-level visual feature to improve the retrieval accuracy further. Experimental results demonstrated the effectiveness of the proposed method for image database retrieval.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Babenko, A., Slesarev, A., Chigorin, A., et al.: Neural codes for image retrieval [C]. European Conference on Computer Vision. 584–599 (2014) Babenko, A., Slesarev, A., Chigorin, A., et al.: Neural codes for image retrieval [C]. European Conference on Computer Vision. 584–599 (2014)
2.
Zurück zum Zitat Bakar, S.A., Hitam, M.S.: Content-based image retrieval using SIFT for binary and greyscale images[C]. 2013 IEEE Int. Conf. Signal Image Process. Appl. 83–88, (2013) Bakar, S.A., Hitam, M.S.: Content-based image retrieval using SIFT for binary and greyscale images[C]. 2013 IEEE Int. Conf. Signal Image Process. Appl. 83–88, (2013)
3.
Zurück zum Zitat Bengio, Y.: Deep learning of representations for unsupervised and transfer learning[C]. Proceeding of ICML Workshop on Unsupervised and Transfer Learning. (2012) Bengio, Y.: Deep learning of representations for unsupervised and transfer learning[C]. Proceeding of ICML Workshop on Unsupervised and Transfer Learning. (2012)
4.
Zurück zum Zitat Cao, L., Wang, F.: Robust latent semantic exploration for image retrieval in social media[J]. Neurocomputing. 169, 2 December, 180–184 (2015)CrossRef Cao, L., Wang, F.: Robust latent semantic exploration for image retrieval in social media[J]. Neurocomputing. 169, 2 December, 180–184 (2015)CrossRef
5.
Zurück zum Zitat Carvajal, J.A., Romero, D.G.: Fine-tuning based deep convolutional networks for lepidopterous genus recognition [J]. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications(CIARP), pp. 467–475, (2016) Carvajal, J.A., Romero, D.G.: Fine-tuning based deep convolutional networks for lepidopterous genus recognition [J]. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications(CIARP), pp. 467–475, (2016)
6.
Zurück zum Zitat Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets[C]. In: British Machine Vision Conference. (2014) Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets[C]. In: British Machine Vision Conference. (2014)
7.
Zurück zum Zitat Chitrakar, P., Zhang, C., Warner, G., Liao, X.: Social media image retrieval using distilled convolutional neural network for suspicious e-crime and terrorist account detection[C]. 2016 IEEE International Symposium on Multimedia (ISM), pp. In: 493–498 (2016) Chitrakar, P., Zhang, C., Warner, G., Liao, X.: Social media image retrieval using distilled convolutional neural network for suspicious e-crime and terrorist account detection[C]. 2016 IEEE International Symposium on Multimedia (ISM), pp. In: 493–498 (2016)
8.
Zurück zum Zitat Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification[C]. IEEE Conf. Comput. Vis. Pattern Recognit. 3642–3649 (2012) Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification[C]. IEEE Conf. Comput. Vis. Pattern Recognit. 3642–3649 (2012)
9.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., Li, F. F.: ImageNet: a large-scale hierarchical image database[C]. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, 248–255, (2009) Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., Li, F. F.: ImageNet: a large-scale hierarchical image database[C]. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, 248–255, (2009)
10.
Zurück zum Zitat Donahue, J., Jia, Y., Vinyals, O., et al.: DeCAF: a deep convolutional activation feature for generic visual recognition[J]. Proceeding of the 31st International Conference on Maching Learning, PMLR. 32(1), 647–655 (2014) Donahue, J., Jia, Y., Vinyals, O., et al.: DeCAF: a deep convolutional activation feature for generic visual recognition[J]. Proceeding of the 31st International Conference on Maching Learning, PMLR. 32(1), 647–655 (2014)
11.
Zurück zum Zitat Hao, J., Dong, J.: What is the best practice for CNNs applied to visual instance retrieval?[C]. International Conference on Learning Representations (ICLR). (2016) Hao, J., Dong, J.: What is the best practice for CNNs applied to visual instance retrieval?[C]. International Conference on Learning Representations (ICLR). (2016)
12.
Zurück zum Zitat He, K.M., Zhang, X.Y., Ren, S.Q., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015)CrossRef He, K.M., Zhang, X.Y., Ren, S.Q., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015)CrossRef
13.
Zurück zum Zitat Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks[J]. Science. 313(5786), 504–507 (2006)MathSciNetCrossRefMATH Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks[J]. Science. 313(5786), 504–507 (2006)MathSciNetCrossRefMATH
14.
Zurück zum Zitat Kalantidis, Y., Mellina, C., Osindero, S.: Cross-dimensional weighting for aggregated deep convolutional features[J]. European Conference on Computer Vision (ECCV). (2016) Kalantidis, Y., Mellina, C., Osindero, S.: Cross-dimensional weighting for aggregated deep convolutional features[J]. European Conference on Computer Vision (ECCV). (2016)
15.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks[C]. Proceeding of the 25th International Conference on Neural Information Processing Systems. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks[C]. Proceeding of the 25th International Conference on Neural Information Processing Systems. 1097–1105 (2012)
16.
Zurück zum Zitat Kusamura, Y., Kozawa, Y.: GPU acceleration of content-based image retrieval based on SIFT descriptors[J]. International Conference on Network-Based Information Systems. (2016) Kusamura, Y., Kozawa, Y.: GPU acceleration of content-based image retrieval based on SIFT descriptors[J]. International Conference on Network-Based Information Systems. (2016)
17.
Zurück zum Zitat LeCun, Y., Jackel, L., Bottou, L., Cortes, C., Denker, J.S., Drucker, H., et al.: Learning algorithms for classification: a comparison on handwritten digit recognition. Neural networks: The statistical mechanics perspective[M]. 261–276 (1995) LeCun, Y., Jackel, L., Bottou, L., Cortes, C., Denker, J.S., Drucker, H., et al.: Learning algorithms for classification: a comparison on handwritten digit recognition. Neural networks: The statistical mechanics perspective[M]. 261–276 (1995)
18.
Zurück zum Zitat Liu, G.-H., Yang, J.-Y., et al.: Content-based image retrieval using computational visual attention model[J]. Pattern Recogn. 48, 2554–2566 (2015)MathSciNetCrossRef Liu, G.-H., Yang, J.-Y., et al.: Content-based image retrieval using computational visual attention model[J]. Pattern Recogn. 48, 2554–2566 (2015)MathSciNetCrossRef
19.
Zurück zum Zitat Liu, Y., Hu, D., Fan, J., Wang, F., Zhang, D.: Multi-feature fusion for crime scene investigation image database retrieval. [C]. International Conference on Digital Image Computing: Techniques and Applications (DICTA), Sydney, Australia, Nov.30-Dec. 2, (2017) Liu, Y., Hu, D., Fan, J., Wang, F., Zhang, D.: Multi-feature fusion for crime scene investigation image database retrieval. [C]. International Conference on Digital Image Computing: Techniques and Applications (DICTA), Sydney, Australia, Nov.30-Dec. 2, (2017)
20.
Zurück zum Zitat Lu, X., Duan, X., Mao, X., Li, Y., Zhang, X.: Feature extraction and fusion using deep convolutional neural networks for face detection[J]. Math. Probl. Eng. 1376726 (2017, 2017) Lu, X., Duan, X., Mao, X., Li, Y., Zhang, X.: Feature extraction and fusion using deep convolutional neural networks for face detection[J]. Math. Probl. Eng. 1376726 (2017, 2017)
21.
Zurück zum Zitat Oliva, A., Torralba, A.: Modeling the shape of the scence: a holistic representation of the spatial envelope[J]. Int. J. Comput. Vis. 42, 145–175 (2001)CrossRefMATH Oliva, A., Torralba, A.: Modeling the shape of the scence: a holistic representation of the spatial envelope[J]. Int. J. Comput. Vis. 42, 145–175 (2001)CrossRefMATH
22.
Zurück zum Zitat Pan, S.J., Yang, Q.: A survey on transfer learning[J]. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef Pan, S.J., Yang, Q.: A survey on transfer learning[J]. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef
23.
Zurück zum Zitat Parida, P., Bhoi, N.: 2-D Gabor filter based transition region extraction and morphological operation for image segmentation[J]. Comput. Electr. Eng. 62, 119–134 (2017)CrossRef Parida, P., Bhoi, N.: 2-D Gabor filter based transition region extraction and morphological operation for image segmentation[J]. Comput. Electr. Eng. 62, 119–134 (2017)CrossRef
24.
Zurück zum Zitat Shankar, T., Yamuna, G., Suman, G.: Segmentation of natural colour image based on colour-texture features[C]. 2013 International Conference on Communication and Signal Processing, pp. 455–459, (2013) Shankar, T., Yamuna, G., Suman, G.: Segmentation of natural colour image based on colour-texture features[C]. 2013 International Conference on Communication and Signal Processing, pp. 455–459, (2013)
25.
Zurück zum Zitat Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions[C]. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions[C]. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1–9 (2015)
26.
Zurück zum Zitat Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification[C]. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1701–1708 (2014) Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification[C]. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1701–1708 (2014)
27.
Zurück zum Zitat Tolias, G.: Particular object retrieval with integral max-pooling of CNN activations [C]. International Conference on Learning Representations (ICLR). (2016) Tolias, G.: Particular object retrieval with integral max-pooling of CNN activations [C]. International Conference on Learning Representations (ICLR). (2016)
28.
Zurück zum Zitat Tolias, G., Sicre, R., Jégou, H.: Particular object retrieval with integral max-pooling of CNN activations[C]. In: Proceeding of the 4th International Conference on Learning Representations (ICLR) (2016) Tolias, G., Sicre, R., Jégou, H.: Particular object retrieval with integral max-pooling of CNN activations[C]. In: Proceeding of the 4th International Conference on Learning Representations (ICLR) (2016)
29.
Zurück zum Zitat Vedaldi, A., Lenc, K: Matconvnet: Convolutional neural networks for matlab[C]. Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, pp: 689–692, (2015) Vedaldi, A., Lenc, K: Matconvnet: Convolutional neural networks for matlab[C]. Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, pp: 689–692, (2015)
30.
Zurück zum Zitat Zhang, J., Yoo, C.-W., Ha, S.-W.: ROI based natural image retrieval using color and texture feature[C]. Fourth International Conference on Fuzzy Systems and Knowledge Discovery. 4, 740–744 (2007) Zhang, J., Yoo, C.-W., Ha, S.-W.: ROI based natural image retrieval using color and texture feature[C]. Fourth International Conference on Fuzzy Systems and Knowledge Discovery. 4, 740–744 (2007)
31.
Zurück zum Zitat Zheng, L., Wang, S., Liu, Z., Qi, T.: Packing and padding: coupled multi-index for accurate image retrieval [C]. IEEE Conf. Comput. Vis. Pattern Recognit. (2014) Zheng, L., Wang, S., Liu, Z., Qi, T.: Packing and padding: coupled multi-index for accurate image retrieval [C]. IEEE Conf. Comput. Vis. Pattern Recognit. (2014)
32.
Zurück zum Zitat Zheng, L., Wang, S., Zhou, W., and Qi, T.: Bayes merging of multiple vocabularies for scalable image retrieval [C]. IEEE Conf. Comput. Vis. Pattern Recognit. (2014) Zheng, L., Wang, S., Zhou, W., and Qi, T.: Bayes merging of multiple vocabularies for scalable image retrieval [C]. IEEE Conf. Comput. Vis. Pattern Recognit. (2014)
33.
Zurück zum Zitat Zheng, L., Wang, S., Tian, Q.: "Coupled binary embedding for large-scale image retrieval"[J]. IEEE Trans. Image Process. 23(8), 3368–3380 (June, 2014)MathSciNetCrossRefMATH Zheng, L., Wang, S., Tian, Q.: "Coupled binary embedding for large-scale image retrieval"[J]. IEEE Trans. Image Process. 23(8), 3368–3380 (June, 2014)MathSciNetCrossRefMATH
Metadaten
Titel
A novel image retrieval algorithm based on transfer learning and fusion features
verfasst von
Ying Liu
Yanan Peng
Kengpang Lim
Nam Ling
Publikationsdatum
24.05.2018
Verlag
Springer US
Erschienen in
World Wide Web / Ausgabe 3/2019
Print ISSN: 1386-145X
Elektronische ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-018-0585-y

Weitere Artikel der Ausgabe 3/2019

World Wide Web 3/2019 Zur Ausgabe