nach oben

World Wide Web

Erschienen in:

24.05.2018

A novel image retrieval algorithm based on transfer learning and fusion features

verfasst von: Ying Liu, Yanan Peng, Kengpang Lim, Nam Ling

Erschienen in: World Wide Web | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

With proliferation of social media, image has become ubiquitous giving rise to the demand and importance of image semantic analysis and retrieval to access information quickly on social media. However, even with humongous information available, there are certain categories of images which are important for certain applications but are very scarce. Convolutional neural network is an effective method to extract high-level semantic features for image database retrieval. To overcome the problem of over-fitting when the number of training samples in dataset is limited, this paper proposes an image database retrieval algorithm based on the framework of transfer learning and feature fusion. Based on the fine-tuning of the pre-trained Convolutional Neural Network (CNN), the proposed algorithm first extracts the semantic features of the images. Principal Component Analysis (PCA) is then applied for dimension reduction and to reduce the computational complexity. Last, the semantic feature extracted from the CNN is fused with traditional low-level visual feature to improve the retrieval accuracy further. Experimental results demonstrated the effectiveness of the proposed method for image database retrieval.

Vorheriger Artikel Discovering negative comments by sentiment analysis on web forum

Nächster Artikel Multimodal deep representation learning for video classification

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Babenko, A., Slesarev, A., Chigorin, A., et al.: Neural codes for image retrieval [C]. European Conference on Computer Vision. 584–599 (2014)

Bakar, S.A., Hitam, M.S.: Content-based image retrieval using SIFT for binary and greyscale images[C]. 2013 IEEE Int. Conf. Signal Image Process. Appl. 83–88, (2013)

Bengio, Y.: Deep learning of representations for unsupervised and transfer learning[C]. Proceeding of ICML Workshop on Unsupervised and Transfer Learning. (2012)

Cao, L., Wang, F.: Robust latent semantic exploration for image retrieval in social media[J]. Neurocomputing. 169, 2 December, 180–184 (2015)CrossRef

Carvajal, J.A., Romero, D.G.: Fine-tuning based deep convolutional networks for lepidopterous genus recognition [J]. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications(CIARP), pp. 467–475, (2016)

Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets[C]. In: British Machine Vision Conference. (2014)

Chitrakar, P., Zhang, C., Warner, G., Liao, X.: Social media image retrieval using distilled convolutional neural network for suspicious e-crime and terrorist account detection[C]. 2016 IEEE International Symposium on Multimedia (ISM), pp. In: 493–498 (2016)

Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification[C]. IEEE Conf. Comput. Vis. Pattern Recognit. 3642–3649 (2012)

Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., Li, F. F.: ImageNet: a large-scale hierarchical image database[C]. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, 248–255, (2009)

10.

Donahue, J., Jia, Y., Vinyals, O., et al.: DeCAF: a deep convolutional activation feature for generic visual recognition[J]. Proceeding of the 31st International Conference on Maching Learning, PMLR. 32(1), 647–655 (2014)

11.

Hao, J., Dong, J.: What is the best practice for CNNs applied to visual instance retrieval?[C]. International Conference on Learning Representations (ICLR). (2016)

12.

He, K.M., Zhang, X.Y., Ren, S.Q., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015)CrossRef

13.

Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks[J]. Science. 313(5786), 504–507 (2006)MathSciNetCrossRefMATH

14.

Kalantidis, Y., Mellina, C., Osindero, S.: Cross-dimensional weighting for aggregated deep convolutional features[J]. European Conference on Computer Vision (ECCV). (2016)

15.

Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks[C]. Proceeding of the 25th International Conference on Neural Information Processing Systems. 1097–1105 (2012)

16.

Kusamura, Y., Kozawa, Y.: GPU acceleration of content-based image retrieval based on SIFT descriptors[J]. International Conference on Network-Based Information Systems. (2016)

17.

LeCun, Y., Jackel, L., Bottou, L., Cortes, C., Denker, J.S., Drucker, H., et al.: Learning algorithms for classification: a comparison on handwritten digit recognition. Neural networks: The statistical mechanics perspective[M]. 261–276 (1995)

18.

Liu, G.-H., Yang, J.-Y., et al.: Content-based image retrieval using computational visual attention model[J]. Pattern Recogn. 48, 2554–2566 (2015)MathSciNetCrossRef

19.

Liu, Y., Hu, D., Fan, J., Wang, F., Zhang, D.: Multi-feature fusion for crime scene investigation image database retrieval. [C]. International Conference on Digital Image Computing: Techniques and Applications (DICTA), Sydney, Australia, Nov.30-Dec. 2, (2017)

20.

Lu, X., Duan, X., Mao, X., Li, Y., Zhang, X.: Feature extraction and fusion using deep convolutional neural networks for face detection[J]. Math. Probl. Eng. 1376726 (2017, 2017)

21.

Oliva, A., Torralba, A.: Modeling the shape of the scence: a holistic representation of the spatial envelope[J]. Int. J. Comput. Vis. 42, 145–175 (2001)CrossRefMATH

22.

Pan, S.J., Yang, Q.: A survey on transfer learning[J]. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef

23.

Parida, P., Bhoi, N.: 2-D Gabor filter based transition region extraction and morphological operation for image segmentation[J]. Comput. Electr. Eng. 62, 119–134 (2017)CrossRef

24.

Shankar, T., Yamuna, G., Suman, G.: Segmentation of natural colour image based on colour-texture features[C]. 2013 International Conference on Communication and Signal Processing, pp. 455–459, (2013)

25.

Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions[C]. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1–9 (2015)

26.

Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification[C]. Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1701–1708 (2014)

27.

Tolias, G.: Particular object retrieval with integral max-pooling of CNN activations [C]. International Conference on Learning Representations (ICLR). (2016)

28.

Tolias, G., Sicre, R., Jégou, H.: Particular object retrieval with integral max-pooling of CNN activations[C]. In: Proceeding of the 4th International Conference on Learning Representations (ICLR) (2016)

29.

Vedaldi, A., Lenc, K: Matconvnet: Convolutional neural networks for matlab[C]. Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, pp: 689–692, (2015)

30.

Zhang, J., Yoo, C.-W., Ha, S.-W.: ROI based natural image retrieval using color and texture feature[C]. Fourth International Conference on Fuzzy Systems and Knowledge Discovery. 4, 740–744 (2007)

31.

Zheng, L., Wang, S., Liu, Z., Qi, T.: Packing and padding: coupled multi-index for accurate image retrieval [C]. IEEE Conf. Comput. Vis. Pattern Recognit. (2014)

32.

Zheng, L., Wang, S., Zhou, W., and Qi, T.: Bayes merging of multiple vocabularies for scalable image retrieval [C]. IEEE Conf. Comput. Vis. Pattern Recognit. (2014)

33.

Zheng, L., Wang, S., Tian, Q.: "Coupled binary embedding for large-scale image retrieval"[J]. IEEE Trans. Image Process. 23(8), 3368–3380 (June, 2014)MathSciNetCrossRefMATH

Titel: A novel image retrieval algorithm based on transfer learning and fusion features
verfasst von: Ying Liu
Yanan Peng
Kengpang Lim
Nam Ling
Publikationsdatum: 24.05.2018
Verlag: Springer US
Erschienen in: World Wide Web / Ausgabe 3/2019
Print ISSN: 1386-145X
Elektronische ISSN: 1573-1413
DOI: https://doi.org/10.1007/s11280-018-0585-y

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 3/2019

Providing personalized learning guidance in MOOCs by multi-source data analysis

Embedding and predicting the event at early stage

On successive point-of-interest recommendation

Self-feeding frequency estimation and eating action recognition from skeletal representation using Kinect

Guest editorial: special issue on social media and interactive technologies

A novel algorithm for mining opinion leaders in social networks