Skip to main content
Erschienen in: Neural Computing and Applications 8/2019

19.01.2018 | Original Article

Tourism scene classification based on multi-stage transfer learning model

verfasst von: Tangquan Qi, Yong Xu, Haibin Ling

Erschienen in: Neural Computing and Applications | Ausgabe 8/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the past, many researchers focus on scene classification in computer vision, because it is an important problem. Tourism scene classification, however, has not been paid attention to in the field of computer vision. In this paper, we introduce a new scenic-spots-centric database called tourism scene, which consists of 25 tourism scenic areas with 750 tourism scene categories, about 440 thousand labeled images. For tourism scene classification, we propose a multi-stage transfer learning model with category hierarchical structure and use convolutional neural networks (e.g., AlexNet) as basic building block. To demonstrate the effectiveness of our proposed model, we also propose a baseline model and one-stage transfer learning model. From the results, we observe that our proposed framework achieves new bounds for performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Xiao, J, Hays, J, Ehinger, KA, Oliva, A, Torralba, A (2010) Sun database: large-scale scene recognition from abbey to zoo. In: IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 3485–3492 Xiao, J, Hays, J, Ehinger, KA, Oliva, A, Torralba, A (2010) Sun database: large-scale scene recognition from abbey to zoo. In: IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 3485–3492
2.
Zurück zum Zitat Patterson, G, Hays, J (2012) Sun attribute database: discovering, annotating, and recognizing scene attributes. In: IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 2751–2758 Patterson, G, Hays, J (2012) Sun attribute database: discovering, annotating, and recognizing scene attributes. In: IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 2751–2758
3.
Zurück zum Zitat Quattoni, A, Torralba, A (2009) Recognizing indoor scenes. In: IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 413–420 Quattoni, A, Torralba, A (2009) Recognizing indoor scenes. In: IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 413–420
4.
Zurück zum Zitat Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer society conference on computer vision and pattern recognition (CVPR), IEEE, pp 324–241 Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer society conference on computer vision and pattern recognition (CVPR), IEEE, pp 324–241
5.
Zurück zum Zitat Zhou, B, Lapedrizaand, A, Xiao, J, Torralba, A, Oliva, A (2014) Learning deep features for scene recognition using places database. In: Advances in neural information processing systems (NIPS), pp 487–495 Zhou, B, Lapedrizaand, A, Xiao, J, Torralba, A, Oliva, A (2014) Learning deep features for scene recognition using places database. In: Advances in neural information processing systems (NIPS), pp 487–495
7.
Zurück zum Zitat Serre Thomas, Kreiman Gabriel, Kouh Minjoon, Cadieu Charles, Knoblich Ulf, Poggio Tomaso (2007) A quantitative theory of immediate visual recognition. Prog Brain Res 165:33–56CrossRef Serre Thomas, Kreiman Gabriel, Kouh Minjoon, Cadieu Charles, Knoblich Ulf, Poggio Tomaso (2007) A quantitative theory of immediate visual recognition. Prog Brain Res 165:33–56CrossRef
8.
Zurück zum Zitat LeCun Yann, Boser Bernhard, Denker John S, Henderson Donnie, Howard Richard E, Hubbard Wayne, Jackel Lawrence D (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef LeCun Yann, Boser Bernhard, Denker John S, Henderson Donnie, Howard Richard E, Hubbard Wayne, Jackel Lawrence D (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef
9.
Zurück zum Zitat Krizhevsky, A, Sutskever, I, Hinton, GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems (NIPS), pp 1097–1105 Krizhevsky, A, Sutskever, I, Hinton, GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems (NIPS), pp 1097–1105
10.
11.
Zurück zum Zitat Zerler, MD, Fergus, R (2013) Stochastic pooling for regularization of deep convolutional neural networks. arXiv preprint arXiv:1301.3557 Zerler, MD, Fergus, R (2013) Stochastic pooling for regularization of deep convolutional neural networks. arXiv preprint arXiv:​1301.​3557
12.
Zurück zum Zitat Girshick, R, Donahue, J, Darrell, T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 580–587 Girshick, R, Donahue, J, Darrell, T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 580–587
13.
Zurück zum Zitat Farabet C, Couprie C, Najman L, LeCun Y (2013) Learning hierarchical features for scene labeling. Pattern Anal Mach Intell (PAMI) 35(8):1915–1929CrossRef Farabet C, Couprie C, Najman L, LeCun Y (2013) Learning hierarchical features for scene labeling. Pattern Anal Mach Intell (PAMI) 35(8):1915–1929CrossRef
14.
Zurück zum Zitat Tousch AM, Herbin S, Audibert JY (2012) Semantic hierarchies for image annotation: a survey. Pattern Recognit (PR) 45(1):333–345CrossRef Tousch AM, Herbin S, Audibert JY (2012) Semantic hierarchies for image annotation: a survey. Pattern Recognit (PR) 45(1):333–345CrossRef
15.
Zurück zum Zitat Bengio, S, Jason, W, David, G (2010) Label embedding trees for large multi-class tasks. In: Advances in neural information processing systems (NIPS), pp 163–171 Bengio, S, Jason, W, David, G (2010) Label embedding trees for large multi-class tasks. In: Advances in neural information processing systems (NIPS), pp 163–171
16.
Zurück zum Zitat Gao, T, Daphne, K (2011) Discriminative learning of relaxed hierarchy for large-scale visual recognition. In: International conference on computer vision (ICCV), IEEE, pp 2072–2079 Gao, T, Daphne, K (2011) Discriminative learning of relaxed hierarchy for large-scale visual recognition. In: International conference on computer vision (ICCV), IEEE, pp 2072–2079
17.
Zurück zum Zitat Jia, Y, Joshua, TA, Joseph, A, Thomas, G, Trevor, D (2013) Visual concept learning: Combining machine vision and bayesian generalization on concept hierarchies. In: Advances in neural information processing systems (NIPS), pp 1842–1850 Jia, Y, Joshua, TA, Joseph, A, Thomas, G, Trevor, D (2013) Visual concept learning: Combining machine vision and bayesian generalization on concept hierarchies. In: Advances in neural information processing systems (NIPS), pp 1842–1850
18.
Zurück zum Zitat Marszalek, M, Schmid, C (2007) Semantic hierarchies for visual object recognition. In Computer vision and pattern recognition (CVPR), IEEE, pp 1–7 Marszalek, M, Schmid, C (2007) Semantic hierarchies for visual object recognition. In Computer vision and pattern recognition (CVPR), IEEE, pp 1–7
19.
Zurück zum Zitat Nakul, V, Mahajan, D, Sellamanickam, S, Nair, V (2012) Learning hierarchical similarity metrics. In Computer vision and pattern recognition (CVPR), IEEE, pp 2280–2287 Nakul, V, Mahajan, D, Sellamanickam, S, Nair, V (2012) Learning hierarchical similarity metrics. In Computer vision and pattern recognition (CVPR), IEEE, pp 2280–2287
20.
Zurück zum Zitat Bannour, H, Céline, H (2012) Hierarchical image annotation using semantic hierarchies. In: lProceedings of the 21st ACM international conference on Information and knowledge management (IKM), ACM, pp 2431–2434 Bannour, H, Céline, H (2012) Hierarchical image annotation using semantic hierarchies. In: lProceedings of the 21st ACM international conference on Information and knowledge management (IKM), ACM, pp 2431–2434
21.
Zurück zum Zitat Li, L-J, Wang, C, Lim, Y, Blei , DM, Fei-Fei L (2010) Building and using a semantivisual image hierarchy. In Computer vision and pattern recognition (CVPR), IEEE, pp 3336–3343 Li, L-J, Wang, C, Lim, Y, Blei , DM, Fei-Fei L (2010) Building and using a semantivisual image hierarchy. In Computer vision and pattern recognition (CVPR), IEEE, pp 3336–3343
22.
Zurück zum Zitat Salakhutdinov, R, Torralba, A, Tenenbaum, J (2011) Learning to share visual appearance for multiclass object detection. In Computer vision and pattern recognition (CVPR), IEEE, pp 1481–1488 Salakhutdinov, R, Torralba, A, Tenenbaum, J (2011) Learning to share visual appearance for multiclass object detection. In Computer vision and pattern recognition (CVPR), IEEE, pp 1481–1488
23.
Zurück zum Zitat Deng, J, Jonathan, K, Alexander, CB, Fei-Fei L (2012) Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition. In: Computer vision and pattern recognition (CVPR), IEEE, pp 3450–3457 Deng, J, Jonathan, K, Alexander, CB, Fei-Fei L (2012) Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition. In: Computer vision and pattern recognition (CVPR), IEEE, pp 3450–3457
24.
Zurück zum Zitat Liu, B, Sadeghi, F, Tappen, M, Shamir, O, Liu, C (2013) Probabilistic label trees for efficient large scale image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 843–850 Liu, B, Sadeghi, F, Tappen, M, Shamir, O, Liu, C (2013) Probabilistic label trees for efficient large scale image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 843–850
25.
Zurück zum Zitat Srivastava, N, Salakhutdinov, R (2013) Discriminative transfer learning with tree-based priors. In Advances in neural information processing systems (NIPS), pp 2094–2102 Srivastava, N, Salakhutdinov, R (2013) Discriminative transfer learning with tree-based priors. In Advances in neural information processing systems (NIPS), pp 2094–2102
26.
Zurück zum Zitat Yan, Z, Zhang, H, Piramuthu, R, Jagadeesh, V, DeCoste, D, Di, W, Yu Y (2015) HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 2740–2748 Yan, Z, Zhang, H, Piramuthu, R, Jagadeesh, V, DeCoste, D, Di, W, Yu Y (2015) HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 2740–2748
27.
Zurück zum Zitat Ouyang, W, Wang, X, Zhang, C, Yang, X (2016) Factors in finetuning deep model for object detection with long-tail distribution. In: IEEE computer society conference on computer vision and pattern recognition (CVPR) Ouyang, W, Wang, X, Zhang, C, Yang, X (2016) Factors in finetuning deep model for object detection with long-tail distribution. In: IEEE computer society conference on computer vision and pattern recognition (CVPR)
28.
Zurück zum Zitat Xiao, T, Zhang, J, Yang, K, Peng, Y, Zhang, Z (2014) Error-driven incremental learning in deep convolutional neural network for large-scale image classification. In: Proceedings of the ACM international conference on multimedia (ICM), ACM, pp 177–186 Xiao, T, Zhang, J, Yang, K, Peng, Y, Zhang, Z (2014) Error-driven incremental learning in deep convolutional neural network for large-scale image classification. In: Proceedings of the ACM international conference on multimedia (ICM), ACM, pp 177–186
29.
Zurück zum Zitat Kim, S, Xing, EP (2010) Tree-guided group lasso for multi-task regression with structured sparsity. In: International conference on machine learning (ICML), pp 543–550 Kim, S, Xing, EP (2010) Tree-guided group lasso for multi-task regression with structured sparsity. In: International conference on machine learning (ICML), pp 543–550
30.
Zurück zum Zitat Kang, Z, Grauman, K, Sha F (2011) Learning with whom to share in multi-task feature learning. In: International conference on machine learning (ICML), pp 521–528 Kang, Z, Grauman, K, Sha F (2011) Learning with whom to share in multi-task feature learning. In: International conference on machine learning (ICML), pp 521–528
31.
Zurück zum Zitat Zweig, A, Weinshall D (2013) Hierarchical regularization cascade for joint learning. In: International conference on machine learning (ICML), pp 37–45 Zweig, A, Weinshall D (2013) Hierarchical regularization cascade for joint learning. In: International conference on machine learning (ICML), pp 37–45
33.
Zurück zum Zitat LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
34.
Zurück zum Zitat Simonyan, K, Zisserman, A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations (ICLR) Simonyan, K, Zisserman, A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations (ICLR)
35.
Zurück zum Zitat Jia, Y, Shelhamer, E, Donahue, J, Karayev, S, Long, J, Girshick, R, Guadarrama, S, Darrell T (2014) Caffe: convolutional architecture for fast feature embedding. arXiv:1408.5093 Jia, Y, Shelhamer, E, Donahue, J, Karayev, S, Long, J, Girshick, R, Guadarrama, S, Darrell T (2014) Caffe: convolutional architecture for fast feature embedding. arXiv:​1408.​5093
36.
Zurück zum Zitat Krizhevsky, A, Hinton, G (2009) Learning multiple layers of features from tiny images. Master’s thesis, Citeseer Krizhevsky, A, Hinton, G (2009) Learning multiple layers of features from tiny images. Master’s thesis, Citeseer
Metadaten
Titel
Tourism scene classification based on multi-stage transfer learning model
verfasst von
Tangquan Qi
Yong Xu
Haibin Ling
Publikationsdatum
19.01.2018
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 8/2019
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3351-2

Weitere Artikel der Ausgabe 8/2019

Neural Computing and Applications 8/2019 Zur Ausgabe