Skip to main content

2021 | OriginalPaper | Buchkapitel

An Analysis of the Transfer Learning of Convolutional Neural Networks for Artistic Images

verfasst von : Nicolas Gonthier, Yann Gousseau, Saïd Ladjal

Erschienen in: Pattern Recognition. ICPR International Workshops and Challenges

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Transfer learning from huge natural image datasets, fine-tuning of deep neural networks and the use of the corresponding pre-trained networks have become de facto the core of art analysis applications. Nevertheless, the effects of transfer learning are still poorly understood. In this paper, we first use techniques for visualizing the network internal representations in order to provide clues to the understanding of what the network has learned on artistic images. Then, we provide a quantitative analysis of the changes introduced by the learning process thanks to metrics in both the feature and parameter spaces, as well as metrics computed on the set of maximal activation images. These analyses are performed on several variations of the transfer learning procedure. In particular, we observed that the network could specialize some pre-trained filters to the new image modality and also that higher layers tend to concentrate classes. Finally, we have shown that a double fine-tuning involving a medium-size artistic dataset can improve the classification on smaller datasets, even when the task changes.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The reader can find more feature visualizations at https://​artfinetune.​telecom-paris.​fr/​data/​.
 
2
A slight extension of this work is available at https://​arxiv.​org/​abs/​2011.​02727 with the differences between the optimization schemes and more visualization experiments.
 
Literatur
1.
Zurück zum Zitat Bianco, S., Mazzini, D., Napoletano, P., Schettini, R.: Multitask painting categorization by deep multibranch neural network. Exp. Syst. Appl. 135, 90–101 (2019)CrossRef Bianco, S., Mazzini, D., Napoletano, P., Schettini, R.: Multitask painting categorization by deep multibranch neural network. Exp. Syst. Appl. 135, 90–101 (2019)CrossRef
2.
Zurück zum Zitat Cetinic, E., Lipic, T., Grgic, S.: Fine-tuning convolutional neural networks for fine art classification. Exp. Syst. Appl. 114, 107–118 (2018)CrossRef Cetinic, E., Lipic, T., Grgic, S.: Fine-tuning convolutional neural networks for fine art classification. Exp. Syst. Appl. 114, 107–118 (2018)CrossRef
3.
Zurück zum Zitat Cetinic, E., Lipic, T., Grgic, S.: Learning the principles of art history with convolutional neural networks. Pattern Recogn. Lett. 129, 56–62 (2019)CrossRef Cetinic, E., Lipic, T., Grgic, S.: Learning the principles of art history with convolutional neural networks. Pattern Recogn. Lett. 129, 56–62 (2019)CrossRef
6.
Zurück zum Zitat Donahue, J., et al.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: International Conference on Machine Learning (2014) Donahue, J., et al.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: International Conference on Machine Learning (2014)
7.
Zurück zum Zitat Erhan, D., Bengio, Y., Courville, A., Vincent, P.: Visualizing higher-layer features of a deep network. Univ. Montreal 1341(3), 1 (2009) Erhan, D., Bengio, Y., Courville, A., Vincent, P.: Visualizing higher-layer features of a deep network. Univ. Montreal 1341(3), 1 (2009)
8.
Zurück zum Zitat Geirhos, R., Rubisch, P., Michaelis, C., Bethge, M., Wichmann, F.A., Brendel, W.: ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In: ICLR (2019) Geirhos, R., Rubisch, P., Michaelis, C., Bethge, M., Wichmann, F.A., Brendel, W.: ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In: ICLR (2019)
9.
Zurück zum Zitat Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (2014) Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
11.
Zurück zum Zitat Kornblith, S., Norouzi, M., Lee, H., Hinton, G.: Similarity of neural network representations revisited. In: ICML (2019) Kornblith, S., Norouzi, M., Lee, H., Hinton, G.: Similarity of neural network representations revisited. In: ICML (2019)
12.
Zurück zum Zitat Lecoutre, A., Negrevergne, B., Yger, F.: Recognizing art style automatically in painting with deep learning. In: Asian Conference on Machine Learning. JMLR: Workshop and Conference Proceedings (2017) Lecoutre, A., Negrevergne, B., Yger, F.: Recognizing art style automatically in painting with deep learning. In: Asian Conference on Machine Learning. JMLR: Workshop and Conference Proceedings (2017)
13.
Zurück zum Zitat Madhu, P., Kosti, R., Mührenberg, L., Bell, P., Maier, A., Christlein, V.: Recognizing characters in art history using deep learning. In: Proceedings of the 1st Workshop on Structuring and Understanding of Multimedia heritAge Contents, SUMAC 2019, pp. 15–22.ACM (2019) Madhu, P., Kosti, R., Mührenberg, L., Bell, P., Maier, A., Christlein, V.: Recognizing characters in art history using deep learning. In: Proceedings of the 1st Workshop on Structuring and Understanding of Multimedia heritAge Contents, SUMAC 2019, pp. 15–22.ACM (2019)
14.
Zurück zum Zitat Neyshabur, B., Sedghi, H., Zhang, C.: What is being transferred in transfer learning? In: Advances in Neural Information Processing Systems, vol. 33 (2020) Neyshabur, B., Sedghi, H., Zhang, C.: What is being transferred in transfer learning? In: Advances in Neural Information Processing Systems, vol. 33 (2020)
15.
Zurück zum Zitat van Noord, N., Postma, E.: Learning scale-variant and scale-invariant features for deep image classification. Pattern Recogn. 61, 583–592 (2017)CrossRef van Noord, N., Postma, E.: Learning scale-variant and scale-invariant features for deep image classification. Pattern Recogn. 61, 583–592 (2017)CrossRef
16.
Zurück zum Zitat Olah, C., Cammarata, N., Schubert, L., Goh, G., Petrov, M., Carter, S.: An overview of early vision in InceptionV1. Distill 5(4) (2020) Olah, C., Cammarata, N., Schubert, L., Goh, G., Petrov, M., Carter, S.: An overview of early vision in InceptionV1. Distill 5(4) (2020)
17.
Zurück zum Zitat Olah, C., Mordvintsev, A., Schubert, L.: Feature visualization. Distill 2(11), e7 (2017)CrossRef Olah, C., Mordvintsev, A., Schubert, L.: Feature visualization. Distill 2(11), e7 (2017)CrossRef
18.
Zurück zum Zitat Olah, C., et al.: The building blocks of interpretability. Distill 3(3), e10 (2018)CrossRef Olah, C., et al.: The building blocks of interpretability. Distill 3(3), e10 (2018)CrossRef
19.
Zurück zum Zitat Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)MathSciNetCrossRef Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)MathSciNetCrossRef
20.
Zurück zum Zitat Sabatelli, M., Kestemont, M., Daelemans, W., Geurts, P.: Deep transfer learning for art classification problems. In: Workshop on Computer Vision for Art Analysis ECCV, Munich, pp. 1–17 (2018) Sabatelli, M., Kestemont, M., Daelemans, W., Geurts, P.: Deep transfer learning for art classification problems. In: Workshop on Computer Vision for Art Analysis ECCV, Munich, pp. 1–17 (2018)
22.
Zurück zum Zitat Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: ICLR, pp. 1–8 (2014) Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: ICLR, pp. 1–8 (2014)
23.
Zurück zum Zitat Strezoski, G., Worning, M.: OmniArt: a large-scale artistic benchmark. ACM Trans. Multimedia Comput. Commun. Appl. (TOMM) - Spec. Sect. Deep Learn. Intell. Multimedia Anal. 14(4), 1–21 (2018) Strezoski, G., Worning, M.: OmniArt: a large-scale artistic benchmark. ACM Trans. Multimedia Comput. Commun. Appl. (TOMM) - Spec. Sect. Deep Learn. Intell. Multimedia Anal. 14(4), 1–21 (2018)
24.
Zurück zum Zitat Strezoski, G., Worring, M.: Plug-and-Play Interactive Deep Network Visualization. Visual Analytics for Deep Learning, VADL (2017) Strezoski, G., Worring, M.: Plug-and-Play Interactive Deep Network Visualization. Visual Analytics for Deep Learning, VADL (2017)
25.
Zurück zum Zitat Szabó, R., Katona, D., Csillag, M., Csiszárik, A., Varga, D.: Visualizing transfer learning. In: ICML Workshop on Human Interpretability in Machine Learning (2020) Szabó, R., Katona, D., Csillag, M., Csiszárik, A., Varga, D.: Visualizing transfer learning. In: ICML Workshop on Human Interpretability in Machine Learning (2020)
26.
Zurück zum Zitat Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015) Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
27.
Zurück zum Zitat Tan, W.R., Chan, C.S., Aguirre, H.E., Tanaka, K.: Ceci n’est pas une pipe: a deep convolutional network for fine-art paintings classification. In: 2016 IEEE International Conference on Image Processing (ICIP), pp. 3703–3707 (2016) Tan, W.R., Chan, C.S., Aguirre, H.E., Tanaka, K.: Ceci n’est pas une pipe: a deep convolutional network for fine-art paintings classification. In: 2016 IEEE International Conference on Image Processing (ICIP), pp. 3703–3707 (2016)
29.
Zurück zum Zitat Wilber, M.J., Fang, C., Jin, H., Hertzmann, A., Collomosse, J., Belongie, S.: BAM! The behance artistic media dataset for recognition beyond photography. In: IEEE International Conference on Computer Vision (ICCV), pp. 1211–1220 (2017) Wilber, M.J., Fang, C., Jin, H., Hertzmann, A., Collomosse, J., Belongie, S.: BAM! The behance artistic media dataset for recognition beyond photography. In: IEEE International Conference on Computer Vision (ICCV), pp. 1211–1220 (2017)
30.
Zurück zum Zitat Yin, R., Monson, E., Honig, E., Daubechies, I., Maggioni, M.: Object recognition in art drawings: transfer of a neural network. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2299–2303 (2016) Yin, R., Monson, E., Honig, E., Daubechies, I., Maggioni, M.: Object recognition in art drawings: transfer of a neural network. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2299–2303 (2016)
Metadaten
Titel
An Analysis of the Transfer Learning of Convolutional Neural Networks for Artistic Images
verfasst von
Nicolas Gonthier
Yann Gousseau
Saïd Ladjal
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-68796-0_39