Skip to main content

2019 | OriginalPaper | Buchkapitel

How to Read Paintings: Semantic Art Understanding with Multi-modal Retrieval

verfasst von : Noa Garcia, George Vogiatzis

Erschienen in: Computer Vision – ECCV 2018 Workshops

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automatic art analysis has been mostly focused on classifying artworks into different artistic styles. However, understanding an artistic representation involves more complex processes, such as identifying the elements in the scene or recognizing author influences. We present SemArt, a multi-modal dataset for semantic art understanding. SemArt is a collection of fine-art painting images in which each image is associated to a number of attributes and a textual artistic comment, such as those that appear in art catalogues or museum collections. To evaluate semantic art understanding, we envisage the Text2Art challenge, a multi-modal retrieval task where relevant paintings are retrieved according to an artistic text, and vice versa. We also propose several models for encoding visual and textual artistic representations into a common semantic space. Our best approach is able to find the correct image within the top 10 ranked images in the 45.5% of the test samples. Moreover, our models show remarkable levels of art understanding when compared against human evaluation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
3.
Zurück zum Zitat Crowley, E., Zisserman, A.: The state of the art: object retrieval in paintings using discriminative regions. In: BMVC (2014) Crowley, E., Zisserman, A.: The state of the art: object retrieval in paintings using discriminative regions. In: BMVC (2014)
4.
Zurück zum Zitat Crowley, E.J., Parkhi, O.M., Zisserman, A.: Face painting: querying art with photos. In: BMVC, pp. 65.1–65.13 (2015) Crowley, E.J., Parkhi, O.M., Zisserman, A.: Face painting: querying art with photos. In: BMVC, pp. 65.1–65.13 (2015)
7.
Zurück zum Zitat Gong, Y., Ke, Q., Isard, M., Lazebnik, S.: A multi-view embedding space for modeling internet images, tags, and their semantics. Int. J. Comput. Vis. 106(2), 210–233 (2014)CrossRef Gong, Y., Ke, Q., Isard, M., Lazebnik, S.: A multi-view embedding space for modeling internet images, tags, and their semantics. Int. J. Comput. Vis. 106(2), 210–233 (2014)CrossRef
8.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
9.
Zurück zum Zitat Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
10.
Zurück zum Zitat Johnson, C.R., et al.: Image processing for artist identification. IEEE Signal Process. Mag. 25(4) (2008)CrossRef Johnson, C.R., et al.: Image processing for artist identification. IEEE Signal Process. Mag. 25(4) (2008)CrossRef
11.
Zurück zum Zitat Karayev, S., et al.: Recognizing image style. In: BMVC (2014) Karayev, S., et al.: Recognizing image style. In: BMVC (2014)
12.
Zurück zum Zitat Khan, F.S., Beigpour, S., Van de Weijer, J., Felsberg, M.: Painting-91: a large scale database for computational painting categorization. Mach. Vis. Appl. 25, 1385–1397 (2014)CrossRef Khan, F.S., Beigpour, S., Van de Weijer, J., Felsberg, M.: Painting-91: a large scale database for computational painting categorization. Mach. Vis. Appl. 25, 1385–1397 (2014)CrossRef
13.
Zurück zum Zitat Kiros, R., et al.: Skip-thought vectors. In: Advances in Neural Information Processing Systems (2015) Kiros, R., et al.: Skip-thought vectors. In: Advances in Neural Information Processing Systems (2015)
14.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
16.
Zurück zum Zitat Ma, D., et al.: From part to whole: who is behind the painting? In: Proceedings of the 2017 ACM on Multimedia Conference. ACM (2017) Ma, D., et al.: From part to whole: who is behind the painting? In: Proceedings of the 2017 ACM on Multimedia Conference. ACM (2017)
17.
Zurück zum Zitat Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. Inf. Retrieval 4, 80–81 (2001)CrossRef Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. Inf. Retrieval 4, 80–81 (2001)CrossRef
18.
Zurück zum Zitat Mao, H., Cheung, M., She, J.: DeepArt: learning joint representations of visual arts. In: ACM on Multimedia Conference (2017) Mao, H., Cheung, M., She, J.: DeepArt: learning joint representations of visual arts. In: ACM on Multimedia Conference (2017)
19.
Zurück zum Zitat Mensink, T., Van Gemert, J.: The Rijksmuseum challenge: museum-centered visual recognition. In: ICMR (2014) Mensink, T., Van Gemert, J.: The Rijksmuseum challenge: museum-centered visual recognition. In: ICMR (2014)
20.
Zurück zum Zitat Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH
21.
Zurück zum Zitat Saleh, B., Elgammal, A.M.: Large-scale classification of fine-art paintings: learning the right metric on the right feature. CoRR (2015) Saleh, B., Elgammal, A.M.: Large-scale classification of fine-art paintings: learning the right metric on the right feature. CoRR (2015)
23.
Zurück zum Zitat Shamir, L., Macura, T., Orlov, N., Eckley, D.M., Goldberg, I.G.: Impressionism, expressionism, surrealism: automated recognition of painters and schools of art. ACM Trans. Appl. Percept. 7, 8 (2010)CrossRef Shamir, L., Macura, T., Orlov, N., Eckley, D.M., Goldberg, I.G.: Impressionism, expressionism, surrealism: automated recognition of painters and schools of art. ACM Trans. Appl. Percept. 7, 8 (2010)CrossRef
24.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
25.
Zurück zum Zitat Tan, W.R., Chan, C.S., Aguirre, H.E., Tanaka, K.: Ceci n’est pas une pipe: a deep convolutional network for fine-art paintings classification. In: ICIP (2016) Tan, W.R., Chan, C.S., Aguirre, H.E., Tanaka, K.: Ceci n’est pas une pipe: a deep convolutional network for fine-art paintings classification. In: ICIP (2016)
26.
Zurück zum Zitat Tolias, G., Sicre, R., Jégou, H.: Particular object retrieval with integral max-pooling of CNN activations. In: International Conference on Learning Representations (2015) Tolias, G., Sicre, R., Jégou, H.: Particular object retrieval with integral max-pooling of CNN activations. In: International Conference on Learning Representations (2015)
27.
Zurück zum Zitat Wang, L., Li, Y., Huang, J., Lazebnik, S.: Learning two-branch neural networks for image-text matching tasks. IEEE Trans. Pattern Anal. Mach. Intell. (2018) Wang, L., Li, Y., Huang, J., Lazebnik, S.: Learning two-branch neural networks for image-text matching tasks. IEEE Trans. Pattern Anal. Mach. Intell. (2018)
Metadaten
Titel
How to Read Paintings: Semantic Art Understanding with Multi-modal Retrieval
verfasst von
Noa Garcia
George Vogiatzis
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-11012-3_52