Skip to main content
Top

2021 | OriginalPaper | Chapter

Transfer Learning Methods for Extracting, Classifying and Searching Large Collections of Historical Images and Their Captions

Authors : Cindy Roullet, David Fredrick, John Gauch, Rhodora G. Vennarucci, William Loder

Published in: Pattern Recognition. ICPR International Workshops and Challenges

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper is about the creation of an interactive software tool and dataset useful for exploring the unindexed 11-volume set, Pompei: Pitture e Mosaici (PPM), a valuable resource containing over 20,000 annotated historical images of the archaeological site of Pompeii, Italy. The tool includes functionalities such as a word search, and an images and captions similarity search. Searches for similarity are conducted using transfer learning on the data retrieved from the scanned version of PPM. Image processing, convolutional neural networks and natural language processing also had to come into play to extract, classify, and archive the text and image data from the digitized version of the books.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aletras, N., Stevenson, M., Clough, P.: Computing similarity between items in a digital library of cultural heritage. J. Comput. Cult. Heritage (JOCCH) 5(4), 1–19 (2013) Aletras, N., Stevenson, M., Clough, P.: Computing similarity between items in a digital library of cultural heritage. J. Comput. Cult. Heritage (JOCCH) 5(4), 1–19 (2013)
2.
go back to reference Arnold, T., Tilton, L.: Enriching historic photography with structured data using image region segmentation. In: Proceedings of the 1st International Workshop on Artificial Intelligence for Historical Image Enrichment and Access, pp. 1–10 (2020) Arnold, T., Tilton, L.: Enriching historic photography with structured data using image region segmentation. In: Proceedings of the 1st International Workshop on Artificial Intelligence for Historical Image Enrichment and Access, pp. 1–10 (2020)
3.
go back to reference Becattini, F., Ferracani, A., Landucci, L., Pezzatini, D., Uricchio, T., Del Bimbo, A.: Imaging novecento. A mobile app for automatic recognition of artworks and transfer of artistic styles. In: Ioannides, M., et al. (eds.) EuroMed 2016. LNCS, vol. 10058, pp. 781–791. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-48496-9_62CrossRef Becattini, F., Ferracani, A., Landucci, L., Pezzatini, D., Uricchio, T., Del Bimbo, A.: Imaging novecento. A mobile app for automatic recognition of artworks and transfer of artistic styles. In: Ioannides, M., et al. (eds.) EuroMed 2016. LNCS, vol. 10058, pp. 781–791. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-48496-9_​62CrossRef
4.
go back to reference Belhi, A., Bouras, A., Foufou, S.: Leveraging known data for missing label prediction in cultural heritage context. Appl. Sci. 8(10), 1768 (2018)CrossRef Belhi, A., Bouras, A., Foufou, S.: Leveraging known data for missing label prediction in cultural heritage context. Appl. Sci. 8(10), 1768 (2018)CrossRef
5.
go back to reference Díaz-Rodríguez, N., Pisoni, G.: Accessible cultural heritage through explainable artificial intelligence. In: 11th Workshop on Personalized Access to Cultural Heritage (2020) Díaz-Rodríguez, N., Pisoni, G.: Accessible cultural heritage through explainable artificial intelligence. In: 11th Workshop on Personalized Access to Cultural Heritage (2020)
6.
go back to reference Gultom, Y., Arymurthy, A.M., Masikome, R.J.: Batik classification using deep convolutional network transfer learning. J. Ilmu Komputer dan Inf. 11(2), 59–66 (2018)CrossRef Gultom, Y., Arymurthy, A.M., Masikome, R.J.: Batik classification using deep convolutional network transfer learning. J. Ilmu Komputer dan Inf. 11(2), 59–66 (2018)CrossRef
7.
go back to reference Jboor, N.H., et al.: Towards an inpainting framework for visual cultural heritage. In: 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), pp. 602–607. IEEE (2019) Jboor, N.H., et al.: Towards an inpainting framework for visual cultural heritage. In: 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), pp. 602–607. IEEE (2019)
8.
go back to reference Kulkarni, U., et al.: Classification of cultural heritage sites using transfer learning. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), pp. 391–397. IEEE (2019) Kulkarni, U., et al.: Classification of cultural heritage sites using transfer learning. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), pp. 391–397. IEEE (2019)
9.
go back to reference Liu, Y., et al.: A many-objective evolutionary algorithm using a one-by-one selection strategy. IEEE Trans. Cybern. 47(9), 2689–2702 (2017)CrossRef Liu, Y., et al.: A many-objective evolutionary algorithm using a one-by-one selection strategy. IEEE Trans. Cybern. 47(9), 2689–2702 (2017)CrossRef
10.
go back to reference Palma, V: Towards deep learning for architecture: amonument recognition mobile app. In: International Archives of the Photogrammetry, Remote Sensing & Spatial Information Sciences (2019) Palma, V: Towards deep learning for architecture: amonument recognition mobile app. In: International Archives of the Photogrammetry, Remote Sensing & Spatial Information Sciences (2019)
11.
go back to reference Roullet, C., et al.: An automated technique to recognize and extract images from scanned archaeological documents. In: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), vol. 1, pp. 20–25. IEEE (2019) Roullet, C., et al.: An automated technique to recognize and extract images from scanned archaeological documents. In: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), vol. 1, pp. 20–25. IEEE (2019)
12.
go back to reference Schulman, A.J., Mitchell, R.R.: Operating characteristics from yes-no and forced-choice procedures. J. Acoust. Soc. Am. 40(2), 473–477 (1966)CrossRef Schulman, A.J., Mitchell, R.R.: Operating characteristics from yes-no and forced-choice procedures. J. Acoust. Soc. Am. 40(2), 473–477 (1966)CrossRef
13.
go back to reference Szegedy, C., et al.: Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv: 1602.07261 (2016) Szegedy, C., et al.: Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv:​ 1602.​07261 (2016)
14.
go back to reference Szegedy, C., et al.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016) Szegedy, C., et al.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)
16.
go back to reference Trstenjak, B., Mikac, S., Donko, D.: KNN with TF-IDF based framework for text categorization. Proc. Eng. 69, 1356–1364 (2014)CrossRef Trstenjak, B., Mikac, S., Donko, D.: KNN with TF-IDF based framework for text categorization. Proc. Eng. 69, 1356–1364 (2014)CrossRef
17.
go back to reference Wang, J., et al.: Text similarity calculation method based on hybrid model of LDA and TF-IDF. In: Proceedings of the 2019 3rd International Conference on Computer Science and Artificial Intelligence, pp. 1–8 (2019) Wang, J., et al.: Text similarity calculation method based on hybrid model of LDA and TF-IDF. In: Proceedings of the 2019 3rd International Conference on Computer Science and Artificial Intelligence, pp. 1–8 (2019)
18.
go back to reference Wevers, M., Smits, T.: The visual digital turn: using neural networks to study historical images. Digit. Scholarsh. Hum. 35(1), 194–207 (2020) Wevers, M., Smits, T.: The visual digital turn: using neural networks to study historical images. Digit. Scholarsh. Hum. 35(1), 194–207 (2020)
19.
go back to reference Zohar, M., Shimshoni, I., Khateb, F.: GIScience Integrated with Computer Vision for the Interpretation and Analysis of Old Paintings. In: GISTAM, pp. 233–239 (2020) Zohar, M., Shimshoni, I., Khateb, F.: GIScience Integrated with Computer Vision for the Interpretation and Analysis of Old Paintings. In: GISTAM, pp. 233–239 (2020)
Metadata
Title
Transfer Learning Methods for Extracting, Classifying and Searching Large Collections of Historical Images and Their Captions
Authors
Cindy Roullet
David Fredrick
John Gauch
Rhodora G. Vennarucci
William Loder
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-68787-8_13

Premium Partner