Skip to main content
Top

2019 | OriginalPaper | Chapter

Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search

Authors : Byeongseon Park, Mitsunori Matsushita

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The purpose of this research is to realize retrieval of comic based on content information. Resources of the contents information of existing comics were only the comics itself and review. However, these pieces of information have drawbacks that they can not sufficiently extract information necessary for searching, and that they contain a lot of unnecessary information. In order to solve this problem, we proposed to use the book cover of comics as a resource to grasp the contents of comics. In the proposed method, we estimate the age and cultural background of comics expressed by clothes and belongings written on the cover of comics from the reasoning model which performed fine-tuning from the VGG-16 model. Also, we associated comics with each other based on the obtained semantic vectors and tags. As a result of the experiment, the accuracy of the model was 0.693, and the reproducibility of the tag to the correct data was 0.918. Furthermore, we observed unity in the comics related by the obtained information.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
\(\copyright \) Yuzuru Shimazaki, Kodansha Ltd.
 
2
\(\copyright \) Ken Yagami, Kadokawa Publishing Ltd.
 
3
\(\copyright \) Yuka Kuniki, Takeshobo Ltd.
 
Literature
1.
go back to reference Park, B., Okamoto, K., Yamashita, R., Matsushita, M.: Designing a comic exploration system using a hierarchical topic classification of reviews. Inf. Eng. Express Int. Instit. Appl. Inform. 3(2), 45–57 (2017) Park, B., Okamoto, K., Yamashita, R., Matsushita, M.: Designing a comic exploration system using a hierarchical topic classification of reviews. Inf. Eng. Express Int. Instit. Appl. Inform. 3(2), 45–57 (2017)
2.
go back to reference Rigaud, C., Gurin, C., Karatzas, D., Burie, J.C., Ogier, J.M.: Knowledge-driven understanding of images in comic books. IJDAR 18(3), 199–221 (2015)CrossRef Rigaud, C., Gurin, C., Karatzas, D., Burie, J.C., Ogier, J.M.: Knowledge-driven understanding of images in comic books. IJDAR 18(3), 199–221 (2015)CrossRef
3.
go back to reference Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, IJCAI, Hyderabad, pp. 2885–2890 (2007) Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, IJCAI, Hyderabad, pp. 2885–2890 (2007)
4.
go back to reference Arai, K., Tolle H.: Method for automatic E-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology, pp. 370–375. IEEE, Las Vegas (2010) Arai, K., Tolle H.: Method for automatic E-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology, pp. 370–375. IEEE, Las Vegas (2010)
5.
go back to reference Chu, W-T., Li, W-W.: Manga FaceNet: face detection in manga based on deep neural network. In: 17th Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, ICMR, Bucharest, pp. 412–415 (2017) Chu, W-T., Li, W-W.: Manga FaceNet: face detection in manga based on deep neural network. In: 17th Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, ICMR, Bucharest, pp. 412–415 (2017)
6.
go back to reference LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324. IEEE (1998) LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324. IEEE (1998)
7.
go back to reference Blei, D.M., Andrew, Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D.M., Andrew, Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
8.
9.
go back to reference Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE, Miami (2009) Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE, Miami (2009)
10.
go back to reference Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations, San Diego, (2014) Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations, San Diego, (2014)
11.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations, San Diego (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations, San Diego (2015)
12.
go back to reference Saito, M., Matsui, Y.: Illustration2Vec: a semantic vector representation of illustrations. In: SIGGRAPH ASIA 2015 Technical Briefs, No. 5 (2015) Saito, M., Matsui, Y.: Illustration2Vec: a semantic vector representation of illustrations. In: SIGGRAPH ASIA 2015 Technical Briefs, No. 5 (2015)
13.
go back to reference Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008)CrossRef Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008)CrossRef
14.
go back to reference Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 1(4), 300–307 (2007) Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 1(4), 300–307 (2007)
Metadata
Title
Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search
Authors
Byeongseon Park
Mitsunori Matsushita
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-05716-9_58