Skip to main content
Top

Hint

Swipe to navigate through the chapters of this book

2019 | OriginalPaper | Chapter

Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search

Authors : Byeongseon Park, Mitsunori Matsushita

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

share
SHARE

Abstract

The purpose of this research is to realize retrieval of comic based on content information. Resources of the contents information of existing comics were only the comics itself and review. However, these pieces of information have drawbacks that they can not sufficiently extract information necessary for searching, and that they contain a lot of unnecessary information. In order to solve this problem, we proposed to use the book cover of comics as a resource to grasp the contents of comics. In the proposed method, we estimate the age and cultural background of comics expressed by clothes and belongings written on the cover of comics from the reasoning model which performed fine-tuning from the VGG-16 model. Also, we associated comics with each other based on the obtained semantic vectors and tags. As a result of the experiment, the accuracy of the model was 0.693, and the reproducibility of the tag to the correct data was 0.918. Furthermore, we observed unity in the comics related by the obtained information.

To get access to this content you need the following product:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 69.000 Bücher
  • über 500 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt 90 Tage mit der neuen Mini-Lizenz testen!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 50.000 Bücher
  • über 380 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe



 


Jetzt 90 Tage mit der neuen Mini-Lizenz testen!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 58.000 Bücher
  • über 300 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko





Jetzt 90 Tage mit der neuen Mini-Lizenz testen!

Footnotes
1
\(\copyright \) Yuzuru Shimazaki, Kodansha Ltd.
 
2
\(\copyright \) Ken Yagami, Kadokawa Publishing Ltd.
 
3
\(\copyright \) Yuka Kuniki, Takeshobo Ltd.
 
Literature
1.
go back to reference Park, B., Okamoto, K., Yamashita, R., Matsushita, M.: Designing a comic exploration system using a hierarchical topic classification of reviews. Inf. Eng. Express Int. Instit. Appl. Inform. 3(2), 45–57 (2017) Park, B., Okamoto, K., Yamashita, R., Matsushita, M.: Designing a comic exploration system using a hierarchical topic classification of reviews. Inf. Eng. Express Int. Instit. Appl. Inform. 3(2), 45–57 (2017)
2.
go back to reference Rigaud, C., Gurin, C., Karatzas, D., Burie, J.C., Ogier, J.M.: Knowledge-driven understanding of images in comic books. IJDAR 18(3), 199–221 (2015) CrossRef Rigaud, C., Gurin, C., Karatzas, D., Burie, J.C., Ogier, J.M.: Knowledge-driven understanding of images in comic books. IJDAR 18(3), 199–221 (2015) CrossRef
3.
go back to reference Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, IJCAI, Hyderabad, pp. 2885–2890 (2007) Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, IJCAI, Hyderabad, pp. 2885–2890 (2007)
4.
go back to reference Arai, K., Tolle H.: Method for automatic E-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology, pp. 370–375. IEEE, Las Vegas (2010) Arai, K., Tolle H.: Method for automatic E-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology, pp. 370–375. IEEE, Las Vegas (2010)
5.
go back to reference Chu, W-T., Li, W-W.: Manga FaceNet: face detection in manga based on deep neural network. In: 17th Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, ICMR, Bucharest, pp. 412–415 (2017) Chu, W-T., Li, W-W.: Manga FaceNet: face detection in manga based on deep neural network. In: 17th Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, ICMR, Bucharest, pp. 412–415 (2017)
6.
go back to reference LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324. IEEE (1998) LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324. IEEE (1998)
7.
go back to reference Blei, D.M., Andrew, Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003) MATH Blei, D.M., Andrew, Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003) MATH
8.
9.
go back to reference Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE, Miami (2009) Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE, Miami (2009)
10.
go back to reference Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations, San Diego, (2014) Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations, San Diego, (2014)
11.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations, San Diego (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations, San Diego (2015)
12.
go back to reference Saito, M., Matsui, Y.: Illustration2Vec: a semantic vector representation of illustrations. In: SIGGRAPH ASIA 2015 Technical Briefs, No. 5 (2015) Saito, M., Matsui, Y.: Illustration2Vec: a semantic vector representation of illustrations. In: SIGGRAPH ASIA 2015 Technical Briefs, No. 5 (2015)
13.
go back to reference Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008) CrossRef Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008) CrossRef
14.
go back to reference Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 1(4), 300–307 (2007) Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 1(4), 300–307 (2007)
Metadata
Title
Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search
Authors
Byeongseon Park
Mitsunori Matsushita
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-05716-9_58