Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 3/2015

01.09.2015 | Original Paper

Knowledge-driven understanding of images in comic books

verfasst von: Christophe Rigaud, Clément Guérin, Dimosthenis Karatzas, Jean-Christophe Burie, Jean-Marc Ogier

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 3/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Document analysis is an active field of research, which can attain a complete understanding of the semantics of a given document. One example of the document understanding process is enabling a computer to identify the key elements of a comic book story and arrange them according to a predefined domain knowledge. In this study, we propose a knowledge-driven system that can interact with bottom-up and top-down information to progressively understand the content of a document. We model the comic book’s and the image processing domains knowledge for information consistency analysis. In addition, different image processing methods are improved or developed to extract panels, balloons, tails, texts, comic characters and their semantic relations in an unsupervised way.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Arai, K., Tolle, H.: Method for automatic e-comic scene frame extraction for reading comic on mobile devices. In: IEEE Computer Society Seventh International Conference on Information Technology: New Generations, ITNG ’10, pp. 370–375, Washington, DC, USA, (2010) Arai, K., Tolle, H.: Method for automatic e-comic scene frame extraction for reading comic on mobile devices. In: IEEE Computer Society Seventh International Conference on Information Technology: New Generations, ITNG ’10, pp. 370–375, Washington, DC, USA, (2010)
2.
Zurück zum Zitat Arai, K., Tolle, H.: Method for real time text extraction of digital manga comic. Int. J. Image Proces. (IJIP) 4(6), 669–676 (2011) Arai, K., Tolle, H.: Method for real time text extraction of digital manga comic. Int. J. Image Proces. (IJIP) 4(6), 669–676 (2011)
3.
Zurück zum Zitat Back, M., Gold, R., Balsamo, A., Chow, M., Gorbet, M., Harrison, S., MacDonald, D., Minnerman, S.: Designing innovative reading experiences for a museum exhibition. Computer 34(1), 80–87 (2001)CrossRef Back, M., Gold, R., Balsamo, A., Chow, M., Gorbet, M., Harrison, S., MacDonald, D., Minnerman, S.: Designing innovative reading experiences for a museum exhibition. Computer 34(1), 80–87 (2001)CrossRef
4.
Zurück zum Zitat Blaschke, T., Hay, G.J., Kelly, M., Lang, S., Hofmann, P., Addink, E., Feitosa, R.Q., van der Meer, F., van der Werff, H., van Coillie, F., Tiede, D.: Geographic object-based image analysis: towards a new paradigm. J. Photogramm. Remote Sens. 87, 180–191 (2014)CrossRef Blaschke, T., Hay, G.J., Kelly, M., Lang, S., Hofmann, P., Addink, E., Feitosa, R.Q., van der Meer, F., van der Werff, H., van Coillie, F., Tiede, D.: Geographic object-based image analysis: towards a new paradigm. J. Photogramm. Remote Sens. 87, 180–191 (2014)CrossRef
5.
Zurück zum Zitat Borodo, M.: Multimodality, translation and comics. Perspectives 1–20 (2014) Borodo, M.: Multimodality, translation and comics. Perspectives 1–20 (2014)
7.
Zurück zum Zitat Di Sciascio, E., Donini, F.M., Mongiello, M.: Structured knowledge representation for image retrieval. J. Artif. Intell. Res. 16(1), 209–257 (2002)MATH Di Sciascio, E., Donini, F.M., Mongiello, M.: Structured knowledge representation for image retrieval. J. Artif. Intell. Res. 16(1), 209–257 (2002)MATH
8.
Zurück zum Zitat Duc, B.: L’art de la BD—Tome 1—Du scénario à la réalisation. Glénat (1982) Duc, B.: L’art de la BD—Tome 1—Du scénario à la réalisation. Glénat (1982)
9.
Zurück zum Zitat Duda, R.O., Hart, P.E.: Use of the hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11–15 (1972)MATHCrossRef Duda, R.O., Hart, P.E.: Use of the hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11–15 (1972)MATHCrossRef
10.
Zurück zum Zitat Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef
11.
Zurück zum Zitat Fidler, S., Yao, J., Urtasun, R.: Describing the scene as a whole: joint object detection, scene classification and semantic segmentation. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 702–709. (2012) Fidler, S., Yao, J., Urtasun, R.: Describing the scene as a whole: joint object detection, scene classification and semantic segmentation. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 702–709. (2012)
12.
Zurück zum Zitat Guérin, C.: Ontologies nd spatial relations applied to comic books reading. In: PhD Symposium of Knowledge Engineering and Knowledge Management (EKAW), Galway, Ireland (2012) Guérin, C.: Ontologies nd spatial relations applied to comic books reading. In: PhD Symposium of Knowledge Engineering and Knowledge Management (EKAW), Galway, Ireland (2012)
13.
Zurück zum Zitat Guérin, C., Rigaud, C., Mercier, A., et al.: ebdtheque: a representative database of comics. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Washington DC (2013) Guérin, C., Rigaud, C., Mercier, A., et al.: ebdtheque: a representative database of comics. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Washington DC (2013)
14.
Zurück zum Zitat Haarslev, V., Hidde, K., Möller, R., Wessel, M.: The RacerPro knowledge representation and reasoning system. Semant. Web 3(3), 267–277 (2012) Haarslev, V., Hidde, K., Möller, R., Wessel, M.: The RacerPro knowledge representation and reasoning system. Semant. Web 3(3), 267–277 (2012)
15.
Zurück zum Zitat Han, E., Kim, K., Yang, H., Jung, K.: Frame segmentation used mlp-based x-y recursive for mobile cartoon content. In: Proceedings of the 12th International Conference on Human–Computer Interaction: Intelligent Multimodal Interaction Environments, HCI’07, pp. 872–881. Springer, Berlin (2007) Han, E., Kim, K., Yang, H., Jung, K.: Frame segmentation used mlp-based x-y recursive for mobile cartoon content. In: Proceedings of the 12th International Conference on Human–Computer Interaction: Intelligent Multimodal Interaction Environments, HCI’07, pp. 872–881. Springer, Berlin (2007)
16.
Zurück zum Zitat Hayes-Roth, F., Waterman, D., Lenat, D.: Building expert systems. Addison-Wesley, Reading (1984) Hayes-Roth, F., Waterman, D., Lenat, D.: Building expert systems. Addison-Wesley, Reading (1984)
17.
Zurück zum Zitat Hermann, A., Ferré, S., Ducassé, M.: Guided semantic annotation of comic panels with sewelis. In: EKAW, volume 7603 of Lecture Notes in Computer Science, pp. 430–433. Springer (2012) Hermann, A., Ferré, S., Ducassé, M.: Guided semantic annotation of comic panels with sewelis. In: EKAW, volume 7603 of Lecture Notes in Computer Science, pp. 430–433. Springer (2012)
18.
Zurück zum Zitat Ho, A. K. N., Burie, J.-C., Ogier, J.-M.: Comics page structure analysis based on automatic panel extraction. In: GREC 2011, Nineth IAPR International Workshop on Graphics Recognition, Seoul, Korea, pp. 15–16 (2011) Ho, A. K. N., Burie, J.-C., Ogier, J.-M.: Comics page structure analysis based on automatic panel extraction. In: GREC 2011, Nineth IAPR International Workshop on Graphics Recognition, Seoul, Korea, pp. 15–16 (2011)
19.
Zurück zum Zitat Ho, A. K. N., Burie, J.-C., Ogier, J.-M.: Panel and speech balloon extraction from comic books. In: 2012 10th IAPR International Workshop on Document Analysis Systems, pp. 424–428 (2012) Ho, A. K. N., Burie, J.-C., Ogier, J.-M.: Panel and speech balloon extraction from comic books. In: 2012 10th IAPR International Workshop on Document Analysis Systems, pp. 424–428 (2012)
20.
Zurück zum Zitat Ho, H. N., Rigaud, C., Burie, J.-C., Ogier, J.-M.: Redundant structure detection in attributed adjacency graphs for character detection in comics books. In: Proceedings of the 10th IAPR International Workshop on Graphics Recognition (GREC), Bethlehem, PA, USA, (2013) Ho, H. N., Rigaud, C., Burie, J.-C., Ogier, J.-M.: Redundant structure detection in attributed adjacency graphs for character detection in comics books. In: Proceedings of the 10th IAPR International Workshop on Graphics Recognition (GREC), Bethlehem, PA, USA, (2013)
21.
Zurück zum Zitat Hu, B., Dasmahapatra, S., Lewis, P., Shadbolt, N.: Ontology-based medical image annotation with description logics. In: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (2003) Hu, B., Dasmahapatra, S., Lewis, P., Shadbolt, N.: Ontology-based medical image annotation with description logics. In: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (2003)
22.
Zurück zum Zitat Hudelot, C., Atif, J., Bloch, I.: Fuzzy spatial relation ontology for image interpretation. Fuzzy Sets Syst. 159(15), 1929–1951 (2008)MathSciNetCrossRef Hudelot, C., Atif, J., Bloch, I.: Fuzzy spatial relation ontology for image interpretation. Fuzzy Sets Syst. 159(15), 1929–1951 (2008)MathSciNetCrossRef
23.
Zurück zum Zitat IBISWorld. Comic book publishing in the US: Market research report, (2013) IBISWorld. Comic book publishing in the US: Market research report, (2013)
24.
Zurück zum Zitat In, Y., Oie, T., Higuchi, M., Kawasaki, S., Koike, A., Murakami, H.: Fast frame decomposition and sorting by contour tracing for mobile phone comic images. Int. J. Syst. Appl. Eng. Dev. 5(2), 216–223 (2011) In, Y., Oie, T., Higuchi, M., Kawasaki, S., Koike, A., Murakami, H.: Fast frame decomposition and sorting by contour tracing for mobile phone comic images. Int. J. Syst. Appl. Eng. Dev. 5(2), 216–223 (2011)
26.
Zurück zum Zitat Jérémy, R., Vincent, B.: Comics reading: an automatic script generation. In: Proceedings of the 21st International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), pp. 88–96 (2013) Jérémy, R., Vincent, B.: Comics reading: an automatic script generation. In: Proceedings of the 21st International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), pp. 88–96 (2013)
27.
Zurück zum Zitat Khan, F. S., Rao, M. A., van de Weijer, J., Bagdanov, A. D., Vanrell, M., Lopez, A.: Color attributes for object detection. In: Twenty-Fifth IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2012) (2012) Khan, F. S., Rao, M. A., van de Weijer, J., Bagdanov, A. D., Vanrell, M., Lopez, A.: Color attributes for object detection. In: Twenty-Fifth IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2012) (2012)
28.
Zurück zum Zitat Lainé, J.-M., Delzant, S.: Le lettrage des bulles. Eyrolles, Paris (2010) Lainé, J.-M., Delzant, S.: Le lettrage des bulles. Eyrolles, Paris (2010)
29.
Zurück zum Zitat Lamiroy, B., Ogier, J.-M.: Analysis and interpretation of graphical documents. In: Doermann, D., Tombre, K. (eds.) Handbook of Document Image Processing and Recognition. Springer, Berlin (2014) Lamiroy, B., Ogier, J.-M.: Analysis and interpretation of graphical documents. In: Doermann, D., Tombre, K. (eds.) Handbook of Document Image Processing and Recognition. Springer, Berlin (2014)
30.
Zurück zum Zitat Li, C., Kowdle, A., Saxena, A., Chen, T.: Toward holistic scene understanding: feedback enabled cascaded classification models. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1394–1408 (2012)CrossRef Li, C., Kowdle, A., Saxena, A., Chen, T.: Toward holistic scene understanding: feedback enabled cascaded classification models. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1394–1408 (2012)CrossRef
31.
Zurück zum Zitat Li, L., Wang, Y., Tang, Z., Gao, L.: Automatic comic page segmentation based on polygon detection. Multimed. Tools Appl. 69(1), 171–197 (2014)CrossRef Li, L., Wang, Y., Tang, Z., Gao, L.: Automatic comic page segmentation based on polygon detection. Multimed. Tools Appl. 69(1), 171–197 (2014)CrossRef
32.
Zurück zum Zitat Li, L., Wang, Y., Tang, Z., Lu, X., Gao, L.: Unsupervised speech text localization in comic images. In: 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1190–1194 (2013) Li, L., Wang, Y., Tang, Z., Lu, X., Gao, L.: Unsupervised speech text localization in comic images. In: 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1190–1194 (2013)
33.
Zurück zum Zitat Mao, S., Rosenfeld, A., Kanungo, T.: Document structure analysis algorithms: a literature survey. In: Kanungo, T., Smith, E.H.B., Hu, J., Kantor, P.B. (eds.) Document Recognition and Retrieval X, volume 5010 of SPIE Proceedings, pp. 197–207. SPIE, Bellingham (2003) Mao, S., Rosenfeld, A., Kanungo, T.: Document structure analysis algorithms: a literature survey. In: Kanungo, T., Smith, E.H.B., Hu, J., Kantor, P.B. (eds.) Document Recognition and Retrieval X, volume 5010 of SPIE Proceedings, pp. 197–207. SPIE, Bellingham (2003)
34.
Zurück zum Zitat McCloud, S.: Understanding Comics. William Morrow Paperbacks, New York (1994) McCloud, S.: Understanding Comics. William Morrow Paperbacks, New York (1994)
35.
Zurück zum Zitat McGuinness, D. L., Van Harmelen, F.: OWL Web Ontology Language Overview. Technical report, W3C (2004) McGuinness, D. L., Van Harmelen, F.: OWL Web Ontology Language Overview. Technical report, W3C (2004)
36.
Zurück zum Zitat Mezaris, V., Kompatsiaris, I., Strintzis, M.G.: An ontology approach to object-based image retrieval. In: International Conference on Image Processing (ICIP) vol 2, pp. 511–514 (2003) Mezaris, V., Kompatsiaris, I., Strintzis, M.G.: An ontology approach to object-based image retrieval. In: International Conference on Image Processing (ICIP) vol 2, pp. 511–514 (2003)
37.
Zurück zum Zitat Ogier, J., Mullot, R., Labiche, J., Lecourtier, Y.: Semantic coherency: the basis of an image interpretation device-application to the cadastral map interpretation. IEEE Trans. Syst. Man Cybern. Part B Cybern. 30(2), 322–338 (2000)CrossRef Ogier, J., Mullot, R., Labiche, J., Lecourtier, Y.: Semantic coherency: the basis of an image interpretation device-application to the cadastral map interpretation. IEEE Trans. Syst. Man Cybern. Part B Cybern. 30(2), 322–338 (2000)CrossRef
38.
Zurück zum Zitat Otsu, N.: A threshold selection method from gray level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979)CrossRef Otsu, N.: A threshold selection method from gray level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979)CrossRef
39.
Zurück zum Zitat Ponsard, C: Enhancing the accessibility for all of digital comic books. e-Minds, 1(5), (2009) Ponsard, C: Enhancing the accessibility for all of digital comic books. e-Minds, 1(5), (2009)
40.
Zurück zum Zitat Ponsard, C., Ramdoyal, R., Dziamski, D.: An ocr-enabled digital comic books viewer. In: Computers Helping People with Special Needs, pp. 471–478. Springer, (2012) Ponsard, C., Ramdoyal, R., Dziamski, D.: An ocr-enabled digital comic books viewer. In: Computers Helping People with Special Needs, pp. 471–478. Springer, (2012)
41.
Zurück zum Zitat Ratier, G.: 2013 : l’année de la décélération—acbd.fr, (2013) Ratier, G.: 2013 : l’année de la décélération—acbd.fr, (2013)
42.
Zurück zum Zitat Rhoades, S.: A Complete History of American Comic Books. Peter Lang, New York (2008) Rhoades, S.: A Complete History of American Comic Books. Peter Lang, New York (2008)
43.
Zurück zum Zitat Rigaud, C., Karatzas, D., Burie, J.-C., Ogier, J.-M.: Speech balloon contour classification in comics. In: Proceedings of the 10th IAPR International Workshop on Graphics Recognition (GREC), pp. 23–25, Bethlehem, PA, USA, (2013) Rigaud, C., Karatzas, D., Burie, J.-C., Ogier, J.-M.: Speech balloon contour classification in comics. In: Proceedings of the 10th IAPR International Workshop on Graphics Recognition (GREC), pp. 23–25, Bethlehem, PA, USA, (2013)
44.
Zurück zum Zitat Rigaud, C., Karatzas, D., Burie, J.-C., Ogier, J.-M.: Color descriptor for content-based drawing retrieval. In: Proceedings of International Workshop on Document Analysis Systems (DAS), Tours, France, (2014) Rigaud, C., Karatzas, D., Burie, J.-C., Ogier, J.-M.: Color descriptor for content-based drawing retrieval. In: Proceedings of International Workshop on Document Analysis Systems (DAS), Tours, France, (2014)
45.
Zurück zum Zitat Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.-C., Ogier, J.-M.: An active contour model for speech balloon detection in comics. In: IEEE Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR), (2013) Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.-C., Ogier, J.-M.: An active contour model for speech balloon detection in comics. In: IEEE Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR), (2013)
46.
Zurück zum Zitat Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.-C., Ogier, J.-M.: Automatic text localisation in scanned comic books. In: Proceedings of the 8th International Conference on Computer Vision Theory and Applications (VISAPP). SCITEPRESS Digital Library, (2013) Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.-C., Ogier, J.-M.: Automatic text localisation in scanned comic books. In: Proceedings of the 8th International Conference on Computer Vision Theory and Applications (VISAPP). SCITEPRESS Digital Library, (2013)
47.
Zurück zum Zitat Rigaud, C., Tsopze, N., Burie, J.-C., Ogier, J.-M.: Robust frame and text extraction from comic books. In: Kwon, Y.-B., Ogier, J.-M. (eds.) Graphics Recognition. New Trends and Challenges. Lecture Notes in Computer Science, vol. 7423, pp. 129–138. Springer, Berlin (2013)CrossRef Rigaud, C., Tsopze, N., Burie, J.-C., Ogier, J.-M.: Robust frame and text extraction from comic books. In: Kwon, Y.-B., Ogier, J.-M. (eds.) Graphics Recognition. New Trends and Challenges. Lecture Notes in Computer Science, vol. 7423, pp. 129–138. Springer, Berlin (2013)CrossRef
48.
Zurück zum Zitat Robin Varnum, G., Christina, T.: The Language of Comics: Word and Image. University Press of Mississippi, Mississippi (2007). Studies in Popular Culture Robin Varnum, G., Christina, T.: The Language of Comics: Word and Image. University Press of Mississippi, Mississippi (2007). Studies in Popular Culture
49.
Zurück zum Zitat Sarwar, S., Qayyum, Z. U., Majeed, S.: Ontology based image retrieval framework using qualitative semantic image descriptions. In: Procedia Computer Science, 17th International Conference in Knowledge Based and Intelligent Information and Engineering Systems—KES2013 22:285–294, (2013) Sarwar, S., Qayyum, Z. U., Majeed, S.: Ontology based image retrieval framework using qualitative semantic image descriptions. In: Procedia Computer Science, 17th International Conference in Knowledge Based and Intelligent Information and Engineering Systems—KES2013 22:285–294, (2013)
50.
Zurück zum Zitat Singh, S., Cheok, A. D., Ng, G. L., Farbiz, F.: 3d augmented reality comic book and notes for children using mobile phones. In: Proceedings of the 2004 Conference on Interaction Design and Children: Building a Community, IDC ’04, pp. 149–150, ACM, New York, (2004) Singh, S., Cheok, A. D., Ng, G. L., Farbiz, F.: 3d augmented reality comic book and notes for children using mobile phones. In: Proceedings of the 2004 Conference on Interaction Design and Children: Building a Community, IDC ’04, pp. 149–150, ACM, New York, (2004)
51.
Zurück zum Zitat Sirin, E., Parsia, B., Cuenca Grau, B., Kalyanpur, A., Katz, Y.: Pellet: a practical OWL-DL reasoner. Web Semant. Sci. Serv. Agents World Wide Web 5(2), 51–53 (2007)CrossRef Sirin, E., Parsia, B., Cuenca Grau, B., Kalyanpur, A., Katz, Y.: Pellet: a practical OWL-DL reasoner. Web Semant. Sci. Serv. Agents World Wide Web 5(2), 51–53 (2007)CrossRef
52.
Zurück zum Zitat Smith, R.: An overview of the tesseract ocr engine. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition—vol. 02, ICDAR ’07, pp. 629–633, IEEE Computer Society, Washington, DC, (2007) Smith, R.: An overview of the tesseract ocr engine. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition—vol. 02, ICDAR ’07, pp. 629–633, IEEE Computer Society, Washington, DC, (2007)
53.
Zurück zum Zitat Stommel, M., Merhej, L. I., Müller, M. G.: Segmentation-free detection of comic panels. In: Computer Vision and Graphics, pp. 633–640. Springer, (2012) Stommel, M., Merhej, L. I., Müller, M. G.: Segmentation-free detection of comic panels. In: Computer Vision and Graphics, pp. 633–640. Springer, (2012)
54.
Zurück zum Zitat Su, C.-Y., Chang, R.-I., Liu, J.-C.: Recognizing text elements for svg comic compression and its novel applications. In: Proceedings of the 11th International Conference on Document Analysis and Recognition, ICDAR ’11, pp. 1329–1333, IEEE Computer Society, Washington, DC, (2011) Su, C.-Y., Chang, R.-I., Liu, J.-C.: Recognizing text elements for svg comic compression and its novel applications. In: Proceedings of the 11th International Conference on Document Analysis and Recognition, ICDAR ’11, pp. 1329–1333, IEEE Computer Society, Washington, DC, (2011)
55.
Zurück zum Zitat Sun, W., Kise, K.: Detection of exact and similar partial copies for copyright protection of manga. Int. J. Doc. Anal. Recognit. (IJDAR) 16(4), 331–349 (2013)CrossRef Sun, W., Kise, K.: Detection of exact and similar partial copies for copyright protection of manga. Int. J. Doc. Anal. Recognit. (IJDAR) 16(4), 331–349 (2013)CrossRef
56.
Zurück zum Zitat Sun, W., Kise, K., Burie, J.-C., Ogier, J.-M.: Specific comic character detection using local feature matching. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR 2013), Washington, USA, (2013) Sun, W., Kise, K., Burie, J.-C., Ogier, J.-M.: Specific comic character detection using local feature matching. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR 2013), Washington, USA, (2013)
57.
Zurück zum Zitat Suzuki, S., et al.: Topological structural analysis of digitized binary images by border following. Comput. Vision Graph. Image Process. 30(1), 32–46 (1985)MATHCrossRef Suzuki, S., et al.: Topological structural analysis of digitized binary images by border following. Comput. Vision Graph. Image Process. 30(1), 32–46 (1985)MATHCrossRef
58.
Zurück zum Zitat Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: IJCAI’07, pp. 2885–2890, (2007) Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: IJCAI’07, pp. 2885–2890, (2007)
59.
Zurück zum Zitat Thomas, E.: Invisible Art, Invisible Planes, Invisible People. Multicultural Comics: From Zap to Blue Beetle. University of Texas Press, Texas (2010) Thomas, E.: Invisible Art, Invisible Planes, Invisible People. Multicultural Comics: From Zap to Blue Beetle. University of Texas Press, Texas (2010)
61.
Zurück zum Zitat Yamada, M., Budiarto, R., Endo, M., Miyazaki, S.: Comic image decomposition for reading comics on cellular phones. IEICE Trans. 87–D(6), 1370–1376 (2004) Yamada, M., Budiarto, R., Endo, M., Miyazaki, S.: Comic image decomposition for reading comics on cellular phones. IEICE Trans. 87–D(6), 1370–1376 (2004)
Metadaten
Titel
Knowledge-driven understanding of images in comic books
verfasst von
Christophe Rigaud
Clément Guérin
Dimosthenis Karatzas
Jean-Christophe Burie
Jean-Marc Ogier
Publikationsdatum
01.09.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 3/2015
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-015-0243-1

Weitere Artikel der Ausgabe 3/2015

International Journal on Document Analysis and Recognition (IJDAR) 3/2015 Zur Ausgabe

Premium Partner