Skip to main content

2014 | OriginalPaper | Buchkapitel

Adaptive Contour Classification of Comics Speech Balloons

verfasst von : Christophe Rigaud, Dimosthenis Karatzas, Jean-Christophe Burie, Jean-Marc Ogier

Erschienen in: Graphics Recognition. Current Trends and Challenges

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Comic books digitization combined with subsequent comic book understanding give rise to a variety of new applications, including content reflowing, mobile reading and multi-modal search. Document understanding in this domain is challenging as comics are semi-structured documents, with semantic information shared between the graphical and textual parts. Speech balloon contour analysis reveals the speech tone which is an essential step towards a fully automatic comics understanding. In this paper we present the first approach for classifying speech balloon in scanned comic books where we separate and analyze their contour variations to classify them as “smooth” (normal speech), “wavy” (thought) or “zigzag” (exclamation). The experiments show a global accuracy classification of 85.2 % on a wide variety of balloons from the eBDtheque dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abbasi, S., Mokhtarian, F., Kittler, J.: Curvature scale space image in shape similarity retrieval. Multimedia Syst. 7(6), 467–476 (1999)CrossRef Abbasi, S., Mokhtarian, F., Kittler, J.: Curvature scale space image in shape similarity retrieval. Multimedia Syst. 7(6), 467–476 (1999)CrossRef
2.
Zurück zum Zitat Arai, K., Tolle, H.: Method for real time text extraction of digital manga comic. Int. J. Image Process. (IJIP) 4(6), 669–676 (2011) Arai, K., Tolle, H.: Method for real time text extraction of digital manga comic. Int. J. Image Process. (IJIP) 4(6), 669–676 (2011)
3.
Zurück zum Zitat Bader, T., Räpple, R., Beyerer, J.: Fast invariant contour-based classification of hand symbols for HCI. In: Jiang, X., Petkov, N. (eds.) CAIP 2009. LNCS, vol. 5702, pp. 689–696. Springer, Heidelberg (2009)CrossRef Bader, T., Räpple, R., Beyerer, J.: Fast invariant contour-based classification of hand symbols for HCI. In: Jiang, X., Petkov, N. (eds.) CAIP 2009. LNCS, vol. 5702, pp. 689–696. Springer, Heidelberg (2009)CrossRef
4.
Zurück zum Zitat Bober, M.: Mpeg-7 visual shape descriptors. IEEE Trans. Circ. Syst. 11(6), 716–719 (2001)CrossRef Bober, M.: Mpeg-7 visual shape descriptors. IEEE Trans. Circ. Syst. 11(6), 716–719 (2001)CrossRef
5.
Zurück zum Zitat Cenkery, C.: Wavelet contour classification. In: Proceedings of the 20th Workshop of the Austrian Association for Pattern Recognition (OAGM/AAPR) on Pattern Recognition, 1996, Leibnitz, Austria, pp. 263–271. R. Oldenbourg Verlag GmbH, Munich, Germany (1996) Cenkery, C.: Wavelet contour classification. In: Proceedings of the 20th Workshop of the Austrian Association for Pattern Recognition (OAGM/AAPR) on Pattern Recognition, 1996, Leibnitz, Austria, pp. 263–271. R. Oldenbourg Verlag GmbH, Munich, Germany (1996)
6.
Zurück zum Zitat Grigoriu, A., Vonwiller, J., King, R.: An automatic intonation tone contour labelling and classification algorithm. In: 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-94, vol. 2, pp. II-181. IEEE (1994) Grigoriu, A., Vonwiller, J., King, R.: An automatic intonation tone contour labelling and classification algorithm. In: 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-94, vol. 2, pp. II-181. IEEE (1994)
7.
Zurück zum Zitat Guérin, C., Rigaud, C., Mercier, A., et al.: eBDtheque: a representative database of comics. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Washington DC (2013) Guérin, C., Rigaud, C., Mercier, A., et al.: eBDtheque: a representative database of comics. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Washington DC (2013)
8.
Zurück zum Zitat Ho, A.K.N., Burie, J.C., Ogier, J.M.: Panel and Speech Balloon Extraction from Comic Books. In: 2012 10th IAPR International Workshop on Document Analysis Systems, pp. 424–428, Mar 2012 Ho, A.K.N., Burie, J.C., Ogier, J.M.: Panel and Speech Balloon Extraction from Comic Books. In: 2012 10th IAPR International Workshop on Document Analysis Systems, pp. 424–428, Mar 2012
9.
Zurück zum Zitat Hu, M.K.: Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8(2), 179–187 (1962)MATHCrossRef Hu, M.K.: Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8(2), 179–187 (1962)MATHCrossRef
10.
Zurück zum Zitat Keogh, E., Wei, L., Xi, X., Hee Lee, S., Vlachos, M.: Lb keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures. In: VLDB, pp. 882–893 (2006) Keogh, E., Wei, L., Xi, X., Hee Lee, S., Vlachos, M.: Lb keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures. In: VLDB, pp. 882–893 (2006)
11.
Zurück zum Zitat Kühne, G., Richter, S., Beier, M.: Motion-based segmentation and contour-based classification of video objects. In: Proceedings of the Ninth ACM International Conference on Multimedia, pp. 41–50. ACM (2001) Kühne, G., Richter, S., Beier, M.: Motion-based segmentation and contour-based classification of video objects. In: Proceedings of the Ninth ACM International Conference on Multimedia, pp. 41–50. ACM (2001)
12.
Zurück zum Zitat Leung, W.H., Chen, T.: Trademark retrieval using contour-skeleton stroke classification. In: Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, ICME’02, vol. 2, pp. 517–520. IEEE (2002) Leung, W.H., Chen, T.: Trademark retrieval using contour-skeleton stroke classification. In: Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, ICME’02, vol. 2, pp. 517–520. IEEE (2002)
13.
Zurück zum Zitat Liu, H.C., Srinath, M.D.: Partial shape classification using contour matching in distance transformation. IEEE Trans. Pattern Anal. Mach. Intell. 12(11), 1072–1079 (1990)CrossRef Liu, H.C., Srinath, M.D.: Partial shape classification using contour matching in distance transformation. IEEE Trans. Pattern Anal. Mach. Intell. 12(11), 1072–1079 (1990)CrossRef
14.
Zurück zum Zitat Lopatka, M., Houten, W.V.: Science and justice automated shape annotation for illicit tablet preparations: a contour angle based classification from digital images. Sci. Justice 53(1), 60–66 (2013)CrossRef Lopatka, M., Houten, W.V.: Science and justice automated shape annotation for illicit tablet preparations: a contour angle based classification from digital images. Sci. Justice 53(1), 60–66 (2013)CrossRef
15.
Zurück zum Zitat Mitchell, T.M.: Mach. Learn., 1st edn. McGraw-Hill Inc., New York (1997) Mitchell, T.M.: Mach. Learn., 1st edn. McGraw-Hill Inc., New York (1997)
17.
Zurück zum Zitat Mukundan, R., Ramakrishnan, K.: Moment Functions in Image Analysis: Theory and Applications, vol. 100. World Scientific, Singapore (1998)MATHCrossRef Mukundan, R., Ramakrishnan, K.: Moment Functions in Image Analysis: Theory and Applications, vol. 100. World Scientific, Singapore (1998)MATHCrossRef
18.
Zurück zum Zitat Richter, S., Kühne, G., Schuster, O.: Contour-based classification of video objects. In: Proceedings of SPIE, vol. 4315, p. 608 (2001) Richter, S., Kühne, G., Schuster, O.: Contour-based classification of video objects. In: Proceedings of SPIE, vol. 4315, p. 608 (2001)
19.
Zurück zum Zitat Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.C., Ogier, J.M.: An active contour model for speech balloon detection in comics. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR). IEEE (2013) Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.C., Ogier, J.M.: An active contour model for speech balloon detection in comics. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR). IEEE (2013)
20.
Zurück zum Zitat Sun, K.B., Super, B.J.: Classification of contour shapes using class segment sets. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 2, pp. 727–733. IEEE (2005) Sun, K.B., Super, B.J.: Classification of contour shapes using class segment sets. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 2, pp. 727–733. IEEE (2005)
21.
Zurück zum Zitat Veltkamp, R.C., Tanase, M.: Content-based image retrieval systems: A survey. Technical report (2000) Veltkamp, R.C., Tanase, M.: Content-based image retrieval systems: A survey. Technical report (2000)
22.
Zurück zum Zitat Wang, Z., Chi, Z., Feng, D.: Shape based leaf image retrieval. IEEE Proc. Vis. Image Signal Process. 150(1), 34–43 (2003)CrossRef Wang, Z., Chi, Z., Feng, D.: Shape based leaf image retrieval. IEEE Proc. Vis. Image Signal Process. 150(1), 34–43 (2003)CrossRef
23.
Zurück zum Zitat Zahn, C.T., Roskies, R.Z.: Fourier descriptors for plane closed curves. IEEE Trans. Comput. c–21(3), 269–281 (1972)MathSciNetCrossRef Zahn, C.T., Roskies, R.Z.: Fourier descriptors for plane closed curves. IEEE Trans. Comput. c–21(3), 269–281 (1972)MathSciNetCrossRef
24.
Zurück zum Zitat Zhang, D., Lu, G.: Review of shape representation and description techniques. PR 37(1), 1–19 (2004)MATHCrossRef Zhang, D., Lu, G.: Review of shape representation and description techniques. PR 37(1), 1–19 (2004)MATHCrossRef
Metadaten
Titel
Adaptive Contour Classification of Comics Speech Balloons
verfasst von
Christophe Rigaud
Dimosthenis Karatzas
Jean-Christophe Burie
Jean-Marc Ogier
Copyright-Jahr
2014
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-44854-0_5