Skip to main content

2020 | OriginalPaper | Buchkapitel

Automated Classifier Development Process for Recognizing Book Pages from Video Frames

verfasst von : Adam Brzeski, Jan Cychnerski, Karol Draszawka, Krystyna Dziubich, Tomasz Dziubich, Waldemar Korłub, Paweł Rościszewski

Erschienen in: ADBIS, TPDL and EDA 2020 Common Workshops and Doctoral Consortium

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

One of the latest developments made by publishing companies is introducing mixed and augmented reality to their printed media (e.g. to produce augmented books). An important computer vision problem that they are facing is classification of book pages from video frames. The problem is non-trivial, especially considering that typical training data is limited to only one digital original per book page, while the trained classifier should be suitable for real-time utilization on mobile devices, where camera can be exposed to highly diverse conditions and computing resources are limited. In this paper we address this problem by proposing an automated classifier development process that allows training classification models that run real-time, with high usability, on low-end mobile devices and achieve average accuracy of 88.95% on our in-house developed test set consisting of over 20 000 frames from real videos of 5 books for children. At the same time, deployment tests reveal that the classifier development process time is reduced approximately 16-fold.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Hull, J.J., et al.: Paper-based augmented reality. In: 17th International Conference on Artificial Reality and Telexistence (ICAT 2007), Denmark, November 2007, pp. 205–209. IEEE (2007) Hull, J.J., et al.: Paper-based augmented reality. In: 17th International Conference on Artificial Reality and Telexistence (ICAT 2007), Denmark, November 2007, pp. 205–209. IEEE (2007)
3.
Zurück zum Zitat Fujinami, K., Inagawa, N.: Page-flipping detection and information presentation for implicit interaction with a book. Int. J. Multimed. Ubiquitous Eng. 4(3), 20 (2009) Fujinami, K., Inagawa, N.: Page-flipping detection and information presentation for implicit interaction with a book. Int. J. Multimed. Ubiquitous Eng. 4(3), 20 (2009)
4.
Zurück zum Zitat Back, M., Cohen, J., Gold, R., Harrison, S., Minneman, S.: Listen reader: an electronically augmented paper-based book. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems - CHI 2001, Seattle, Washington, USA, pp. 23–29. ACM Press (2001) Back, M., Cohen, J., Gold, R., Harrison, S., Minneman, S.: Listen reader: an electronically augmented paper-based book. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems - CHI 2001, Seattle, Washington, USA, pp. 23–29. ACM Press (2001)
5.
Zurück zum Zitat Garris, M.D., Creating and validating a large image database for METTREC. Technical report NIST IR 6090, National Institute of Standards and Technology, Gaithersburg, MD (1997) Garris, M.D., Creating and validating a large image database for METTREC. Technical report NIST IR 6090, National Institute of Standards and Technology, Gaithersburg, MD (1997)
6.
Zurück zum Zitat Chakraborty, D., Roy, P.P., Alvarez, J.M., Pal, U.: Duplicate open page removal from video stream of book flipping. In: 2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), Jodhpur, India, December 2013, pp. 1–4. IEEE (2013) Chakraborty, D., Roy, P.P., Alvarez, J.M., Pal, U.: Duplicate open page removal from video stream of book flipping. In: 2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), Jodhpur, India, December 2013, pp. 1–4. IEEE (2013)
8.
Zurück zum Zitat Jang, S.-W., Ko, J., Lee, H.J., Kim, Y.S.: A study on tracking and augmentation in mobile AR for e-Leisure. Mobile Inf. Syst. 2018, 1–11 (2018)CrossRef Jang, S.-W., Ko, J., Lee, H.J., Kim, Y.S.: A study on tracking and augmentation in mobile AR for e-Leisure. Mobile Inf. Syst. 2018, 1–11 (2018)CrossRef
9.
10.
Zurück zum Zitat Wikitude GmbH. Developer’s Guide (2020) Wikitude GmbH. Developer’s Guide (2020)
Metadaten
Titel
Automated Classifier Development Process for Recognizing Book Pages from Video Frames
verfasst von
Adam Brzeski
Jan Cychnerski
Karol Draszawka
Krystyna Dziubich
Tomasz Dziubich
Waldemar Korłub
Paweł Rościszewski
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-55814-7_14

Premium Partner