Skip to main content

2014 | OriginalPaper | Buchkapitel

Dewarping Book Page Spreads Captured with a Mobile Phone Camera

verfasst von : Chelhwon Kim, Patrick Chiu, Surendar Chandra

Erschienen in: Camera-Based Document Analysis and Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Capturing book images is more convenient with a mobile phone camera than with more specialized flat-bed scanners or 3D capture devices. We built an application for the iPhone 4S that captures a sequence of hi-res (8 MP) images of a page spread as the user sweeps the device across the book. To do the 3D dewarping, we implemented two algorithms: optical flow (OF) and structure from motion (SfM). Making further use of the image sequence, we examined the potential of multi-frame OCR. Preliminary evaluation on a small set of data shows that OF and SfM had comparable OCR performance for both single-frame and multi-frame techniques, and that multi-frame was substantially better than single-frame. The computation time was much less for OF than for SfM.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Beardsley, P., Zisserman, A., Murray, D.: Sequential updating of projective and affine structure from motion. Intl. J. Comput. Vision 23(3), 235–259 (1997)CrossRef Beardsley, P., Zisserman, A., Murray, D.: Sequential updating of projective and affine structure from motion. Intl. J. Comput. Vision 23(3), 235–259 (1997)CrossRef
2.
Zurück zum Zitat Bukhari, S.S., Shafait, F., Breuel, T.M.: Border noise removal of camera-captured document images using page frame detection. In: Iwamura, M., Shafait, F. (eds.) CBDAR 2011. LNCS, vol. 7139, pp. 126–137. Springer, Heidelberg (2012) Bukhari, S.S., Shafait, F., Breuel, T.M.: Border noise removal of camera-captured document images using page frame detection. In: Iwamura, M., Shafait, F. (eds.) CBDAR 2011. LNCS, vol. 7139, pp. 126–137. Springer, Heidelberg (2012)
3.
Zurück zum Zitat Bradski, G.: The OpenCV Library. Dr. Dobb’s Journal of Software Tools (2000) Bradski, G.: The OpenCV Library. Dr. Dobb’s Journal of Software Tools (2000)
4.
Zurück zum Zitat Brown, M., Seales, W.: Image restoration of arbitrarily warped documents. IEEE TPAMI 26, 1295–1306 (2004)CrossRef Brown, M., Seales, W.: Image restoration of arbitrarily warped documents. IEEE TPAMI 26, 1295–1306 (2004)CrossRef
5.
Zurück zum Zitat Brown, M., Tsoi, Y.-C.: Geometric and shading correction for images of printed materials using boundary. IEEE Trans. Image Process. 15, 1544–1554 (2006)CrossRef Brown, M., Tsoi, Y.-C.: Geometric and shading correction for images of printed materials using boundary. IEEE Trans. Image Process. 15, 1544–1554 (2006)CrossRef
6.
Zurück zum Zitat Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: a model based approach. In: Proceedings of ICDAR 2003, pp. 71–75 (2003) Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: a model based approach. In: Proceedings of ICDAR 2003, pp. 71–75 (2003)
7.
Zurück zum Zitat Cutter, M., Chiu, P.: Capture and dewarping of page spreads with a handheld compact 3D camera. In: Proceedings of DAS 2012, pp. 205–209 (2012) Cutter, M., Chiu, P.: Capture and dewarping of page spreads with a handheld compact 3D camera. In: Proceedings of DAS 2012, pp. 205–209 (2012)
8.
Zurück zum Zitat Fu, B., Wu, M., Li, R., Li, W., Xu, Z., Yang, C.: A model-based book dewarping method using text line detection. In: Proceedings of CBDAR 2007, pp. 63–70 (2007) Fu, B., Wu, M., Li, R., Li, W., Xu, Z., Yang, C.: A model-based book dewarping method using text line detection. In: Proceedings of CBDAR 2007, pp. 63–70 (2007)
9.
Zurück zum Zitat Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE TPAMI 30, 591–605 (2008)CrossRef Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE TPAMI 30, 591–605 (2008)CrossRef
10.
Zurück zum Zitat Nakajima, N., Iketani, A., Sato, T., Ikeda, S., Kanbara, M., Yokoya, N.: Video mosaicing for document imaging. In: Proceedings of CBDAR 2007, pp. 171–178 (2007) Nakajima, N., Iketani, A., Sato, T., Ikeda, S., Kanbara, M., Yokoya, N.: Video mosaicing for document imaging. In: Proceedings of CBDAR 2007, pp. 171–178 (2007)
11.
Zurück zum Zitat Newman, W., Dance, C., Taylor, A., Taylor, S., Taylor, M., Aldhous, T.: CamWorks: a video-based tool for efficient capture from paper source documents. In: Proceedings of International Conference on Multimedia Computing and Systems, ICMCS 1999, pp 647–653 (1999) Newman, W., Dance, C., Taylor, A., Taylor, S., Taylor, M., Aldhous, T.: CamWorks: a video-based tool for efficient capture from paper source documents. In: Proceedings of International Conference on Multimedia Computing and Systems, ICMCS 1999, pp 647–653 (1999)
12.
Zurück zum Zitat Peng, X., Cao, H., Subramanian, K., Prasad, R., Natarajan, P.: Automated image quality assessment for camera-captured OCR. In: Proceedings of ICIP 2011, pp. 2669–2672 (2011) Peng, X., Cao, H., Subramanian, K., Prasad, R., Natarajan, P.: Automated image quality assessment for camera-captured OCR. In: Proceedings of ICIP 2011, pp. 2669–2672 (2011)
13.
Zurück zum Zitat Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. In: Proceedings of Siggraph 2004, pp. 309–314 (2004) Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. In: Proceedings of Siggraph 2004, pp. 309–314 (2004)
14.
Zurück zum Zitat Shafait, F., Breuel, T.: Document image dewarping contest, CBDAR 2007 Shafait, F., Breuel, T.: Document image dewarping contest, CBDAR 2007
15.
Zurück zum Zitat Shafait, F., Cutter, M., van Beusekom, J., Bukhari, S., Breuel, T.: Decapod: a flexible, low cost digitization solution for small and medium archives. In: Proceedings of CBDAR 2011, pp. 41–46 (2011) Shafait, F., Cutter, M., van Beusekom, J., Bukhari, S., Breuel, T.: Decapod: a flexible, low cost digitization solution for small and medium archives. In: Proceedings of CBDAR 2011, pp. 41–46 (2011)
16.
Zurück zum Zitat Shi, J., Tomasi, C.: Good features to track. In: Proceedings of CVPR 1994, pp. 593–600 (1994) Shi, J., Tomasi, C.: Good features to track. In: Proceedings of CVPR 1994, pp. 593–600 (1994)
17.
Zurück zum Zitat Szeliski, R.: Computer Vision: Algorithms and Applications. Springer, New York (2010) Szeliski, R.: Computer Vision: Algorithms and Applications. Springer, New York (2010)
18.
Zurück zum Zitat Taylor, M., Dance, C.: Enhancement of document images from cameras. In: SPIE Conference on Document Recognition V, vol. 3305, 230–241 (1998) Taylor, M., Dance, C.: Enhancement of document images from cameras. In: SPIE Conference on Document Recognition V, vol. 3305, 230–241 (1998)
20.
Zurück zum Zitat Triggs, B., McLauchlan, P., Hartley, R., Fitzgibbon, A.: Bundle adjustment a modern synthesis. In: Proceedings of ICCV 1999, pp. 298–372 (1999) Triggs, B., McLauchlan, P., Hartley, R., Fitzgibbon, A.: Bundle adjustment a modern synthesis. In: Proceedings of ICCV 1999, pp. 298–372 (1999)
21.
Zurück zum Zitat Xu, Li, Jia, Jiaya: Two-phase kernel estimation for robust motion deblurring. In: Daniilidis, Kostas, Maragos, Petros, Paragios, Nikos (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 157–170. Springer, Heidelberg (2010) Xu, Li, Jia, Jiaya: Two-phase kernel estimation for robust motion deblurring. In: Daniilidis, Kostas, Maragos, Petros, Paragios, Nikos (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 157–170. Springer, Heidelberg (2010)
Metadaten
Titel
Dewarping Book Page Spreads Captured with a Mobile Phone Camera
verfasst von
Chelhwon Kim
Patrick Chiu
Surendar Chandra
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-05167-3_8