Skip to main content

2014 | OriginalPaper | Buchkapitel

Mobile Phone Camera-Based Video Scanning of Paper Documents

verfasst von : Muhammad Muzzamil Luqman, Petra Gomez-Krämer, Jean-Marc Ogier

Erschienen in: Camera-Based Document Analysis and Recognition

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Mobile phone camera-based document video scanning is an interesting research problem which has entered into a new era with the emergence of widely used, processing capable and motion sensors equipped smartphones. We present our ongoing research on mobile phone camera-based document image mosaic reconstruction method for video scanning of paper documents. In this work, we have optimized the classic keypoint feature descriptor-based image registration method, by employing the accelerometer and gyroscope sensor data. Experimental results are evaluated using optical character recognition (OCR) on the reconstructed mosaic from mobile phone camera-based video scanning of paper documents.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: International Conference on Computer Vision and Pattern Recognition, pp. 510–517 (2012) Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: International Conference on Computer Vision and Pattern Recognition, pp. 510–517 (2012)
2.
Zurück zum Zitat Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006) Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)
3.
Zurück zum Zitat Fischler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)CrossRefMathSciNet Fischler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)CrossRefMathSciNet
4.
Zurück zum Zitat Hannuksela, J., Sangi, P., Heikkila, J., Liu, X., Doermann, D.: Document image mosaicing with mobile phones. In: International Conference on Image Analysis and Processing, pp. 575–582 (2007) Hannuksela, J., Sangi, P., Heikkila, J., Liu, X., Doermann, D.: Document image mosaicing with mobile phones. In: International Conference on Image Analysis and Processing, pp. 575–582 (2007)
5.
Zurück zum Zitat Jagannathan, L., Jawahar, C.: Perspective correction methods for camera based document analysis. In: International Workshop on Camera-Based Document Analysis and Recognition, pp. 148–154 (2005) Jagannathan, L., Jawahar, C.: Perspective correction methods for camera based document analysis. In: International Workshop on Camera-Based Document Analysis and Recognition, pp. 148–154 (2005)
6.
Zurück zum Zitat Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Sov. Phys. Dokl. 10(8), 707–710 (1966)MathSciNet Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Sov. Phys. Dokl. 10(8), 707–710 (1966)MathSciNet
7.
Zurück zum Zitat Liang, J., DeMenthon, D., Doermann, D.: Mosaicing of camera-captured document images. Comput. Vis. Image Underst. 113(4), 572–579 (2009)CrossRef Liang, J., DeMenthon, D., Doermann, D.: Mosaicing of camera-captured document images. Comput. Vis. Image Underst. 113(4), 572–579 (2009)CrossRef
8.
Zurück zum Zitat Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Int. J. Doc. Anal. Recogn. 7(2–3), 84–104 (2005)CrossRef Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Int. J. Doc. Anal. Recogn. 7(2–3), 84–104 (2005)CrossRef
9.
Zurück zum Zitat Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
10.
Zurück zum Zitat Muja, M., Lowe, D.: Fast approximate nearest neighbors with automatic algorithm configuration. In: International Conference on Computer Vision Theory and Applications, pp. 331–340 (2009) Muja, M., Lowe, D.: Fast approximate nearest neighbors with automatic algorithm configuration. In: International Conference on Computer Vision Theory and Applications, pp. 331–340 (2009)
11.
Zurück zum Zitat Nakai, T., Kise, K., Iwamura, M.: Camera-based document image mosaicing using LLAH. In: Document Recognition and Retrieval XVI, pp. 1–10 (2009) Nakai, T., Kise, K., Iwamura, M.: Camera-based document image mosaicing using LLAH. In: Document Recognition and Retrieval XVI, pp. 1–10 (2009)
12.
Zurück zum Zitat Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006) Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
13.
Zurück zum Zitat Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2011) Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2011)
14.
Zurück zum Zitat Sawhney, H.S., Hsu, S., Kumar, R.: Robust video mosaicing through topology inference and local to global alignment. In: Burkhardt, H., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1407, p. 103. Springer, Heidelberg (1998) Sawhney, H.S., Hsu, S., Kumar, R.: Robust video mosaicing through topology inference and local to global alignment. In: Burkhardt, H., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1407, p. 103. Springer, Heidelberg (1998)
15.
Zurück zum Zitat Szeliski, R.: Image alignment and stitching. In: Handbook of Mathematical Models in Computer Vision, pp. 273–292. Springer (2006) Szeliski, R.: Image alignment and stitching. In: Handbook of Mathematical Models in Computer Vision, pp. 273–292. Springer (2006)
16.
Zurück zum Zitat Woodman, O.J.: An introduction to inertial navigation. Technical report 696, University of Cambridge, Computer Laboratory, Cambridge (2007) Woodman, O.J.: An introduction to inertial navigation. Technical report 696, University of Cambridge, Computer Laboratory, Cambridge (2007)
17.
Zurück zum Zitat Yang, Q., Wang, C., Gao, Y., Qu, H., Chang, E.: Inertial sensors aided image alignment and stitching for panorama on mobile phones. In: International Workshop on Mobile Location-Based Service, pp. 21–30 (2011) Yang, Q., Wang, C., Gao, Y., Qu, H., Chang, E.: Inertial sensors aided image alignment and stitching for panorama on mobile phones. In: International Workshop on Mobile Location-Based Service, pp. 21–30 (2011)
Metadaten
Titel
Mobile Phone Camera-Based Video Scanning of Paper Documents
verfasst von
Muhammad Muzzamil Luqman
Petra Gomez-Krämer
Jean-Marc Ogier
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-05167-3_13