2022 | OriginalPaper | Book chapter

A Digitization Pipeline for Mixed-Typed Documents Using Machine Learning and Optical Character Recognition

Authors: Tizian Matschak, Florian Rampold, Malte Hellmeier, Christoph Prinz, Simon Trang

Published in: The Transdisciplinary Reach of Design Science Research

Publisher: Springer International Publishing

Abstract

Although digitization is advancing rapidly, a large amount of the data processed by companies is still in printed form. Technologies such as Optical Character Recognition (OCR) support the transformation of printed text into machine-readable content. However, OCR struggles when the data on a document is highly unstructured and includes non-text objects. This applies, for example, to documents such as medical prescriptions. Leveraging Design Science Research (DSR), we propose a flexible processing pipeline that handles both character recognition and object detection. To do so, we derive Design Requirements (DR) in cooperation with a practitioner doing prescription billing in the healthcare domain. We then develop a prototype blueprint that is applicable to similar problem formulations. Overall, we contribute to research and practice in three ways. First, we provide evidence for selected OCR methods provided by previous research. Second, we design a machine-learning-based digitization pipeline for printed documents containing both text and non-text objects in the context of medical prescriptions. Third, we derive nascent design patterns for this type of document digitization. These patterns are the foundation for further research and can support the development of innovative information systems, leading to more efficient decision making and thus to more economical resource usage.

Metadata
Title
A Digitization Pipeline for Mixed-Typed Documents Using Machine Learning and Optical Character Recognition
Authors
Tizian Matschak
Florian Rampold
Malte Hellmeier
Christoph Prinz
Simon Trang
Copyright year
2022
DOI
https://doi.org/10.1007/978-3-031-06516-3_15
