Skip to main content
Top

2016 | OriginalPaper | Chapter

Direct Unsupervised Text Line Extraction from Colored Historical Manuscript Images Using DCT

Authors : Asim Baig, Somaya Al-Maadeed, Ahmed Bouridane, Mohamed Cheriet

Published in: Image Analysis and Recognition

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Extracting lines of text from a manuscript is an important preprocessing step in many digital paleography applications. These extracted lines play a fundamental part in the identification of the author and/or age of the manuscript. In this paper we present an unsupervised approach to text line extraction in historical manuscripts that can be applied directly to a color manuscript image. Each of the red, green and blue channels are processed separately by applying DCT on them individually. One of the key advantages of this approach is that it can be applied directly to the manuscript image without any preprocessing, training or tuning steps. Extensive testing on complex Arabic handwritten manuscripts shows the effectiveness of the proposed approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Shahab, S.A., Al-Khatib, W.G., Mahmoud, S.A.: Computer Aided Indexing of Historical Manuscripts. In: International Conference on Computer Graphics, Imaging and Visualisation, Sydney, 2006, pp. 287–295. doi:10.1109/CGIV.2006.31 Shahab, S.A., Al-Khatib, W.G., Mahmoud, S.A.: Computer Aided Indexing of Historical Manuscripts. In: International Conference on Computer Graphics, Imaging and Visualisation, Sydney, 2006, pp. 287–295. doi:10.​1109/​CGIV.​2006.​31
2.
go back to reference Fiel, S., Hollaus, F., Gau, M., Sablatnig, R.: Writer identification on historical Glagolitic documents. In: SPIE Proceedings on Document Recognition and Retrieval XXI, p. 902102 (2013). doi:10.1117/12.2042338 Fiel, S., Hollaus, F., Gau, M., Sablatnig, R.: Writer identification on historical Glagolitic documents. In: SPIE Proceedings on Document Recognition and Retrieval XXI, p. 902102 (2013). doi:10.​1117/​12.​2042338
3.
go back to reference He, S., Sammara, P., Burgers, J., Schomaker, L.: Towards style-based dating of historical documents. In: 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), Heraklion, pp. 265–270 (2014). doi:10.1109/ICFHR.2014.52 He, S., Sammara, P., Burgers, J., Schomaker, L.: Towards style-based dating of historical documents. In: 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), Heraklion, pp. 265–270 (2014). doi:10.​1109/​ICFHR.​2014.​52
4.
go back to reference Antonacopoulos, A., Clausner, C., Papadopoulous, C., Pletschacher, S.: Historical document layout analysis competition. In: IEEE International Conference on Document Analysis and Recognition (ICDAR), pp. 1516–1520 (2011) Antonacopoulos, A., Clausner, C., Papadopoulous, C., Pletschacher, S.: Historical document layout analysis competition. In: IEEE International Conference on Document Analysis and Recognition (ICDAR), pp. 1516–1520 (2011)
5.
go back to reference Antonacopoulos, A., Clausner, C., Papadopoulous, C., Pletschacher, S.: ICDAR 2013 competition on Historical Newspaper Layout Analysis (HNLA 2013). In: IEEE International Conference on Document Analysis and Recognition (ICDAR), pp. 1454–1458 (2013) Antonacopoulos, A., Clausner, C., Papadopoulous, C., Pletschacher, S.: ICDAR 2013 competition on Historical Newspaper Layout Analysis (HNLA 2013). In: IEEE International Conference on Document Analysis and Recognition (ICDAR), pp. 1454–1458 (2013)
7.
go back to reference Liwicki, M., Indermuhle, E., Bunke, H.: On-line handwritten text line detection using dynamic programming. In: International Conference on Document Analysis and Recognition, vol. 1, pp. 447–451 (2007) Liwicki, M., Indermuhle, E., Bunke, H.: On-line handwritten text line detection using dynamic programming. In: International Conference on Document Analysis and Recognition, vol. 1, pp. 447–451 (2007)
8.
go back to reference Fischer, A., Wuthrich, M., Liwicki, M., Frinken, V., Bunke, H., Viehhauser, G., Stolz, M.: Automatic transcription of handwritten medieval documents. In: International Conference on Virtual Systems and Multimedia, pp. 137–142 (2009) Fischer, A., Wuthrich, M., Liwicki, M., Frinken, V., Bunke, H., Viehhauser, G., Stolz, M.: Automatic transcription of handwritten medieval documents. In: International Conference on Virtual Systems and Multimedia, pp. 137–142 (2009)
9.
go back to reference Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Ground truth creation for handwriting recognition in historical documents. In: IAPR International Workshop on Document Analysis Systems, pp. 3–10 (2010) Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., Stolz, M.: Ground truth creation for handwriting recognition in historical documents. In: IAPR International Workshop on Document Analysis Systems, pp. 3–10 (2010)
10.
go back to reference Bulacu, M., van Koert, R., Schomaker, L., van der Zant, T.: Layout analysis of handwritten historical documents for searching the archive of the cabinet of the Dutch Queen. In: International Conference on Document Analysis and Recognition, pp. 357–361 (2007) Bulacu, M., van Koert, R., Schomaker, L., van der Zant, T.: Layout analysis of handwritten historical documents for searching the archive of the cabinet of the Dutch Queen. In: International Conference on Document Analysis and Recognition, pp. 357–361 (2007)
11.
go back to reference Marti, U.V., Bunke, H.: Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition system. Int. J. Pattern Recogn. Artif. Intell. 15(01), 65–90 (2001)CrossRef Marti, U.V., Bunke, H.: Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition system. Int. J. Pattern Recogn. Artif. Intell. 15(01), 65–90 (2001)CrossRef
12.
go back to reference Louloudis, G., Gatos, B., Pratikakis, I., Halatsis, C.: Text line detection in handwritten documents. Pattern Recogn. 41(12), 3758–3772 (2008)CrossRefMATH Louloudis, G., Gatos, B., Pratikakis, I., Halatsis, C.: Text line detection in handwritten documents. Pattern Recogn. 41(12), 3758–3772 (2008)CrossRefMATH
13.
go back to reference Likforman-Sulem, L., Hanimyan, A., Faure, C.: A Hough based algorithm for extracting text lines in handwritten documents. In: International Conference on Document Analysis and Recognition, vol. 2, pp. 774–777 (1995) Likforman-Sulem, L., Hanimyan, A., Faure, C.: A Hough based algorithm for extracting text lines in handwritten documents. In: International Conference on Document Analysis and Recognition, vol. 2, pp. 774–777 (1995)
14.
go back to reference Nikolaou, N., Makridis, M., Gatos, B., Stamatopoulos, N., Papamarkos, N.: Segmentation of historical machine-printed documents using adaptive run length smoothing and skeleton segmentation paths. Image Vis. Comput. 28(4), 590–604 (2010)CrossRef Nikolaou, N., Makridis, M., Gatos, B., Stamatopoulos, N., Papamarkos, N.: Segmentation of historical machine-printed documents using adaptive run length smoothing and skeleton segmentation paths. Image Vis. Comput. 28(4), 590–604 (2010)CrossRef
15.
go back to reference Arvanitopoulos, N., Susstrunk, S.: Seam carving for text line extraction on color and grayscale historical manuscripts. In: Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR, pp. 726–731, December 2014 Arvanitopoulos, N., Susstrunk, S.: Seam carving for text line extraction on color and grayscale historical manuscripts. In: Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR, pp. 726–731, December 2014
16.
go back to reference Arvanitopoulos, N., Susstrunk, S.: Seam carving for text line extraction on color and grayscale historical manuscripts. In: Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR, pp. 726–731, December 2014 Arvanitopoulos, N., Susstrunk, S.: Seam carving for text line extraction on color and grayscale historical manuscripts. In: Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR, pp. 726–731, December 2014
17.
go back to reference Alaql, O., Lu, C.C.: Text line extraction for historical document images using steerable directional filters. In: 2014 International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, 2014, pp. 312–317. doi:10.1109/ICALIP.2014.7009807 Alaql, O., Lu, C.C.: Text line extraction for historical document images using steerable directional filters. In: 2014 International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, 2014, pp. 312–317. doi:10.​1109/​ICALIP.​2014.​7009807
21.
go back to reference Hung, A.C., Meng, T.H.-Y.: A comparison of fast DCT algorithms. Multimed. Syst. 2(5), 204–217 (1994)CrossRef Hung, A.C., Meng, T.H.-Y.: A comparison of fast DCT algorithms. Multimed. Syst. 2(5), 204–217 (1994)CrossRef
22.
go back to reference Haque, M.A.: A two-dimensional fast cosine transform. IEEE Trans. Acoust. Speech Signal Process. ASSP-33, 1532–1539 (1985)CrossRefMATH Haque, M.A.: A two-dimensional fast cosine transform. IEEE Trans. Acoust. Speech Signal Process. ASSP-33, 1532–1539 (1985)CrossRefMATH
Metadata
Title
Direct Unsupervised Text Line Extraction from Colored Historical Manuscript Images Using DCT
Authors
Asim Baig
Somaya Al-Maadeed
Ahmed Bouridane
Mohamed Cheriet
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-41501-7_84

Premium Partner