Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 4/2013

01.12.2013 | Original Paper

Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models

verfasst von: Sherif Abdel Azeem, Hany Ahmed

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we present a novel segmentation-free Arabic handwriting recognition system based on hidden Markov model (HMM). Two main contributions are introduced: a new technique for dividing the image into nonuniform horizontal segments to extract the features and a new technique for solving the problems of the skewing of characters by fusing multiple HMMs. Moreover, two enhancements are introduced: the pre-processing method and feature extraction using concavity space. The proposed system first pre-processes the input image by setting the thickness of the input word to three pixels and fixing the spacing between the different parts of the word. The input image is divided into constant number of nonuniform horizontal segments depending on the distribution of the foreground pixels. A set of robust features representing the gradient of the foreground pixels is extracted using sliding windows. The input image is decomposed into several images representing the vertical, horizontal, left diagonal and right diagonal edges in the image. A set of robust features representing the densities of the foreground pixels in the various edge images is extracted using sliding windows. The proposed system builds character HMM models and learns word HMM models using embedded training. Besides the vertical sliding window, two slanted sliding windows are used to extract the features. Three different HMMs are used: one for the vertical sliding window and two for the slanted windows. A fusion scheme is used to combine the three HMMs. The proposed system is very promising and outperforms all the other Arabic handwriting recognition systems reported in the literature.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Al-Hajj, R., Likforman-Sulem, L., Mokbel, C.: Arabic handwriting recognition using baseline dependant features and hidden Markov modeling. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR’05) (2005) Al-Hajj, R., Likforman-Sulem, L., Mokbel, C.: Arabic handwriting recognition using baseline dependant features and hidden Markov modeling. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR’05) (2005)
2.
Zurück zum Zitat Al-Hajj, R., Likforman-Sulem, L., Mokbel, C.: Combination of HMM-based classifiers for the recognition of arabic handwritten words. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR’07) (2007) Al-Hajj, R., Likforman-Sulem, L., Mokbel, C.: Combination of HMM-based classifiers for the recognition of arabic handwritten words. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR’07) (2007)
3.
Zurück zum Zitat Al-Hajj, R., Likforman-Sulem, L., Mokbel, C.: Combining slanted-frame classifiers for improved HMM-based arabic handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(7), 1165–1177 (2009) Al-Hajj, R., Likforman-Sulem, L., Mokbel, C.: Combining slanted-frame classifiers for improved HMM-based arabic handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(7), 1165–1177 (2009)
4.
Zurück zum Zitat AlKhateeb, J.H., Ren, J., Jiang, J., Al-Muhtaseb, H.: Offline handwritten Arabic cursive text recognition using hidden Markov models and re-ranking. Pattern Recognit. Lett. 32, 8 (2011) AlKhateeb, J.H., Ren, J., Jiang, J., Al-Muhtaseb, H.: Offline handwritten Arabic cursive text recognition using hidden Markov models and re-ranking. Pattern Recognit. Lett. 32, 8 (2011)
5.
Zurück zum Zitat Benouareth, A., Ennaji, A., Sellami, M.: HMMs with explicit state duration applied to handwritten Arabic word recognition. In: Proceeding of 18th International Conference Pattern Recognition (ICPR) (2006) Benouareth, A., Ennaji, A., Sellami, M.: HMMs with explicit state duration applied to handwritten Arabic word recognition. In: Proceeding of 18th International Conference Pattern Recognition (ICPR) (2006)
6.
Zurück zum Zitat Benouareth, A., Ennaji, A., Sellami, M.: Semi-continuous HMMs with explicit state duration for unconstrained arabic word modeling and recognition. Pattern Recognit. Lett. 29, 1742–1752 (2008)CrossRef Benouareth, A., Ennaji, A., Sellami, M.: Semi-continuous HMMs with explicit state duration for unconstrained arabic word modeling and recognition. Pattern Recognit. Lett. 29, 1742–1752 (2008)CrossRef
7.
Zurück zum Zitat Bianne-Bernard, A.-L., Menasri, F., Al-Hajj Mohamad, R., Mokbel, C., Kermorvant, C., Likforman-Sulem, L.: Dynamic and contextual information in HMM modeling for handwritten word recognition. IEEE Trans. Pattern Anal. Mach. Intell. 33(10), 2066–2080 (2011) Bianne-Bernard, A.-L., Menasri, F., Al-Hajj Mohamad, R., Mokbel, C., Kermorvant, C., Likforman-Sulem, L.: Dynamic and contextual information in HMM modeling for handwritten word recognition. IEEE Trans. Pattern Anal. Mach. Intell. 33(10), 2066–2080 (2011)
8.
Zurück zum Zitat Dreuw, P., Jonas, S., Ney, H.: White-space models for offline Arabic handwriting recognition. In: Proceeding of 19th Int. Conf. Pattern Recognition (ICPR) (2008) Dreuw, P., Jonas, S., Ney, H.: White-space models for offline Arabic handwriting recognition. In: Proceeding of 19th Int. Conf. Pattern Recognition (ICPR) (2008)
9.
Zurück zum Zitat El Abed, H., Märgner, V.: Comparison of different preprocessing and feature extraction methods for offline recognition of handwritten Arabic words. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR’07) (2007) El Abed, H., Märgner, V.: Comparison of different preprocessing and feature extraction methods for offline recognition of handwritten Arabic words. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR’07) (2007)
10.
Zurück zum Zitat Elbaati, A., Boubaker, H., Kherallah, M., Alimi, A.M., Ennaji, A., El Abed, H.: Arabic handwriting recognition using restored stroke chronology. In: Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 411–415, July (2009) Elbaati, A., Boubaker, H., Kherallah, M., Alimi, A.M., Ennaji, A., El Abed, H.: Arabic handwriting recognition using restored stroke chronology. In: Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 411–415, July (2009)
11.
Zurück zum Zitat Gatos, B., Pratikakis, I., Kesidis, A.L., Perantonis, S.J.: Efficient off-line cursive handwriting word recognition. In: Proceedings of the Tenth International Workshop on Frontiers in Handwriting Recognition, Oct. La Baule (2006) Gatos, B., Pratikakis, I., Kesidis, A.L., Perantonis, S.J.: Efficient off-line cursive handwriting word recognition. In: Proceedings of the Tenth International Workshop on Frontiers in Handwriting Recognition, Oct. La Baule (2006)
12.
Zurück zum Zitat Gonzales, R.C., Woods, R.E.: Digital Image Processing, vol. 2. Addison-Wesley, Reading, MA (2002) Gonzales, R.C., Woods, R.E.: Digital Image Processing, vol. 2. Addison-Wesley, Reading, MA (2002)
13.
Zurück zum Zitat Hamdani, M., El Abed, H., Kherallah, M., Alimi Adel, M.: Combining multiple HMMs using online and offline features for offline Arabic handwriting recognition. In: Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR) (2009) Hamdani, M., El Abed, H., Kherallah, M., Alimi Adel, M.: Combining multiple HMMs using online and offline features for offline Arabic handwriting recognition. In: Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR) (2009)
15.
Zurück zum Zitat Kessentini, Y., Paquet, T., Ben Hamado, A.M.: Offline handwritten word recognition using multi-stream hidden Markov models. J. Pattern Recognit. Lett. 1(1) (2010) Kessentini, Y., Paquet, T., Ben Hamado, A.M.: Offline handwritten word recognition using multi-stream hidden Markov models. J. Pattern Recognit. Lett. 1(1) (2010)
16.
17.
Zurück zum Zitat Liu, C., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recognit. 36, 2271–2285 (2003)CrossRefMATH Liu, C., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recognit. 36, 2271–2285 (2003)CrossRefMATH
18.
Zurück zum Zitat Märgner, V., El Abed, H.: ICDAR 2007 Arabic handwriting recognition competition. In: Proceedings 9th Int. Conf. on Document Analysis and Recognition (ICDAR), pp. 1274–1278 (2007) Märgner, V., El Abed, H.: ICDAR 2007 Arabic handwriting recognition competition. In: Proceedings 9th Int. Conf. on Document Analysis and Recognition (ICDAR), pp. 1274–1278 (2007)
19.
Zurück zum Zitat Märgner, V., El Abed, H.: ICDAR 2009 Arabic handwriting recognition competition. In: Proceedings of the 10th Int. Conf. on Document Analysis and Recognition (ICDAR), pp. 1383–1387 (2009) Märgner, V., El Abed, H.: ICDAR 2009 Arabic handwriting recognition competition. In: Proceedings of the 10th Int. Conf. on Document Analysis and Recognition (ICDAR), pp. 1383–1387 (2009)
20.
Zurück zum Zitat Märgner, V., El Abed, H.: ICDAR 2011 Arabic handwriting recognition competition. In: Proceedings of the 11th Int. Conf. on Document Analysis and Recognition (ICDAR) (2011) Märgner, V., El Abed, H.: ICDAR 2011 Arabic handwriting recognition competition. In: Proceedings of the 11th Int. Conf. on Document Analysis and Recognition (ICDAR) (2011)
21.
Zurück zum Zitat Märgner, V., El Abed, H.: ICFHR 2010 Arabic handwriting recognition competition. In: Proceedings of the 12th International Conference on Frontiers in Handwriting Recognition(ICFHR) (2010) Märgner, V., El Abed, H.: ICFHR 2010 Arabic handwriting recognition competition. In: Proceedings of the 12th International Conference on Frontiers in Handwriting Recognition(ICFHR) (2010)
22.
Zurück zum Zitat Märgner, V., Pechwitz, M., El Abed, H.: ICDAR 2005 Arabic handwriting recognition competition. Proc. 8th Int. Conf. Doc. Anal. Recognit. 1, 70–74 (2005) Märgner, V., Pechwitz, M., El Abed, H.: ICDAR 2005 Arabic handwriting recognition competition. Proc. 8th Int. Conf. Doc. Anal. Recognit. 1, 70–74 (2005)
23.
Zurück zum Zitat Pechwitz, M., Maddouri, S.S., Maergner, V., Ellouze, N., Amiri, H.: IFN/ENIT-database of handwritten Arabic words. In: Proceedings of the Colloque International Francophone surl’Ècrit et le Document (CIFED ’02), pp. 129–136. Hammamet, Tunisia, October (2002) Pechwitz, M., Maddouri, S.S., Maergner, V., Ellouze, N., Amiri, H.: IFN/ENIT-database of handwritten Arabic words. In: Proceedings of the Colloque International Francophone surl’Ècrit et le Document (CIFED ’02), pp. 129–136. Hammamet, Tunisia, October (2002)
24.
Zurück zum Zitat Pechwitz, M., Maergner, V.: HMM based approach for handwritten Arabic word recognition using the IFN/ENIT-database. In: Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR’03) (2003) Pechwitz, M., Maergner, V.: HMM based approach for handwritten Arabic word recognition using the IFN/ENIT-database. In: Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR’03) (2003)
25.
Zurück zum Zitat Rodríguez, J.A., Perronnin, F.: Local gradient histogram features for word spotting in unconstrained handwritten documents. In: Proceeding of International Conference on Frontiers and Handwriting Recognition (ICFHR2008) Montréal, Québec (2008) Rodríguez, J.A., Perronnin, F.: Local gradient histogram features for word spotting in unconstrained handwritten documents. In: Proceeding of International Conference on Frontiers and Handwriting Recognition (ICFHR2008) Montréal, Québec (2008)
26.
Zurück zum Zitat Suen, C.Y., Lam, L., Lee, S.-W.: Thinning methodologies—a comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 14(9), 879 (1992) Suen, C.Y., Lam, L., Lee, S.-W.: Thinning methodologies—a comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 14(9), 879 (1992)
27.
Zurück zum Zitat Xiang, D., Yan, H., Chen, X., Cheng, Y.: Offline Arabic handwriting recognition system based on HMM. In: Computer Science and Information Technology ICCSIT, 3rd IEEE International Conference (2010) Xiang, D., Yan, H., Chen, X., Cheng, Y.: Offline Arabic handwriting recognition system based on HMM. In: Computer Science and Information Technology ICCSIT, 3rd IEEE International Conference (2010)
Metadaten
Titel
Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models
verfasst von
Sherif Abdel Azeem
Hany Ahmed
Publikationsdatum
01.12.2013
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 4/2013
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-013-0201-8

Weitere Artikel der Ausgabe 4/2013

International Journal on Document Analysis and Recognition (IJDAR) 4/2013 Zur Ausgabe