
2012 | OriginalPaper | Book chapter

6. Features for HMM-Based Arabic Handwritten Word Recognition Systems

Written by: Laurence Likforman-Sulem, Ramy Al Hajj Mohammad, Chafic Mokbel, Fares Menasri, Anne-Laure Bianne-Bernard, Christopher Kermorvant

Published in: Guide to OCR for Arabic Scripts

Publisher: Springer London

Abstract

HMM-based systems need observation sequences as input. These observations consist of discrete values or vectors extracted from word images or text lines. In this chapter we explore various types of features which are popular for Arabic cursive handwriting recognition. Some of these features are statistical, based on pixel distributions or local directions. Others are structural, based on the presence of loops, ascenders, or descenders. We show how these features can be efficient within HMM-based systems based on sliding windows or grapheme segmentation.
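The sliding-window idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the chapter's feature set: the function name `sliding_window_features`, the window `width` and `step` defaults, and the choice of two pixel-distribution features (ink density and normalized vertical center of gravity) are assumptions made for illustration. The right-to-left scan order is likewise an assumption, chosen to follow the writing direction of Arabic script.

```python
from typing import List

Image = List[List[int]]  # binary word image: 1 = ink pixel, 0 = background


def sliding_window_features(image: Image, width: int = 4, step: int = 2,
                            right_to_left: bool = True) -> List[List[float]]:
    """Turn a word image into an observation sequence for an HMM.

    Each window yields one vector of two illustrative statistical features:
    [ink density, normalized vertical center of gravity of the ink].
    """
    if not image or not image[0]:
        raise ValueError("image must contain at least one pixel")
    if width <= 0 or step <= 0:
        raise ValueError("width and step must be positive")

    height = len(image)
    n_cols = len(image[0])

    # Left edge of every window; reversed so the scan follows the
    # right-to-left writing direction (an assumption of this sketch).
    starts = list(range(0, max(n_cols - width, 0) + 1, step))
    if right_to_left:
        starts.reverse()

    observations = []
    for start in starts:
        ink = 0            # number of ink pixels in the window
        weighted_rows = 0  # sum of row indices over ink pixels
        for r in range(height):
            row_ink = sum(image[r][start:start + width])
            ink += row_ink
            weighted_rows += r * row_ink

        cols = min(width, n_cols - start)
        density = ink / (height * cols)
        # Empty windows get a neutral center value of 0.5.
        if ink and height > 1:
            center = (weighted_rows / ink) / (height - 1)
        else:
            center = 0.5
        observations.append([density, center])
    return observations


if __name__ == "__main__":
    toy_word = [
        [1, 1, 0, 0],
        [0, 0, 0, 0],
    ]
    for vector in sliding_window_features(toy_word, width=2, step=2):
        print(vector)
```

Each returned vector is one observation. A continuous-density HMM could consume the vectors directly, while a discrete HMM would first need them quantized into symbols.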

Metadata
Title
Features for HMM-Based Arabic Handwritten Word Recognition Systems
Written by
Laurence Likforman-Sulem
Ramy Al Hajj Mohammad
Chafic Mokbel
Fares Menasri
Anne-Laure Bianne-Bernard
Christopher Kermorvant
Copyright year
2012
Publisher
Springer London
DOI
https://doi.org/10.1007/978-1-4471-4072-6_6