Skip to main content
Top
Published in: Neural Computing and Applications 2/2017

16-09-2015 | Original Article

Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features

Authors: Saeeda Naz, Arif I. Umar, Riaz Ahmad, Saad B. Ahmed, Syed H. Shirazi, Muhammad I. Razzak

Published in: Neural Computing and Applications | Issue 2/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Character recognition for cursive script like Arabic, handwritten English and French is a challenging task which becomes more complicated for Urdu Nasta’liq text due to complexity of this script over Arabic. Recurrent neural network (RNN) has proved excellent performance for English, French as well as cursive Arabic script due to sequence learning property. Most of the recent approaches perform segmentation-based character recognition, whereas, due to the complexity of the Nasta’liq script, segmentation error is quite high as compared to Arabic Naskh script. RNN has provided promising results in such scenarios. In this paper, we achieved high accuracy for Urdu Nasta’liq using statistical features and multi-dimensional long short-term memory. We present a robust feature extraction approach that extracts feature based on right-to-left sliding window. Results showed that selected features significantly reduce the label error. For evaluation purposes, we have used Urdu printed text images dataset and compared the proposed approach with the recent work. The system provided 94.97 % recognition accuracy for unconstrained printed Nasta’liq text lines and outperforms the state-of-the-art results.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Naz S, Razzak MI, Hayat K, Anwar MW, Khan SZ (2014) Challenges in baseline detection of arabic script based languages. Intell Syst Sci Inf 542:181–196CrossRef Naz S, Razzak MI, Hayat K, Anwar MW, Khan SZ (2014) Challenges in baseline detection of arabic script based languages. Intell Syst Sci Inf 542:181–196CrossRef
2.
go back to reference Sabbour N, Shafait F, Sabbour N, Shafait F (2013) A segmentation-free approach to Arabic and Urdu OCR. In: Proceedings of the SPIE international society for optics and photonics, vol 86580, p 86580 N Sabbour N, Shafait F, Sabbour N, Shafait F (2013) A segmentation-free approach to Arabic and Urdu OCR. In: Proceedings of the SPIE international society for optics and photonics, vol 86580, p 86580 N
4.
go back to reference Ul-Hasan A, Ahmed SB, Rashid F, Shafait F, Breuel TM (2013) Offline printed Urdu Nastaleeq script recognition with bidirectional LSTM networks. In: 12th International conference on document analysis and recognition (ICDAR’13), pp. 1061–1065 Ul-Hasan A, Ahmed SB, Rashid F, Shafait F, Breuel TM (2013) Offline printed Urdu Nastaleeq script recognition with bidirectional LSTM networks. In: 12th International conference on document analysis and recognition (ICDAR’13), pp. 1061–1065
5.
go back to reference McCulloch WPWS (1990) A logical calculus of the ideas immanent in nervous activity. Bull Math Biol 52(1–2):99–115CrossRef McCulloch WPWS (1990) A logical calculus of the ideas immanent in nervous activity. Bull Math Biol 52(1–2):99–115CrossRef
6.
go back to reference Rosenblatt F (1961) Principles of neurodynamics: perceptrons theory brain mechanism. No. VG-1196-G-8. Cornell Aeronautical Lab Inc. Rosenblatt F (1961) Principles of neurodynamics: perceptrons theory brain mechanism. No. VG-1196-G-8. Cornell Aeronautical Lab Inc.
7.
go back to reference Jaeger H (2002) Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the ‘echo state network’ approach. GMD Rep 159, Ger Natl Res Cent Inf Technol, p 48 Jaeger H (2002) Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the ‘echo state network’ approach. GMD Rep 159, Ger Natl Res Cent Inf Technol, p 48
8.
go back to reference Hochreiter J, Schmidhuber S (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef Hochreiter J, Schmidhuber S (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef
9.
go back to reference Schmidhuber J, Gers FA (2001) LSTM recurrent networks learn simple context free and context sensitive languages. IEEE Trans Neural Netw 12(6):1333–1340CrossRef Schmidhuber J, Gers FA (2001) LSTM recurrent networks learn simple context free and context sensitive languages. IEEE Trans Neural Netw 12(6):1333–1340CrossRef
10.
go back to reference Graves A (2012) Offline arabic handwriting recognition with multidimensional recurrent neural networks. Springer, LondonCrossRef Graves A (2012) Offline arabic handwriting recognition with multidimensional recurrent neural networks. Springer, LondonCrossRef
11.
go back to reference Schuster M, Paliwal K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45:2673–2681CrossRef Schuster M, Paliwal K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45:2673–2681CrossRef
12.
go back to reference Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw 18:602–610CrossRef Graves A, Schmidhuber J (2005) Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw 18:602–610CrossRef
13.
go back to reference Graves A, Fernández S, Schmidhuber J (2007) Multidimensional recurrent neural networks. In: Proceedings of the international conference on artificial neural networks Graves A, Fernández S, Schmidhuber J (2007) Multidimensional recurrent neural networks. In: Proceedings of the international conference on artificial neural networks
14.
go back to reference Naz S, Umar AI, Shirazi SH, Ahmed SB, Siddiqi I, Razzak MI (2015) Segmentation techniques for recognition of Arabic-like scripts: a comprehensive survey. Educ Inf Technol 20(2). doi:10.1007/s10639-015-9377-5 Naz S, Umar AI, Shirazi SH, Ahmed SB, Siddiqi I, Razzak MI (2015) Segmentation techniques for recognition of Arabic-like scripts: a comprehensive survey. Educ Inf Technol 20(2). doi:10.​1007/​s10639-015-9377-5
15.
go back to reference Naz S, Hayat K, Razzak MI, Anwar MW, Madani SA, Khan SU (2013) The optical character recognition of Urdu-like cursive scripts. Pattern Recognit 47(3):1229–1248CrossRef Naz S, Hayat K, Razzak MI, Anwar MW, Madani SA, Khan SU (2013) The optical character recognition of Urdu-like cursive scripts. Pattern Recognit 47(3):1229–1248CrossRef
16.
go back to reference Nishide S, Okuno HG, Ogata T, Tani J (2011) Handwriting prediction based character recognition using recurrent neural network. In: IEEE international conference on systems, man, and cybernetics (SMC), pp 2549–2554 Nishide S, Okuno HG, Ogata T, Tani J (2011) Handwriting prediction based character recognition using recurrent neural network. In: IEEE international conference on systems, man, and cybernetics (SMC), pp 2549–2554
17.
go back to reference Graves A, Liwicki M, Fernández S, Bertolami R, Bunke H, Schmidhuber J (2009) A novel connectionist system for unconstrained handwriting recognition. IEEE Trans Pattern Anal Mach Intell 31:855–868CrossRef Graves A, Liwicki M, Fernández S, Bertolami R, Bunke H, Schmidhuber J (2009) A novel connectionist system for unconstrained handwriting recognition. IEEE Trans Pattern Anal Mach Intell 31:855–868CrossRef
18.
go back to reference Graves A, Schmidhuber J (2009) Offline handwriting recognition with multidimensional recurrent neural networks. International conference on neural information processing systems, pp 545–552 Graves A, Schmidhuber J (2009) Offline handwriting recognition with multidimensional recurrent neural networks. International conference on neural information processing systems, pp 545–552
19.
go back to reference Liwicki M, Graves A, Bunke H, Schmidhuber J (2007) A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks. Proc 9th Int Conf Doc Anal Recognit 1:367–371 Liwicki M, Graves A, Bunke H, Schmidhuber J (2007) A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks. Proc 9th Int Conf Doc Anal Recognit 1:367–371
20.
go back to reference Graves A, Mohamed A, Hinton G (2013) Speech recognition with deep recurrent neural networks. Icassp 3:6645–6649 Graves A, Mohamed A, Hinton G (2013) Speech recognition with deep recurrent neural networks. Icassp 3:6645–6649
21.
go back to reference Ahmed SB, Naz S, Swati S, Razzak MI, Khan AA, Umar AI (2015) Ucom offline dataset: a Urdu handwritten dataset generation. Int Arab J Inf Technol 12(5) Ahmed SB, Naz S, Swati S, Razzak MI, Khan AA, Umar AI (2015) Ucom offline dataset: a Urdu handwritten dataset generation.  Int Arab J Inf Technol 12(5)
22.
go back to reference Rath TM, Manmatha R (2003) Features for word spotting in historical manuscripts. In: Proceedings of the seventh international conference on document analysis recognition, 2003 Rath TM, Manmatha R (2003) Features for word spotting in historical manuscripts. In: Proceedings of the seventh international conference on document analysis recognition, 2003
23.
go back to reference Al-Hajj Mohamad R, Likforman-Sulem L, Mokbel C (2009) Combining slanted-frame classifiers for improved HMM-based Arabic handwriting recognition. IEEE Trans Pattern Anal Mach Intell 31(7):1165–1177CrossRef Al-Hajj Mohamad R, Likforman-Sulem L, Mokbel C (2009) Combining slanted-frame classifiers for improved HMM-based Arabic handwriting recognition. IEEE Trans Pattern Anal Mach Intell 31(7):1165–1177CrossRef
24.
go back to reference Khorsheed MS, Al-Omari HK (2014) System and methods for arabic text recognition based on effective arabic text feature extraction. IS Patent No US 20140219562 A1 Khorsheed MS, Al-Omari HK (2014) System and methods for arabic text recognition based on effective arabic text feature extraction. IS Patent No US 20140219562 A1
25.
go back to reference Khorsheed MS (2007) Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK). Pattern Recognit Lett 28(12):1563–1571CrossRef Khorsheed MS (2007) Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK). Pattern Recognit Lett 28(12):1563–1571CrossRef
26.
go back to reference Naeimizaghiani M, Abdullah SNHS, Bataineh B, PirahanSiah F (eds) (2011) Character recognition based on global feature extraction. In: International conference on electrical engineering and informatics (ICEEI), pp 1–4 Naeimizaghiani M, Abdullah SNHS, Bataineh B, PirahanSiah F (eds) (2011) Character recognition based on global feature extraction. In: International conference on electrical engineering and informatics (ICEEI), pp 1–4
27.
go back to reference Singla L, Singh S (2014) Offline handwritten devanagari numerals recognition using GLCM features and neural networks. Int J Eng Res Technol 3(6):25CrossRef Singla L, Singh S (2014) Offline handwritten devanagari numerals recognition using GLCM features and neural networks. Int J Eng Res Technol 3(6):25CrossRef
28.
go back to reference Bharathi VC, Geetha MK (2013) Segregated handwritten character recognition using GLCM features. Int J Comput Appl 84(2):1–7 Bharathi VC, Geetha MK (2013) Segregated handwritten character recognition using GLCM features. Int J Comput Appl 84(2):1–7
29.
go back to reference Ahmad Z, Orakzai JK, Shamsher I (2009) Urdu compound character recognition using feed forward neural networks. In: Proceedings of the 2nd international conference on computer science and information technology (ICCSIT’09), pp 457–462 Ahmad Z, Orakzai JK, Shamsher I (2009) Urdu compound character recognition using feed forward neural networks. In: Proceedings of the 2nd international conference on computer science and information technology (ICCSIT’09), pp 457–462
30.
go back to reference Morillot O, Oprean C, Likforman-sulem L, Mokbel C, Chammas E, Grosicki E, Paristech IMT, Ltci C (2013) The UOB-telecom Paristech Arabic handwriting recognition and translation systems for the openhart 2013 competition. NIST-openhart Workshop, Washington Morillot O, Oprean C, Likforman-sulem L, Mokbel C, Chammas E, Grosicki E, Paristech IMT, Ltci C (2013) The UOB-telecom Paristech Arabic handwriting recognition and translation systems for the openhart 2013 competition. NIST-openhart Workshop, Washington
31.
go back to reference Chherawala Y, Roy PP, Cheriet M (2013) Feature design for offline Arabic handwriting recognition: handcrafted vs automated. In: Proceedings of the 12th international conference on document analysis and recognition (ICDAR) Chherawala Y, Roy PP, Cheriet M (2013) Feature design for offline Arabic handwriting recognition: handcrafted vs automated. In: Proceedings of the 12th international conference on document analysis and recognition (ICDAR)
32.
go back to reference Marti UV, Bunke H (2000) Using a statistical language model to improve the performance of an hmm based cursive handwriting recognition system. Int J Pattern Recognit Artif Intell 15(01):6–90 Marti UV, Bunke H (2000) Using a statistical language model to improve the performance of an hmm based cursive handwriting recognition system. Int J Pattern Recognit Artif Intell 15(01):6–90
33.
go back to reference Azeem SA, Ahmed H (2013) Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models. Int J Doc Anal Recognit 16(4):399–412CrossRef Azeem SA, Ahmed H (2013) Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models. Int J Doc Anal Recognit 16(4):399–412CrossRef
34.
go back to reference Morillot O, Likforman-Sulem L, Grosicki E (2013) New baseline correction algorithm for text-line recognition with bidirectional recurrent neural networks. J Electron Imaging 22(2):023028CrossRef Morillot O, Likforman-Sulem L, Grosicki E (2013) New baseline correction algorithm for text-line recognition with bidirectional recurrent neural networks. J Electron Imaging 22(2):023028CrossRef
35.
go back to reference Naz S, Hayat K, Razzak MI, Anwar MW, Akbar H (2013) Arabic script based character segmentation: a review. In: World congress on computer and information technology (WCCIT’13), pp 1–6 Naz S, Hayat K, Razzak MI, Anwar MW, Akbar H (2013) Arabic script based character segmentation: a review. In: World congress on computer and information technology (WCCIT’13), pp 1–6
36.
go back to reference Ul-Hasan A, Bin Ahmed S, Rashid F, Shafait F, Breuel TM (2013) Offline printed urdu nastaleeq script recognition with bidirectional LSTM networks. In: Proceedings of the international conference on document analysis recognition, ICDAR, pp 1061–1065 Ul-Hasan A, Bin Ahmed S, Rashid F, Shafait F, Breuel TM (2013) Offline printed urdu nastaleeq script recognition with bidirectional LSTM networks. In: Proceedings of the international conference on document analysis recognition, ICDAR, pp 1061–1065
37.
go back to reference Javed ST, Hussain S, Maqbool A, Asloob S, Jamil S, Moin H (2010) Segmentation free Nastalique Urdu OCR. Word Acad Sci Eng Technol 46:456–461 Javed ST, Hussain S, Maqbool A, Asloob S, Jamil S, Moin H (2010) Segmentation free Nastalique Urdu OCR. Word Acad Sci Eng Technol 46:456–461
38.
go back to reference Akram QUA, Hussain S, Niazi A, Anjum U, Irfan F (2014) Adapting tesseract for complex scripts: an example for Urdu Nastalique. In: Proceedings of the 11th IAPR international workshop on document analysis systems, pp 191–195 Akram QUA, Hussain S, Niazi A, Anjum U, Irfan F (2014) Adapting tesseract for complex scripts: an example for Urdu Nastalique. In: Proceedings of the 11th IAPR international workshop on document analysis systems, pp 191–195
39.
go back to reference Javed ST, Hussain S (2013) Segmentation based Urdu Nastalique OCR, Lecture notes in computer science, vol 8259, pp 41–49 Javed ST, Hussain S (2013) Segmentation based Urdu Nastalique OCR, Lecture notes in computer science, vol 8259, pp 41–49
Metadata
Title
Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features
Authors
Saeeda Naz
Arif I. Umar
Riaz Ahmad
Saad B. Ahmed
Syed H. Shirazi
Muhammad I. Razzak
Publication date
16-09-2015
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 2/2017
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-015-2051-4

Other articles of this Issue 2/2017

Neural Computing and Applications 2/2017 Go to the issue

Premium Partner