Skip to main content
Top
Published in: International Journal on Document Analysis and Recognition (IJDAR) 4/2022

07-10-2022 | Special Issue Paper

A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks

Authors: Aejaz Farooq Ganai, Farida Khursheed

Published in: International Journal on Document Analysis and Recognition (IJDAR) | Issue 4/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Handwritten Urdu recognition has been the least explored to date due to unavailability of a standard hand-written Urdu dataset, huge variation among writing styles of different Urdu writers, irregular positioning of diacritics associated with ligatures, similarity in shape of some Urdu characters in writing, and unavailability of an efficient learning and training technique. Few researchers have proposed the handwritten Urdu datasets among which only Urdu Nastaliq handwritten dataset (UNHD) is publicly available. The UNHD contains ligatures of only up to five characters and does not cover the entire Urdu ligature corpus. Hence, we present a novel comprehensive handwritten Urdu dataset named UHLD for the ‘Urdu Handwritten Ligature Dataset’:—which consists of ligatures of up to seven-character length and covers most of the ligature corpus of the Urdu language. The UHLD is written by both genders independent of age of person, paper color, paper type (blank or ruled), ink color, pen type. We propose an unconstrained handwritten Urdu recognition system that can recognize handwritten Urdu ligatures with up to six characters. A new robust algorithm has also been proposed here that is able to divide a complete ligature into primary and secondary components with 98% accuracy on a large Urdu dataset. Our proposed holistic handwritten Urdu recognition system ensures independent recognition of both primary and secondary components of a word/ligature. The proposed recognition technique is transformation invariant and computationally efficient and achieves a better recognition rate of 97% for UHLD and 93% for UNHD.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Naz, S., Umar, A.I., Shirazi, S.H., Khan, S.A., Ahmed, I., Khan, A.A.: Challenges of Urdu named entity recognition: a scarce resourced language. Res. J. Appl. Sci. Eng. Technol. 8(10), 1272–1278 (2014)CrossRef Naz, S., Umar, A.I., Shirazi, S.H., Khan, S.A., Ahmed, I., Khan, A.A.: Challenges of Urdu named entity recognition: a scarce resourced language. Res. J. Appl. Sci. Eng. Technol. 8(10), 1272–1278 (2014)CrossRef
2.
go back to reference Daud, A., Khan, W., Che, D.: Urdu language processing: a survey. Artif. Intell. Rev. 47(3), 279–311 (2017)CrossRef Daud, A., Khan, W., Che, D.: Urdu language processing: a survey. Artif. Intell. Rev. 47(3), 279–311 (2017)CrossRef
3.
go back to reference Weber, G.: Top languages. Retrieved April. 11, 2009 (2008) Weber, G.: Top languages. Retrieved April. 11, 2009 (2008)
4.
go back to reference Ahmed, S.B., Naz, S., Swati, S., Razzak, M.I.: Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 31(4), 1143–1151 (2019)CrossRef Ahmed, S.B., Naz, S., Swati, S., Razzak, M.I.: Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 31(4), 1143–1151 (2019)CrossRef
5.
go back to reference Alghamdi, M. A., Alkhazi, I. S., Teahan, W. J.: (July). Arabic OCR evaluation tool. In 2016 7th international conference on computer science and information technology (CSIT) (pp. 1-6). IEEE (2016) Alghamdi, M. A., Alkhazi, I. S., Teahan, W. J.: (July). Arabic OCR evaluation tool. In 2016 7th international conference on computer science and information technology (CSIT) (pp. 1-6). IEEE (2016)
6.
go back to reference Satti, D.A., Saleem, K.: (November). Complexities and implementation challenges in offline urdu Nastaliq OCR. In: Proceedings of the Conference on Language & Technology, 85-91m (2012) Satti, D.A., Saleem, K.: (November). Complexities and implementation challenges in offline urdu Nastaliq OCR. In: Proceedings of the Conference on Language & Technology, 85-91m (2012)
7.
go back to reference Khan, N.H., Adnan, A.: Urdu optical character recognition systems: Present contributions and future directions. IEEE Access 6, 46019–46046 (2018)CrossRef Khan, N.H., Adnan, A.: Urdu optical character recognition systems: Present contributions and future directions. IEEE Access 6, 46019–46046 (2018)CrossRef
8.
go back to reference Naz, S., Umar, A.I., Ahmed, R., Razzak, M.I., Rashid, S.F., Shafait, F.: Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks. Springerplus 5(1), 2010 (2016)CrossRef Naz, S., Umar, A.I., Ahmed, R., Razzak, M.I., Rashid, S.F., Shafait, F.: Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks. Springerplus 5(1), 2010 (2016)CrossRef
10.
go back to reference Din, I.U., Siddiqi, I., Khalid, S., Azam, T.: Segmentation-free optical character recognition for printed Urdu text. EURASIP J Image Video Process 2017(1), 62 (2017)CrossRef Din, I.U., Siddiqi, I., Khalid, S., Azam, T.: Segmentation-free optical character recognition for printed Urdu text. EURASIP J Image Video Process 2017(1), 62 (2017)CrossRef
11.
go back to reference Lehal, G.S.: December. Choice of recognizable units for Urdu OCR. In: Proceeding of the Workshop on Document Analysis and Recognition, pp. 79–85 (2012) Lehal, G.S.: December. Choice of recognizable units for Urdu OCR. In: Proceeding of the Workshop on Document Analysis and Recognition, pp. 79–85 (2012)
12.
go back to reference Ahmed, S.B., Naz, S., Swati, S., Razzak, I., Umar, A.I. Khan, A.A.: UCOM offline dataset-an urdu handwritten dataset generation. Int. Arab J. Inf. Technol. (IAJIT), 14(2) (2017) Ahmed, S.B., Naz, S., Swati, S., Razzak, I., Umar, A.I. Khan, A.A.: UCOM offline dataset-an urdu handwritten dataset generation. Int. Arab J. Inf. Technol. (IAJIT), 14(2) (2017)
13.
go back to reference Husnain, M., Saad Missen, M.M., Mumtaz, S., Jhanidr, M.Z., Coustaty, M., Muzzamil Luqman, M., Ogier, J.M., Sang Choi, G.: Recognition of urdu handwritten characters using convolutional neural network. Appl. Sci. 9(13), 2758 (2019)CrossRef Husnain, M., Saad Missen, M.M., Mumtaz, S., Jhanidr, M.Z., Coustaty, M., Muzzamil Luqman, M., Ogier, J.M., Sang Choi, G.: Recognition of urdu handwritten characters using convolutional neural network. Appl. Sci. 9(13), 2758 (2019)CrossRef
14.
go back to reference Hassan, S., Irfan, A., Mirza, A. Siddiqi, I.: Cursive handwritten text recognition using Bi-directional LSTMs: A case study on Urdu handwriting. In: 2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML), pp. 67-72. IEEE (2019) Hassan, S., Irfan, A., Mirza, A. Siddiqi, I.: Cursive handwritten text recognition using Bi-directional LSTMs: A case study on Urdu handwriting. In: 2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML), pp. 67-72. IEEE (2019)
15.
go back to reference Ahmed, S.B., Hameed, I.A., Naz, S., Razzak, M.I., Yusof, R.: Evaluation of handwritten Urdu text by integration of MNIST dataset learning experience. IEEE Access 7, 153566–153578 (2019)CrossRef Ahmed, S.B., Hameed, I.A., Naz, S., Razzak, M.I., Yusof, R.: Evaluation of handwritten Urdu text by integration of MNIST dataset learning experience. IEEE Access 7, 153566–153578 (2019)CrossRef
16.
go back to reference Naeem, M.F., Raza, S.M., Khan, M.M., Ul-Hasan, A., Shafait, F.: A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition. Neural Comput. Appl. 34(2), 1635–48 (2022)CrossRef Naeem, M.F., Raza, S.M., Khan, M.M., Ul-Hasan, A., Shafait, F.: A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition. Neural Comput. Appl. 34(2), 1635–48 (2022)CrossRef
17.
go back to reference GuoDong, Z., KimTeng, L.: Interpolation of n-gram and mutual information based trigger pair language models for Mandarin speech recognition. Comput. Speech Language. 13(2), 125–41 (1999)CrossRef GuoDong, Z., KimTeng, L.: Interpolation of n-gram and mutual information based trigger pair language models for Mandarin speech recognition. Comput. Speech Language. 13(2), 125–41 (1999)CrossRef
18.
go back to reference Ganai, A.F. Koul, A.: September. Projection profile based ligature segmentation of Nastaleeq Urdu OCR. In: 2016 4th International Symposium on Computational and Business Intelligence (ISCBI), pp. 170–175. IEEE (2016) Ganai, A.F. Koul, A.: September. Projection profile based ligature segmentation of Nastaleeq Urdu OCR. In: 2016 4th International Symposium on Computational and Business Intelligence (ISCBI), pp. 170–175. IEEE (2016)
20.
go back to reference Rehman, K.U.U., Khan, Y.D.: A scale and rotation invariant Urdu Nastalique ligature recognition using cascade forward backpropagation neural network. IEEE Access 7, 120648–120669 (2019)CrossRef Rehman, K.U.U., Khan, Y.D.: A scale and rotation invariant Urdu Nastalique ligature recognition using cascade forward backpropagation neural network. IEEE Access 7, 120648–120669 (2019)CrossRef
21.
go back to reference Mostafavi, S.M., Kazerouni, I.A. Haddadnia, J.: Noise removal from printed text and handwriting images using coordinate logic filters. In: 2010 International Conference on Computer Applications and Industrial Electronics, pp. 160-164. IEEE (2010) Mostafavi, S.M., Kazerouni, I.A. Haddadnia, J.: Noise removal from printed text and handwriting images using coordinate logic filters. In: 2010 International Conference on Computer Applications and Industrial Electronics, pp. 160-164. IEEE (2010)
22.
go back to reference Devi, H.: Thresholding: A Pixel-Level image processing methodology preprocessing technique for an OCR system for the Brahmi script. Ancient Asia, 1 (2006) Devi, H.: Thresholding: A Pixel-Level image processing methodology preprocessing technique for an OCR system for the Brahmi script. Ancient Asia, 1 (2006)
23.
go back to reference Kumar, V., Gupta, P.: Importance of statistical measures in digital image processing. Int. J. Emerging Technol. Adv. Eng. 2(8), 56–62 (2012) Kumar, V., Gupta, P.: Importance of statistical measures in digital image processing. Int. J. Emerging Technol. Adv. Eng. 2(8), 56–62 (2012)
24.
go back to reference Singh, Y.K.: Finding connected components in a gray scale image. ADBU J. Eng. Technol. 5(2) (2016) Singh, Y.K.: Finding connected components in a gray scale image. ADBU J. Eng. Technol. 5(2) (2016)
25.
go back to reference Sabbour, N., Shafait, F.: A segmentation-free approach to Arabic and Urdu OCR. In: Document Recognition and Retrieval XX (Vol. 8658, p. 86580N). International Society for Optics and Photonics (2013) Sabbour, N., Shafait, F.: A segmentation-free approach to Arabic and Urdu OCR. In: Document Recognition and Retrieval XX (Vol. 8658, p. 86580N). International Society for Optics and Photonics (2013)
26.
go back to reference Yang, L., Hanneke, S., Carbonell, J.: A theory of transfer learning with applications to active learning. Mach. Learn. 90(2), 161–189 (2013)MathSciNetCrossRefMATH Yang, L., Hanneke, S., Carbonell, J.: A theory of transfer learning with applications to active learning. Mach. Learn. 90(2), 161–189 (2013)MathSciNetCrossRefMATH
27.
go back to reference Ng, H.W., Nguyen, V.D., Vonikakis, V. Winkler, S.: November. Deep learning for emotion recognition on small datasets using transfer learning. In: Proceedings of the 2015 ACM on international conference on multimodal interaction, pp. 443-449 (2015) Ng, H.W., Nguyen, V.D., Vonikakis, V. Winkler, S.: November. Deep learning for emotion recognition on small datasets using transfer learning. In: Proceedings of the 2015 ACM on international conference on multimodal interaction, pp. 443-449 (2015)
28.
go back to reference Jogin, M., Madhulika, M.S., Divya, G.D., Meghana, R.K. Apoorva, S.: May. Feature extraction using convolution neural networks (CNN) and deep learning. In: 2018 3rd IEEE International Conference on Recent Trends in Electronics, In-formation & Communication Technology (RTEICT), pp. 2319–2323. IEEE (2018) Jogin, M., Madhulika, M.S., Divya, G.D., Meghana, R.K. Apoorva, S.: May. Feature extraction using convolution neural networks (CNN) and deep learning. In: 2018 3rd IEEE International Conference on Recent Trends in Electronics, In-formation & Communication Technology (RTEICT), pp. 2319–2323. IEEE (2018)
29.
go back to reference Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef
30.
go back to reference He, K., Zhang, X., Ren, S. Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S. Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
31.
go back to reference Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818-833. Springer, Cham (2014) Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818-833. Springer, Cham (2014)
32.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
33.
go back to reference Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
34.
go back to reference Uddin, I., Javed, N., Siddiqi, I.A., Khalid, S., Khurshid, K.: Recognition of printed Urdu ligatures using convolutional neural networks. J. Electron. Imaging 28(3), 033004 (2019)CrossRef Uddin, I., Javed, N., Siddiqi, I.A., Khalid, S., Khurshid, K.: Recognition of printed Urdu ligatures using convolutional neural networks. J. Electron. Imaging 28(3), 033004 (2019)CrossRef
Metadata
Title
A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks
Authors
Aejaz Farooq Ganai
Farida Khursheed
Publication date
07-10-2022
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Document Analysis and Recognition (IJDAR) / Issue 4/2022
Print ISSN: 1433-2833
Electronic ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-022-00414-7

Other articles of this Issue 4/2022

International Journal on Document Analysis and Recognition (IJDAR) 4/2022 Go to the issue

Premium Partner