Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 4/2022

07.10.2022 | Special Issue Paper

A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks

verfasst von: Aejaz Farooq Ganai, Farida Khursheed

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 4/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Handwritten Urdu recognition has been the least explored to date due to unavailability of a standard hand-written Urdu dataset, huge variation among writing styles of different Urdu writers, irregular positioning of diacritics associated with ligatures, similarity in shape of some Urdu characters in writing, and unavailability of an efficient learning and training technique. Few researchers have proposed the handwritten Urdu datasets among which only Urdu Nastaliq handwritten dataset (UNHD) is publicly available. The UNHD contains ligatures of only up to five characters and does not cover the entire Urdu ligature corpus. Hence, we present a novel comprehensive handwritten Urdu dataset named UHLD for the ‘Urdu Handwritten Ligature Dataset’:—which consists of ligatures of up to seven-character length and covers most of the ligature corpus of the Urdu language. The UHLD is written by both genders independent of age of person, paper color, paper type (blank or ruled), ink color, pen type. We propose an unconstrained handwritten Urdu recognition system that can recognize handwritten Urdu ligatures with up to six characters. A new robust algorithm has also been proposed here that is able to divide a complete ligature into primary and secondary components with 98% accuracy on a large Urdu dataset. Our proposed holistic handwritten Urdu recognition system ensures independent recognition of both primary and secondary components of a word/ligature. The proposed recognition technique is transformation invariant and computationally efficient and achieves a better recognition rate of 97% for UHLD and 93% for UNHD.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Naz, S., Umar, A.I., Shirazi, S.H., Khan, S.A., Ahmed, I., Khan, A.A.: Challenges of Urdu named entity recognition: a scarce resourced language. Res. J. Appl. Sci. Eng. Technol. 8(10), 1272–1278 (2014)CrossRef Naz, S., Umar, A.I., Shirazi, S.H., Khan, S.A., Ahmed, I., Khan, A.A.: Challenges of Urdu named entity recognition: a scarce resourced language. Res. J. Appl. Sci. Eng. Technol. 8(10), 1272–1278 (2014)CrossRef
2.
Zurück zum Zitat Daud, A., Khan, W., Che, D.: Urdu language processing: a survey. Artif. Intell. Rev. 47(3), 279–311 (2017)CrossRef Daud, A., Khan, W., Che, D.: Urdu language processing: a survey. Artif. Intell. Rev. 47(3), 279–311 (2017)CrossRef
3.
Zurück zum Zitat Weber, G.: Top languages. Retrieved April. 11, 2009 (2008) Weber, G.: Top languages. Retrieved April. 11, 2009 (2008)
4.
Zurück zum Zitat Ahmed, S.B., Naz, S., Swati, S., Razzak, M.I.: Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 31(4), 1143–1151 (2019)CrossRef Ahmed, S.B., Naz, S., Swati, S., Razzak, M.I.: Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 31(4), 1143–1151 (2019)CrossRef
5.
Zurück zum Zitat Alghamdi, M. A., Alkhazi, I. S., Teahan, W. J.: (July). Arabic OCR evaluation tool. In 2016 7th international conference on computer science and information technology (CSIT) (pp. 1-6). IEEE (2016) Alghamdi, M. A., Alkhazi, I. S., Teahan, W. J.: (July). Arabic OCR evaluation tool. In 2016 7th international conference on computer science and information technology (CSIT) (pp. 1-6). IEEE (2016)
6.
Zurück zum Zitat Satti, D.A., Saleem, K.: (November). Complexities and implementation challenges in offline urdu Nastaliq OCR. In: Proceedings of the Conference on Language & Technology, 85-91m (2012) Satti, D.A., Saleem, K.: (November). Complexities and implementation challenges in offline urdu Nastaliq OCR. In: Proceedings of the Conference on Language & Technology, 85-91m (2012)
7.
Zurück zum Zitat Khan, N.H., Adnan, A.: Urdu optical character recognition systems: Present contributions and future directions. IEEE Access 6, 46019–46046 (2018)CrossRef Khan, N.H., Adnan, A.: Urdu optical character recognition systems: Present contributions and future directions. IEEE Access 6, 46019–46046 (2018)CrossRef
8.
Zurück zum Zitat Naz, S., Umar, A.I., Ahmed, R., Razzak, M.I., Rashid, S.F., Shafait, F.: Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks. Springerplus 5(1), 2010 (2016)CrossRef Naz, S., Umar, A.I., Ahmed, R., Razzak, M.I., Rashid, S.F., Shafait, F.: Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks. Springerplus 5(1), 2010 (2016)CrossRef
10.
Zurück zum Zitat Din, I.U., Siddiqi, I., Khalid, S., Azam, T.: Segmentation-free optical character recognition for printed Urdu text. EURASIP J Image Video Process 2017(1), 62 (2017)CrossRef Din, I.U., Siddiqi, I., Khalid, S., Azam, T.: Segmentation-free optical character recognition for printed Urdu text. EURASIP J Image Video Process 2017(1), 62 (2017)CrossRef
11.
Zurück zum Zitat Lehal, G.S.: December. Choice of recognizable units for Urdu OCR. In: Proceeding of the Workshop on Document Analysis and Recognition, pp. 79–85 (2012) Lehal, G.S.: December. Choice of recognizable units for Urdu OCR. In: Proceeding of the Workshop on Document Analysis and Recognition, pp. 79–85 (2012)
12.
Zurück zum Zitat Ahmed, S.B., Naz, S., Swati, S., Razzak, I., Umar, A.I. Khan, A.A.: UCOM offline dataset-an urdu handwritten dataset generation. Int. Arab J. Inf. Technol. (IAJIT), 14(2) (2017) Ahmed, S.B., Naz, S., Swati, S., Razzak, I., Umar, A.I. Khan, A.A.: UCOM offline dataset-an urdu handwritten dataset generation. Int. Arab J. Inf. Technol. (IAJIT), 14(2) (2017)
13.
Zurück zum Zitat Husnain, M., Saad Missen, M.M., Mumtaz, S., Jhanidr, M.Z., Coustaty, M., Muzzamil Luqman, M., Ogier, J.M., Sang Choi, G.: Recognition of urdu handwritten characters using convolutional neural network. Appl. Sci. 9(13), 2758 (2019)CrossRef Husnain, M., Saad Missen, M.M., Mumtaz, S., Jhanidr, M.Z., Coustaty, M., Muzzamil Luqman, M., Ogier, J.M., Sang Choi, G.: Recognition of urdu handwritten characters using convolutional neural network. Appl. Sci. 9(13), 2758 (2019)CrossRef
14.
Zurück zum Zitat Hassan, S., Irfan, A., Mirza, A. Siddiqi, I.: Cursive handwritten text recognition using Bi-directional LSTMs: A case study on Urdu handwriting. In: 2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML), pp. 67-72. IEEE (2019) Hassan, S., Irfan, A., Mirza, A. Siddiqi, I.: Cursive handwritten text recognition using Bi-directional LSTMs: A case study on Urdu handwriting. In: 2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML), pp. 67-72. IEEE (2019)
15.
Zurück zum Zitat Ahmed, S.B., Hameed, I.A., Naz, S., Razzak, M.I., Yusof, R.: Evaluation of handwritten Urdu text by integration of MNIST dataset learning experience. IEEE Access 7, 153566–153578 (2019)CrossRef Ahmed, S.B., Hameed, I.A., Naz, S., Razzak, M.I., Yusof, R.: Evaluation of handwritten Urdu text by integration of MNIST dataset learning experience. IEEE Access 7, 153566–153578 (2019)CrossRef
16.
Zurück zum Zitat Naeem, M.F., Raza, S.M., Khan, M.M., Ul-Hasan, A., Shafait, F.: A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition. Neural Comput. Appl. 34(2), 1635–48 (2022)CrossRef Naeem, M.F., Raza, S.M., Khan, M.M., Ul-Hasan, A., Shafait, F.: A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition. Neural Comput. Appl. 34(2), 1635–48 (2022)CrossRef
17.
Zurück zum Zitat GuoDong, Z., KimTeng, L.: Interpolation of n-gram and mutual information based trigger pair language models for Mandarin speech recognition. Comput. Speech Language. 13(2), 125–41 (1999)CrossRef GuoDong, Z., KimTeng, L.: Interpolation of n-gram and mutual information based trigger pair language models for Mandarin speech recognition. Comput. Speech Language. 13(2), 125–41 (1999)CrossRef
18.
Zurück zum Zitat Ganai, A.F. Koul, A.: September. Projection profile based ligature segmentation of Nastaleeq Urdu OCR. In: 2016 4th International Symposium on Computational and Business Intelligence (ISCBI), pp. 170–175. IEEE (2016) Ganai, A.F. Koul, A.: September. Projection profile based ligature segmentation of Nastaleeq Urdu OCR. In: 2016 4th International Symposium on Computational and Business Intelligence (ISCBI), pp. 170–175. IEEE (2016)
20.
Zurück zum Zitat Rehman, K.U.U., Khan, Y.D.: A scale and rotation invariant Urdu Nastalique ligature recognition using cascade forward backpropagation neural network. IEEE Access 7, 120648–120669 (2019)CrossRef Rehman, K.U.U., Khan, Y.D.: A scale and rotation invariant Urdu Nastalique ligature recognition using cascade forward backpropagation neural network. IEEE Access 7, 120648–120669 (2019)CrossRef
21.
Zurück zum Zitat Mostafavi, S.M., Kazerouni, I.A. Haddadnia, J.: Noise removal from printed text and handwriting images using coordinate logic filters. In: 2010 International Conference on Computer Applications and Industrial Electronics, pp. 160-164. IEEE (2010) Mostafavi, S.M., Kazerouni, I.A. Haddadnia, J.: Noise removal from printed text and handwriting images using coordinate logic filters. In: 2010 International Conference on Computer Applications and Industrial Electronics, pp. 160-164. IEEE (2010)
22.
Zurück zum Zitat Devi, H.: Thresholding: A Pixel-Level image processing methodology preprocessing technique for an OCR system for the Brahmi script. Ancient Asia, 1 (2006) Devi, H.: Thresholding: A Pixel-Level image processing methodology preprocessing technique for an OCR system for the Brahmi script. Ancient Asia, 1 (2006)
23.
Zurück zum Zitat Kumar, V., Gupta, P.: Importance of statistical measures in digital image processing. Int. J. Emerging Technol. Adv. Eng. 2(8), 56–62 (2012) Kumar, V., Gupta, P.: Importance of statistical measures in digital image processing. Int. J. Emerging Technol. Adv. Eng. 2(8), 56–62 (2012)
24.
Zurück zum Zitat Singh, Y.K.: Finding connected components in a gray scale image. ADBU J. Eng. Technol. 5(2) (2016) Singh, Y.K.: Finding connected components in a gray scale image. ADBU J. Eng. Technol. 5(2) (2016)
25.
Zurück zum Zitat Sabbour, N., Shafait, F.: A segmentation-free approach to Arabic and Urdu OCR. In: Document Recognition and Retrieval XX (Vol. 8658, p. 86580N). International Society for Optics and Photonics (2013) Sabbour, N., Shafait, F.: A segmentation-free approach to Arabic and Urdu OCR. In: Document Recognition and Retrieval XX (Vol. 8658, p. 86580N). International Society for Optics and Photonics (2013)
26.
Zurück zum Zitat Yang, L., Hanneke, S., Carbonell, J.: A theory of transfer learning with applications to active learning. Mach. Learn. 90(2), 161–189 (2013)MathSciNetCrossRefMATH Yang, L., Hanneke, S., Carbonell, J.: A theory of transfer learning with applications to active learning. Mach. Learn. 90(2), 161–189 (2013)MathSciNetCrossRefMATH
27.
Zurück zum Zitat Ng, H.W., Nguyen, V.D., Vonikakis, V. Winkler, S.: November. Deep learning for emotion recognition on small datasets using transfer learning. In: Proceedings of the 2015 ACM on international conference on multimodal interaction, pp. 443-449 (2015) Ng, H.W., Nguyen, V.D., Vonikakis, V. Winkler, S.: November. Deep learning for emotion recognition on small datasets using transfer learning. In: Proceedings of the 2015 ACM on international conference on multimodal interaction, pp. 443-449 (2015)
28.
Zurück zum Zitat Jogin, M., Madhulika, M.S., Divya, G.D., Meghana, R.K. Apoorva, S.: May. Feature extraction using convolution neural networks (CNN) and deep learning. In: 2018 3rd IEEE International Conference on Recent Trends in Electronics, In-formation & Communication Technology (RTEICT), pp. 2319–2323. IEEE (2018) Jogin, M., Madhulika, M.S., Divya, G.D., Meghana, R.K. Apoorva, S.: May. Feature extraction using convolution neural networks (CNN) and deep learning. In: 2018 3rd IEEE International Conference on Recent Trends in Electronics, In-formation & Communication Technology (RTEICT), pp. 2319–2323. IEEE (2018)
29.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef
30.
Zurück zum Zitat He, K., Zhang, X., Ren, S. Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S. Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
31.
Zurück zum Zitat Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818-833. Springer, Cham (2014) Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818-833. Springer, Cham (2014)
32.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
33.
Zurück zum Zitat Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
34.
Zurück zum Zitat Uddin, I., Javed, N., Siddiqi, I.A., Khalid, S., Khurshid, K.: Recognition of printed Urdu ligatures using convolutional neural networks. J. Electron. Imaging 28(3), 033004 (2019)CrossRef Uddin, I., Javed, N., Siddiqi, I.A., Khalid, S., Khurshid, K.: Recognition of printed Urdu ligatures using convolutional neural networks. J. Electron. Imaging 28(3), 033004 (2019)CrossRef
Metadaten
Titel
A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks
verfasst von
Aejaz Farooq Ganai
Farida Khursheed
Publikationsdatum
07.10.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 4/2022
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-022-00414-7

Weitere Artikel der Ausgabe 4/2022

International Journal on Document Analysis and Recognition (IJDAR) 4/2022 Zur Ausgabe

Premium Partner