Skip to main content
Erschienen in: Arabian Journal for Science and Engineering 8/2022

18.03.2022 | Research Article-Computer Engineering and Computer Science

UnCNN: A New Directed CNN Model for Isolated Arabic Handwritten Characters Recognition

verfasst von: Mohsine Elkhayati, Youssfi Elkettani

Erschienen in: Arabian Journal for Science and Engineering | Ausgabe 8/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deeper and wider Convolutional Neural Networks (CNN) have achieved peak performance in many applications. However, the greater the depth/width of the network, the more complexity increases resulting in high memory and time consumption. Complexity comes mainly from the large number of free parameters in Fully Connected Layers (FCL) and the number of Multiple Accumulation Operations (MAC) in Convolutional Layers (CL). In most CNNs, 90% of parameters come from FCLs and 99% of MAC operations come from CLs. Complexity directly affects network size, learning time, inference time, and model deployability. In this article, we propose a new approach to reduce complexity and improve the performance of a CNN model for the task of Isolated Handwritten Arabic Character (IHAC) recognition. The idea of the approach was inspired by the unconscious perception concept, which is widely studied in psychology and cognitive sciences. We have attempted to model this concept at the level of hidden FCLs as follows: The neurons of hidden FCLs are divided into two blocks, namely Conscious Block (CB) and Unconscious Block (UB). CB and UB are entirely separate from each other and fully connected to the output layer. CB is large and processes few features (relevant), while UB is small and processes most features (irrelevant). Feature selection is performed at the flatten layer using what we call virtual max-pooling. This strategy significantly reduces connectivity inside FCLs, resulting in fewer free parameters. Fewer parameters mean more freedom to add more FCLs without high costs. This allows reducing the number of CLs without affecting performance, thereby reducing MAC operations. Furthermore, the strategy improves performance by focusing the network's attention on representative features. To evaluate our approach, experiments were carried out on four benchmark databases for IHAC that are: IFHCDB, AHCD, AIA9K, and HACDB. Compared to a basic CNN model, the proposed model achieved higher performance, reduced free parameters, the network’s size, training, and inference time. It also outperformed many recent models for IHAC recognition.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Chen, Y.-Y.; Lin, Y.-H.; Kung, C.-C.; Chung, M.-H.; Yen, I.-H.: Design and implementation of cloud analytics-assisted smart power meters considering advanced artificial intelligence as edge analytics in demand-side management for smart homes. Sensors (2019). https://doi.org/10.3390/s19092047CrossRef Chen, Y.-Y.; Lin, Y.-H.; Kung, C.-C.; Chung, M.-H.; Yen, I.-H.: Design and implementation of cloud analytics-assisted smart power meters considering advanced artificial intelligence as edge analytics in demand-side management for smart homes. Sensors (2019). https://​doi.​org/​10.​3390/​s19092047CrossRef
7.
10.
Zurück zum Zitat Simonyan, K.; Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2014) Simonyan, K.; Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2014)
18.
Zurück zum Zitat Ba, J.; Mnih, V.; Kavukcuoglu, K.: Multiple object recognition with visual attention. In: 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference Track Proceedings, 2015. [Online]. http://arxiv.org/abs/1412.7755. Accessed 17 Nov 2021 Ba, J.; Mnih, V.; Kavukcuoglu, K.: Multiple object recognition with visual attention. In: 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference Track Proceedings, 2015. [Online]. http://​arxiv.​org/​abs/​1412.​7755. Accessed 17 Nov 2021
20.
Zurück zum Zitat Gregor, K.; Danihelka, I.; Graves, A.; Rezende, D.; Wierstra, D.: DRAW: a recurrent neural network for image generation. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 1462–1471 (2015). Accessed: Nov. 16, 2021. [Online]. https://proceedings.mlr.press/v37/gregor15.html. Accessed 16 Nov 2021 Gregor, K.; Danihelka, I.; Graves, A.; Rezende, D.; Wierstra, D.: DRAW: a recurrent neural network for image generation. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 1462–1471 (2015). Accessed: Nov. 16, 2021. [Online]. https://​proceedings.​mlr.​press/​v37/​gregor15.​html. Accessed 16 Nov 2021
21.
Zurück zum Zitat Li, H.; Kadav, A.; Durdanovic, I.; Samet, H.; Graf, H.P.: Pruning filters for efficient ConvNets. Presented at the 5th International Conference on Learning Representations, ICLR 2017, Toulon, France. [Online]. https://openreview.net/forum?id=rJqFGTslg. Accessed 6 Nov 2021 Li, H.; Kadav, A.; Durdanovic, I.; Samet, H.; Graf, H.P.: Pruning filters for efficient ConvNets. Presented at the 5th International Conference on Learning Representations, ICLR 2017, Toulon, France. [Online]. https://​openreview.​net/​forum?​id=​rJqFGTslg. Accessed 6 Nov 2021
22.
Zurück zum Zitat Mozaffari, S.; Faez, K.; Faradji, F.; Ziaratban, M.; Golzan, S.M.: A comprehensive isolated Farsi/Arabic character database for handwritten Ocr research. In: Proceedings of the 10th International Workshop on Frontiers in, La Baule, France, pp. 385–389 (2006) Mozaffari, S.; Faez, K.; Faradji, F.; Ziaratban, M.; Golzan, S.M.: A comprehensive isolated Farsi/Arabic character database for handwritten Ocr research. In: Proceedings of the 10th International Workshop on Frontiers in, La Baule, France, pp. 385–389 (2006)
23.
Zurück zum Zitat Elsawy, A.; Loey, M.; El-Bakry, H.: Arabic handwritten characters recognition using convolutional neural network. WSEAS Trans. Comput. Res. 5, 11–19 (2017) Elsawy, A.; Loey, M.; El-Bakry, H.: Arabic handwritten characters recognition using convolutional neural network. WSEAS Trans. Comput. Res. 5, 11–19 (2017)
24.
Zurück zum Zitat Torki, M.; Hussein, M.E.; Elsallamy, A.; Fayyaz, M.; Yaser, S.: Window-based descriptors for arabic handwritten alphabet recognition: a comparative study on a novel dataset. arXiv:1411.3519 [cs] (2014). [Online]. http://arxiv.org/abs/1411.3519. Accessed 11 Jun 2021 Torki, M.; Hussein, M.E.; Elsallamy, A.; Fayyaz, M.; Yaser, S.: Window-based descriptors for arabic handwritten alphabet recognition: a comparative study on a novel dataset. arXiv:1411.3519 [cs] (2014). [Online]. http://​arxiv.​org/​abs/​1411.​3519. Accessed 11 Jun 2021
25.
Zurück zum Zitat Lawgali, A.; Angelova, M.; Bouridane, A.: HACDB: handwritten Arabic characters database for automatic character recognition. In: European Workshop on Visual Information Processing (EUVIP), pp. 255–259 (2013) Lawgali, A.; Angelova, M.; Bouridane, A.: HACDB: handwritten Arabic characters database for automatic character recognition. In: European Workshop on Visual Information Processing (EUVIP), pp. 255–259 (2013)
27.
Zurück zum Zitat Freud, S.: Die traumdeutung. Franz Deuticke, Vienna (1900) Freud, S.: Die traumdeutung. Franz Deuticke, Vienna (1900)
33.
46.
Zurück zum Zitat Zhao, R. et al.: Accelerating binarized convolutional neural networks with software-programmable FPGAs. In: Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Monterey California USA, pp. 15–24 (2017). https://doi.org/10.1145/3020078.3021741 Zhao, R. et al.: Accelerating binarized convolutional neural networks with software-programmable FPGAs. In: Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Monterey California USA, pp. 15–24 (2017). https://​doi.​org/​10.​1145/​3020078.​3021741
47.
Zurück zum Zitat Chen, W.; Wilson, J.T.; Tyree, S.; Weinberger, K.Q.; Chen, Y.: Compressing neural networks with the hashing trick. In: Proceedings of the 32nd International Conference on International Conference on Machine Learning, vol. 37, Lille, France, pp. 2285–2294 (2015) Chen, W.; Wilson, J.T.; Tyree, S.; Weinberger, K.Q.; Chen, Y.: Compressing neural networks with the hashing trick. In: Proceedings of the 32nd International Conference on International Conference on Machine Learning, vol. 37, Lille, France, pp. 2285–2294 (2015)
52.
Zurück zum Zitat Golik, P.; Doetsch, P.; Ney, H.: Cross-entropy vs. squared error training: a theoretical and experimental comparison. In: 14th Annual Conference of the International Speech Communication Association, Lyon, France, pp. 1756–1760 (2013) Golik, P.; Doetsch, P.; Ney, H.: Cross-entropy vs. squared error training: a theoretical and experimental comparison. In: 14th Annual Conference of the International Speech Communication Association, Lyon, France, pp. 1756–1760 (2013)
61.
Zurück zum Zitat Elkhayati, M.; Elkettani, Y.: Towards directing convolutional neural networks using computational geometry algorithms: application to handwritten Arabic character recognition. Adv. Sci. Technol. Eng. Syst. J. 5(5), 137–147 (2020). https://doi.org/10.25046/aj050519CrossRef Elkhayati, M.; Elkettani, Y.: Towards directing convolutional neural networks using computational geometry algorithms: application to handwritten Arabic character recognition. Adv. Sci. Technol. Eng. Syst. J. 5(5), 137–147 (2020). https://​doi.​org/​10.​25046/​aj050519CrossRef
62.
64.
Zurück zum Zitat , Nair, V.; Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on International Conference on Machine Learning, Madison, WI, USA, pp. 807–814 (2010) , Nair, V.; Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on International Conference on Machine Learning, Madison, WI, USA, pp. 807–814 (2010)
65.
Zurück zum Zitat Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(56), 1929–1958 (2014)MathSciNetMATH Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(56), 1929–1958 (2014)MathSciNetMATH
68.
Zurück zum Zitat Mohd, K.N.; Adnan, A.H.M.; Yusof, A.A.; Ahmad, M.K.; Kamal, M.A.M. : Teaching Arabic language to Malaysian University Students using education technologies based on education 4.0 principles, pp. 38–51 (2019) Mohd, K.N.; Adnan, A.H.M.; Yusof, A.A.; Ahmad, M.K.; Kamal, M.A.M. : Teaching Arabic language to Malaysian University Students using education technologies based on education 4.0 principles, pp. 38–51 (2019)
75.
Zurück zum Zitat Chollet, F.: Deep Learning with Python. Manning Publications, Shelter Island (2017) Chollet, F.: Deep Learning with Python. Manning Publications, Shelter Island (2017)
77.
Zurück zum Zitat Najadat, H.M.; Alshboul, A.A.; Alabed, A.F.: Arabic handwritten characters recognition using convolutional neural network. In: 2019 10th International Conference on Information and Communication Systems (ICICS), Irbid, Jordan, pp. 147–151 (2019). https://doi.org/10.1109/IACS.2019.8809122 Najadat, H.M.; Alshboul, A.A.; Alabed, A.F.: Arabic handwritten characters recognition using convolutional neural network. In: 2019 10th International Conference on Information and Communication Systems (ICICS), Irbid, Jordan, pp. 147–151 (2019). https://​doi.​org/​10.​1109/​IACS.​2019.​8809122
Metadaten
Titel
UnCNN: A New Directed CNN Model for Isolated Arabic Handwritten Characters Recognition
verfasst von
Mohsine Elkhayati
Youssfi Elkettani
Publikationsdatum
18.03.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Arabian Journal for Science and Engineering / Ausgabe 8/2022
Print ISSN: 2193-567X
Elektronische ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-022-06652-5

Weitere Artikel der Ausgabe 8/2022

Arabian Journal for Science and Engineering 8/2022 Zur Ausgabe

Research Article-Computer Engineering and Computer Science

Fingerprint Denoising Using Iterative Rule-Based Filter

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.