Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 2/2022

16-09-2021 | Research Article-Computer Engineering and Computer Science

Isolated Video-Based Arabic Sign Language Recognition Using Convolutional and Recursive Neural Networks

Authors: Abdelbasset Boukdir, Mohamed Benaddy, Ayoub Ellahyani, Othmane El Meslouhi, Mustapha Kardouchi

Published in: Arabian Journal for Science and Engineering | Issue 2/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

For natural and meaningful communication between the deaf community and the hearing population, sign language is very important. Most of the Arab sign recognition studies have focused on the identification of the sign action based on the descriptor of the feature. However, the limitation of this traditional method is the need to choose which features are important in each particular sequence. To address this issue, we propose a novel approach based on a deep learning architecture to classify video sequences of Arabic sign language, especially Moroccan sign language. Two methods of classification are applied, namely 2D convolutional recurring neural network (2DCRNN) and 3D convolutional neural network (3DCNN). Concerning the first method, a 2DCRNN model is used to extract features with a recurring network pattern to detect the relationship between frames. The second method uses a 3DCNN model learning the spatiotemporal features out of small patches. After 2DCRNN and the 3DCNN models extracted feature, the video data are classified into various classes, using a fully connected network. The proposed approach is trained over a collection of 224 videos of five individuals performing 56 different signs. The results achieved through the fourfold cross-validation technique demonstrate the performance of the proposed approach in terms of recall, F1 score, and AUROC, with the level accuracy of 92% for 2DCRNN and 99% for 3DCNN.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Napier, J.; Leeson, L.: Sign language in action. In: Sign language in action, pp. 50–84. Springer (2016) Napier, J.; Leeson, L.: Sign language in action. In: Sign language in action, pp. 50–84. Springer (2016)
2.
go back to reference Sandler, W.; Lillo-Martin, D.: Sign Language and Linguistic Universals. Cambridge University Press, Cambridge (2006)CrossRef Sandler, W.; Lillo-Martin, D.: Sign Language and Linguistic Universals. Cambridge University Press, Cambridge (2006)CrossRef
3.
go back to reference Yeasin, M.; Chaudhuri, S.: Visual understanding of dynamic hand gestures. Pattern Recognit. 33(11), 1805–1817 (2000)CrossRef Yeasin, M.; Chaudhuri, S.: Visual understanding of dynamic hand gestures. Pattern Recognit. 33(11), 1805–1817 (2000)CrossRef
4.
go back to reference NIDCD Fact Sheet, H., Balance: American sign language. NIH Publication No. 11-4756 March 2019 (2019) NIDCD Fact Sheet, H., Balance: American sign language. NIH Publication No. 11-4756 March 2019 (2019)
5.
go back to reference Yang, S.; Zhu, Q.: Video-based chinese sign language recognition using convolutional neural network. In: 2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN), pp. 929–934. IEEE (2017) Yang, S.; Zhu, Q.: Video-based chinese sign language recognition using convolutional neural network. In: 2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN), pp. 929–934. IEEE (2017)
6.
go back to reference Al-Fityani, K.; Padden, C.: Sign language geography in the arab world. Sign languages: a Cambridge survey 20,(2010) Al-Fityani, K.; Padden, C.: Sign language geography in the arab world. Sign languages: a Cambridge survey 20,(2010)
7.
go back to reference Esam, N.; Abbas, A.; Krause, P.: Towards empowering hearing impaired students’ skills in computing and technology. Int. J. Adv. Comput. Sci. Appl. 8(1) (2017) Esam, N.; Abbas, A.; Krause, P.: Towards empowering hearing impaired students’ skills in computing and technology. Int. J. Adv. Comput. Sci. Appl. 8(1) (2017)
8.
go back to reference Abdel-Fattah, M.A.: Arabic sign language: a perspective. J. Deaf Stud. Deaf Educ. 10(2), 212–221 (2005)CrossRef Abdel-Fattah, M.A.: Arabic sign language: a perspective. J. Deaf Stud. Deaf Educ. 10(2), 212–221 (2005)CrossRef
9.
go back to reference Gugenheimer, J.; Plaumann, K.; Schaub, F.; Di Campli San Vito, P.; Duck, S.; Rabus, M.; Rukzio, E.: The impact of assistive technology on communication quality between deaf and hearing individuals. In: Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pp. 669–682 (2017) Gugenheimer, J.; Plaumann, K.; Schaub, F.; Di Campli San Vito, P.; Duck, S.; Rabus, M.; Rukzio, E.: The impact of assistive technology on communication quality between deaf and hearing individuals. In: Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pp. 669–682 (2017)
10.
go back to reference Mohandes, M.; Deriche, M.; Liu, J.: Image-based and sensor-based approaches to arabic sign language recognition. IEEE Trans. Hum.-Mach. Syst. 44(4), 551–557 (2014) Mohandes, M.; Deriche, M.; Liu, J.: Image-based and sensor-based approaches to arabic sign language recognition. IEEE Trans. Hum.-Mach. Syst. 44(4), 551–557 (2014)
11.
go back to reference Tubaiz, N.; Shanableh, T.; Assaleh, K.: Glove-based continuous arabic sign language recognition in user-dependent mode. IEEE Trans. Hum.-Mach. Syst. 45(4), 526–533 (2015) Tubaiz, N.; Shanableh, T.; Assaleh, K.: Glove-based continuous arabic sign language recognition in user-dependent mode. IEEE Trans. Hum.-Mach. Syst. 45(4), 526–533 (2015)
12.
go back to reference Mohandes, M.; A-Buraiky, S.; Halawani, T.; Al-Baiyat, S.: Automation of the arabic sign language recognition. In: Proceedings. 2004 International Conference on Information and Communication Technologies: From Theory to Applications, 2004., pp. 479–480. IEEE (2004) Mohandes, M.; A-Buraiky, S.; Halawani, T.; Al-Baiyat, S.: Automation of the arabic sign language recognition. In: Proceedings. 2004 International Conference on Information and Communication Technologies: From Theory to Applications, 2004., pp. 479–480. IEEE (2004)
13.
go back to reference Ibrahim, N.B.; Zayed, H.H.; Selim, M.M.: Advances, challenges and opportunities in continuous sign language recognition. J. Eng. Appl. Sci. 15(5), 1205–1227 (2020) Ibrahim, N.B.; Zayed, H.H.; Selim, M.M.: Advances, challenges and opportunities in continuous sign language recognition. J. Eng. Appl. Sci. 15(5), 1205–1227 (2020)
14.
go back to reference Elons, A.; Ahmed, M.; Shedid, H.; Tolba, M.: Arabic sign language recognition using leap motion sensor. In: 2014 9th International Conference on Computer Engineering & Systems (ICCES), pp. 368–373. IEEE (2014) Elons, A.; Ahmed, M.; Shedid, H.; Tolba, M.: Arabic sign language recognition using leap motion sensor. In: 2014 9th International Conference on Computer Engineering & Systems (ICCES), pp. 368–373. IEEE (2014)
15.
go back to reference Ahmed, A.A.; Aly, S.: Appearance-based arabic sign language recognition using hidden markov models. In: 2014 International Conference on Engineering and Technology (ICET), pp. 1–6. IEEE (2014) Ahmed, A.A.; Aly, S.: Appearance-based arabic sign language recognition using hidden markov models. In: 2014 International Conference on Engineering and Technology (ICET), pp. 1–6. IEEE (2014)
16.
go back to reference Abdi, H.; Williams, L.J.: Principal component analysis. Wiley Interdiscip. Rev. Comput. Stat. 2(4), 433–459 (2010)CrossRef Abdi, H.; Williams, L.J.: Principal component analysis. Wiley Interdiscip. Rev. Comput. Stat. 2(4), 433–459 (2010)CrossRef
17.
go back to reference Heikkilä, M.; Pietikäinen, M.; Schmid, C.: Description of interest regions with local binary patterns. Pattern Recognit. 42(3), 425–436 (2009)CrossRef Heikkilä, M.; Pietikäinen, M.; Schmid, C.: Description of interest regions with local binary patterns. Pattern Recognit. 42(3), 425–436 (2009)CrossRef
18.
go back to reference Shanableh, T.; Assaleh, K.; Al-Rousan, M.: Spatio-temporal feature-extraction techniques for isolated gesture recognition in arabic sign language. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 37(3), 641–650 (2007) Shanableh, T.; Assaleh, K.; Al-Rousan, M.: Spatio-temporal feature-extraction techniques for isolated gesture recognition in arabic sign language. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 37(3), 641–650 (2007)
19.
go back to reference Hemayed, E.E.; Hassanien, A.S.: Edge-based recognizer for arabic sign language alphabet (ars2v-arabic sign to voice). In: 2010 International Computer Engineering Conference (ICENCO), pp. 121–127. IEEE (2010) Hemayed, E.E.; Hassanien, A.S.: Edge-based recognizer for arabic sign language alphabet (ars2v-arabic sign to voice). In: 2010 International Computer Engineering Conference (ICENCO), pp. 121–127. IEEE (2010)
20.
go back to reference Dahmani, D.; Larabi, S.: User-independent system for sign language finger spelling recognition. J. Vis. Commun. Image Represent. 25(5), 1240–1250 (2014)CrossRef Dahmani, D.; Larabi, S.: User-independent system for sign language finger spelling recognition. J. Vis. Commun. Image Represent. 25(5), 1240–1250 (2014)CrossRef
21.
go back to reference Hassan, M.; Assaleh, K.; Shanableh, T.: User-dependent sign language recognition using motion detection. In: 2016 International Conference on Computational Science and Computational Intelligence (CSCI), pp. 852–856. IEEE (2016) Hassan, M.; Assaleh, K.; Shanableh, T.: User-dependent sign language recognition using motion detection. In: 2016 International Conference on Computational Science and Computational Intelligence (CSCI), pp. 852–856. IEEE (2016)
22.
go back to reference Alzohairi, R.; Alghonaim, R.; Alshehri, W.; Aloqeely, S.; Alzaidan, M.; Bchir, O.: Image based arabic sign language recognition system. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 9(3) (2018) Alzohairi, R.; Alghonaim, R.; Alshehri, W.; Aloqeely, S.; Alzaidan, M.; Bchir, O.: Image based arabic sign language recognition system. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 9(3) (2018)
23.
go back to reference Soora, N.R.; Deshpande, P.S.: Review of feature extraction techniques for character recognition. IETE J. Res. 64(2), 280–295 (2018)CrossRef Soora, N.R.; Deshpande, P.S.: Review of feature extraction techniques for character recognition. IETE J. Res. 64(2), 280–295 (2018)CrossRef
24.
go back to reference Adams, S.; Beling, P.A.: A survey of feature selection methods for Gaussian mixture models and hidden Markov models. Artif. Intell. Rev. 52(3), 1739–1779 (2019)CrossRef Adams, S.; Beling, P.A.: A survey of feature selection methods for Gaussian mixture models and hidden Markov models. Artif. Intell. Rev. 52(3), 1739–1779 (2019)CrossRef
25.
go back to reference Deriche, M.; Aliyu, S.O.; Mohandes, M.: An intelligent arabic sign language recognition system using a pair of LMCs with GMM based classification. IEEE Sensors J. 19(18), 8067–8078 (2019)CrossRef Deriche, M.; Aliyu, S.O.; Mohandes, M.: An intelligent arabic sign language recognition system using a pair of LMCs with GMM based classification. IEEE Sensors J. 19(18), 8067–8078 (2019)CrossRef
26.
go back to reference Hayani, S.; Benaddy, M.; El Meslouhi, O.; Kardouchi, M.: Arab sign language recognition with convolutional neural networks. In: 2019 International Conference of Computer Science and Renewable Energies (ICCSRE), pp. 1–4. IEEE (2019) Hayani, S.; Benaddy, M.; El Meslouhi, O.; Kardouchi, M.: Arab sign language recognition with convolutional neural networks. In: 2019 International Conference of Computer Science and Renewable Energies (ICCSRE), pp. 1–4. IEEE (2019)
27.
go back to reference Maraqa, M.; Abu-Zaiter, R.: Recognition of arabic sign language (arsl) using recurrent neural networks. In: 2008 First International Conference on the Applications of Digital Information and Web Technologies (ICADIWT), pp. 478–481. IEEE (2008) Maraqa, M.; Abu-Zaiter, R.: Recognition of arabic sign language (arsl) using recurrent neural networks. In: 2008 First International Conference on the Applications of Digital Information and Web Technologies (ICADIWT), pp. 478–481. IEEE (2008)
28.
go back to reference Li, D.; Rodriguez, C.; Yu, X.; Li, H.: Word-level deep sign language recognition from video: a new large-scale dataset and methods comparison. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1459–1469 (2020) Li, D.; Rodriguez, C.; Yu, X.; Li, H.: Word-level deep sign language recognition from video: a new large-scale dataset and methods comparison. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1459–1469 (2020)
29.
go back to reference Li, D.; Yu, X.; Xu, C.; Petersson, L.; Li, H.: Transferring cross-domain knowledge for video sign language recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6205–6214 (2020) Li, D.; Yu, X.; Xu, C.; Petersson, L.; Li, H.: Transferring cross-domain knowledge for video sign language recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6205–6214 (2020)
30.
go back to reference Albanie, S.; Varol, G.; Momeni, L.; Afouras, T.; Chung, J.S.; Fox, N.; Zisserman, A.: Bsl-1k: scaling up co-articulated sign language recognition using mouthing cues. In: European Conference on Computer Vision, pp. 35–53. Springer (2020) Albanie, S.; Varol, G.; Momeni, L.; Afouras, T.; Chung, J.S.; Fox, N.; Zisserman, A.: Bsl-1k: scaling up co-articulated sign language recognition using mouthing cues. In: European Conference on Computer Vision, pp. 35–53. Springer (2020)
31.
go back to reference Momeni, L.; Varol, G.; Albanie, S.; Afouras, T.; Zisserman, A.: Watch, read and lookup: learning to spot signs from multiple supervisors. In: Proceedings of the Asian Conference on Computer Vision (2020) Momeni, L.; Varol, G.; Albanie, S.; Afouras, T.; Zisserman, A.: Watch, read and lookup: learning to spot signs from multiple supervisors. In: Proceedings of the Asian Conference on Computer Vision (2020)
32.
go back to reference Li, D.; Xu, C.; Yu, X.; Zhang, K.; Swift, B.; Suominen, H.; Li, H.: Tspnet: hierarchical feature learning via temporal semantic pyramid for sign language translation. arXiv:2010.05468 (2020) Li, D.; Xu, C.; Yu, X.; Zhang, K.; Swift, B.; Suominen, H.; Li, H.: Tspnet: hierarchical feature learning via temporal semantic pyramid for sign language translation. arXiv:​2010.​05468 (2020)
33.
go back to reference Renz, K.; Stache, N.C.; Albanie, S.; Varol, G.: Sign language segmentation with temporal convolutional networks. In: ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2135–2139. IEEE (2021) Renz, K.; Stache, N.C.; Albanie, S.; Varol, G.: Sign language segmentation with temporal convolutional networks. In: ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2135–2139. IEEE (2021)
34.
go back to reference Deng, L.; Yu, D.: Deep learning: methods and applications. Found. Trends Signal Process. 7(3–4), 197–387 (2014)MathSciNetCrossRef Deng, L.; Yu, D.: Deep learning: methods and applications. Found. Trends Signal Process. 7(3–4), 197–387 (2014)MathSciNetCrossRef
35.
go back to reference Ibrahim, N.B.; Selim, M.M.; Zayed, H.H.: An automatic arabic sign language recognition system (arslrs). J. King Saud Univ.-Comput. Inf. Sci. 30(4), 470–477 (2018) Ibrahim, N.B.; Selim, M.M.; Zayed, H.H.: An automatic arabic sign language recognition system (arslrs). J. King Saud Univ.-Comput. Inf. Sci. 30(4), 470–477 (2018)
36.
go back to reference ElBadawy, M.; Elons, A.; Shedeed, H.A.; Tolba, M.: Arabic sign language recognition with 3d convolutional neural networks. In: 2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS), pp. 66–71. IEEE (2017) ElBadawy, M.; Elons, A.; Shedeed, H.A.; Tolba, M.: Arabic sign language recognition with 3d convolutional neural networks. In: 2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS), pp. 66–71. IEEE (2017)
Metadata
Title
Isolated Video-Based Arabic Sign Language Recognition Using Convolutional and Recursive Neural Networks
Authors
Abdelbasset Boukdir
Mohamed Benaddy
Ayoub Ellahyani
Othmane El Meslouhi
Mustapha Kardouchi
Publication date
16-09-2021
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 2/2022
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-021-06167-5

Other articles of this Issue 2/2022

Arabian Journal for Science and Engineering 2/2022 Go to the issue

Research Article-Computer Engineering and Computer Science

A Model-Driven Framework for the Development of MVC-Based (Web) Application

Research Article-Computer Engineering and Computer Science

Resource Provisioning Through Machine Learning in Cloud Services

Research Article-Computer Engineering and Computer Science

UAV Communications with Machine Learning: Challenges, Applications and Open Issues

Premium Partners