Skip to main content
Erschienen in: Pattern Analysis and Applications 3/2023

23.02.2023 | Industrial and Commercial Application

Detection, tracking, and recognition of isolated multi-stroke gesticulated characters

verfasst von: Kuldeep Singh Yadav, Anish Monsley Kirupakaran, Rabul Hussain Laskar, M. K. Bhuyan

Erschienen in: Pattern Analysis and Applications | Ausgabe 3/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The detection and tracking of the bare hand are the most vital stages in the bare hand gesticulated character recognition system. Applying detection and tracking in an uncontrolled environment is quite challenging due to the variations in pose, position, illumination, occlusion, scale, rotation, speed, and bare hand-like impostors in the background or foreground. The motion blur due to speed variation makes it more complex. A computationally efficient object localization approach, i.e., region-based convolutional neural network on the selective region (RCNN-SR), is introduced in this work to detect the bare hand by overcoming these challenges at a certain level. To handle the motion blur and computational complexity, the paper presents a motion information-based detection and tracking approach using RCNN-SR, point tracker, and Kalman filter. The trajectory of the gesticulated characters is formed by mapping all the centroid points of the localized bare hand. To recognize this gesticulated trajectory, the original-existence-based gesticulated character recognition model is designed in this work by utilizing prior information about the characters. In this, we have tried to overcome the variations in pattern, style, scale, rotation, and illumination. In addition, we have addressed the case sensitivity between the English lowercase and uppercase alphabets by incorporating the boundary information. We have also proposed NITS R-Net, NITS hand gesture database VIIIB, and NITS Gesture image databases in this work. To evaluate proposed models, several benchmark databases are considered.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
6.
Zurück zum Zitat Bhuyan MK, Bora PK, Ghosh D (2008) Trajectory guided recognition of hand gestures having only global motions. World Acad Sci Eng Technol 21:753–764 Bhuyan MK, Bora PK, Ghosh D (2008) Trajectory guided recognition of hand gestures having only global motions. World Acad Sci Eng Technol 21:753–764
13.
Zurück zum Zitat Sharma P, Kokare PM, Kolekar MH (2019) Performance comparison of KLT and CAMSHIFT algorithms for video object tracking. In: Khare A, Tiwary US, Sethi IK, Singh N (eds) Recent trends in communication, computing, and electronics. Springer, Singapore, pp 323–331CrossRef Sharma P, Kokare PM, Kolekar MH (2019) Performance comparison of KLT and CAMSHIFT algorithms for video object tracking. In: Khare A, Tiwary US, Sethi IK, Singh N (eds) Recent trends in communication, computing, and electronics. Springer, Singapore, pp 323–331CrossRef
15.
Zurück zum Zitat Masilang RAA, Cabatuan MK, Dadios EP (2014) Hand initialization and tracking using a modified KLT tracker for a computer vision-based breast self-examination system. In: 2014 international conference on humanoid, nanotechnology, information technology, communication, and control, environment and management (HNICEM). IEEE, Palawan, Philippines, pp 1–5 Masilang RAA, Cabatuan MK, Dadios EP (2014) Hand initialization and tracking using a modified KLT tracker for a computer vision-based breast self-examination system. In: 2014 international conference on humanoid, nanotechnology, information technology, communication, and control, environment and management (HNICEM). IEEE, Palawan, Philippines, pp 1–5
16.
Zurück zum Zitat Singha J, Semwal VB, Laskar RH (2016) An accurate hand tracking system for complex background based on modified KLT tracker. In: 2016 IEEE region 10 conference (TENCON). IEEE, Singapore, pp 3644–3647 Singha J, Semwal VB, Laskar RH (2016) An accurate hand tracking system for complex background based on modified KLT tracker. In: 2016 IEEE region 10 conference (TENCON). IEEE, Singapore, pp 3644–3647
22.
Zurück zum Zitat Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells WM, Frangi AF (eds) Medical image computing and computer-assisted intervention—MICCAI 2015. Springer, Cham, pp 234–241 Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells WM, Frangi AF (eds) Medical image computing and computer-assisted intervention—MICCAI 2015. Springer, Cham, pp 234–241
28.
Zurück zum Zitat McBride TJ, Vandayar N, Nixon KJ (2019) A comparison of skin detection algorithms for hand gesture recognition. In: 2019 Southern African Universities power engineering conference/robotics and mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA). IEEE, Bloemfontein, South Africa, pp 211–216 McBride TJ, Vandayar N, Nixon KJ (2019) A comparison of skin detection algorithms for hand gesture recognition. In: 2019 Southern African Universities power engineering conference/robotics and mechatronics/Pattern Recognition Association of South Africa (SAUPEC/RobMech/PRASA). IEEE, Bloemfontein, South Africa, pp 211–216
29.
Zurück zum Zitat Misra S, Laskar RH (2018) Multi-level analysis of Bit-Plane based GLAC feature and other existing texture features for a Robust hand detection system. In: 2018 international conference on advances in computing, communications and informatics (ICACCI). IEEE, Bangalore, pp 2318–2324 Misra S, Laskar RH (2018) Multi-level analysis of Bit-Plane based GLAC feature and other existing texture features for a Robust hand detection system. In: 2018 international conference on advances in computing, communications and informatics (ICACCI). IEEE, Bangalore, pp 2318–2324
31.
Zurück zum Zitat Le THN, Quach KG, Zhu C et al (2017) Robust hand detection and classification in vehicles and in the wild. In: 2017 IEEE conference on computer vision and pattern recognition workshops (CVPRW). IEEE, Honolulu, HI, USA, pp 1203–1210 Le THN, Quach KG, Zhu C et al (2017) Robust hand detection and classification in vehicles and in the wild. In: 2017 IEEE conference on computer vision and pattern recognition workshops (CVPRW). IEEE, Honolulu, HI, USA, pp 1203–1210
33.
Zurück zum Zitat Zhang M, Cheng X, Copeland D et al (2020) Using computer vision to automate hand detection and tracking of surgeon movements in videos of open surgery. In AMIA annual symposium proceedings, vol. 2020. American Medical Informatics Association, p 1373. Zhang M, Cheng X, Copeland D et al (2020) Using computer vision to automate hand detection and tracking of surgeon movements in videos of open surgery. In AMIA annual symposium proceedings, vol. 2020. American Medical Informatics Association, p 1373.
38.
Zurück zum Zitat Han Y, Kim C, Jang Y, Kim HJ (2020) Parametric analysis of KLT algorithm in autonomous driving. In: 2020 20th international conference on control, automation and systems (ICCAS). IEEE, Busan, Korea (South), pp 184–189 Han Y, Kim C, Jang Y, Kim HJ (2020) Parametric analysis of KLT algorithm in autonomous driving. In: 2020 20th international conference on control, automation and systems (ICCAS). IEEE, Busan, Korea (South), pp 184–189
39.
Zurück zum Zitat Yongyong D, Xinhua H, Yujie Y, Zongling W (2020) Image stabilization algorithm based on KLT motion tracking. In: 2020 international conference on computer vision, image and deep learning (CVIDL). IEEE, Chongqing, China, pp 44–47 Yongyong D, Xinhua H, Yujie Y, Zongling W (2020) Image stabilization algorithm based on KLT motion tracking. In: 2020 international conference on computer vision, image and deep learning (CVIDL). IEEE, Chongqing, China, pp 44–47
40.
Zurück zum Zitat Misra S, Laskar RH (2019) A novel approach towards pattern and speed invariant holistic analysis of dynamic gesture recognition system. In: 2019 9th annual information technology, electromechanical engineering and microelectronics conference (IEMECON). IEEE, Jaipur, India, pp 161–167 Misra S, Laskar RH (2019) A novel approach towards pattern and speed invariant holistic analysis of dynamic gesture recognition system. In: 2019 9th annual information technology, electromechanical engineering and microelectronics conference (IEMECON). IEEE, Jaipur, India, pp 161–167
43.
Zurück zum Zitat Mittal A, Zisserman A, Torr P (2011) Hand detection using multiple proposals. In: Procedings of the British machine vision conference 2011. British Machine Vision Association, Dundee, p 75.1-75.11 Mittal A, Zisserman A, Torr P (2011) Hand detection using multiple proposals. In: Procedings of the British machine vision conference 2011. British Machine Vision Association, Dundee, p 75.1-75.11
44.
Zurück zum Zitat Bambach S, Lee S, Crandall DJ, Yu C (2015) Lending a hand: detecting hands and recognizing activities in complex egocentric interactions. In: 2015 IEEE international conference on computer vision (ICCV). IEEE, Santiago, Chile, pp 1949–1957 Bambach S, Lee S, Crandall DJ, Yu C (2015) Lending a hand: detecting hands and recognizing activities in complex egocentric interactions. In: 2015 IEEE international conference on computer vision (ICCV). IEEE, Santiago, Chile, pp 1949–1957
45.
Zurück zum Zitat Cohen G, Afshar S, Tapson J, van Schaik A (2017) EMNIST: extending MNIST to handwritten letters. In: 2017 international joint conference on neural networks (IJCNN). IEEE, Anchorage, AK, USA, pp 2921–2926 Cohen G, Afshar S, Tapson J, van Schaik A (2017) EMNIST: extending MNIST to handwritten letters. In: 2017 international joint conference on neural networks (IJCNN). IEEE, Anchorage, AK, USA, pp 2921–2926
46.
Zurück zum Zitat Liao Z, Carneiro G (2015) Competitive multi-scale convolution Liao Z, Carneiro G (2015) Competitive multi-scale convolution
48.
Zurück zum Zitat Lee J, Bang J, Yang S-I (2017) Object detection with sliding window in images including multiple similar objects. In: 2017 international conference on information and communication technology convergence (ICTC). IEEE, Jeju, pp 803–806 Lee J, Bang J, Yang S-I (2017) Object detection with sliding window in images including multiple similar objects. In: 2017 international conference on information and communication technology convergence (ICTC). IEEE, Jeju, pp 803–806
49.
Zurück zum Zitat Cruz SR, Chan AB (2018) Hand detection using deformable part models on an egocentric perspective. 2018 digital image computing: techniques and applications (DICTA). IEEE, Canberra, pp 1–7 Cruz SR, Chan AB (2018) Hand detection using deformable part models on an egocentric perspective. 2018 digital image computing: techniques and applications (DICTA). IEEE, Canberra, pp 1–7
Metadaten
Titel
Detection, tracking, and recognition of isolated multi-stroke gesticulated characters
verfasst von
Kuldeep Singh Yadav
Anish Monsley Kirupakaran
Rabul Hussain Laskar
M. K. Bhuyan
Publikationsdatum
23.02.2023
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 3/2023
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-023-01137-z

Weitere Artikel der Ausgabe 3/2023

Pattern Analysis and Applications 3/2023 Zur Ausgabe

Premium Partner