skip to main content
10.1145/2674396.2674421acmotherconferencesArticle/Chapter ViewAbstractPublication PagespetraConference Proceedingsconference-collections
research-article

Sign language recognition using dynamic time warping and hand shape distance based on histogram of oriented gradient features

Published:27 May 2014Publication History

ABSTRACT

Recognizing sign language is a very challenging task in computer vision. One of the more popular approaches, Dynamic Time Warping (DTW), utilizes hand trajectory information to compare a query sign with those in a database of examples. In this work, we conducted an American Sign Language (ASL) recognition experiment on Kinect sign data using DTW for sign trajectory similarity and Histogram of Oriented Gradient (HoG) [5] for hand shape representation. Our results show an improvement over the original work of [14], achieving an 82% accuracy in ranking signs in the 10 matches. In addition to our method that improves sign recognition accuracy, we propose a simple RGB-D alignment tool that can help roughly approximate alignment parameters between the color (RGB) and depth frames.

References

  1. J. Alon, V. Athitsos, Q. Yuan, and S. Sclaroff. A unified framework for gesture recognition and spatiotemporal gesture segmentation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 31(9):1685--1699, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. F.-S. Chen, C.-M. Fu, and C.-L. Huang. Hand gesture recognition using a real-time tracking method and hidden markov models. Image and Vision Computing, 21(8):745--758, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  3. C. Conly, P. Doliotis, P. Jangyodsuk, R. Alonzo, and V. Athitsos. Toward a 3d body part detection video dataset and hand tracking benchmark. In Proceedings of the 6th International Conference on PErvasive Technologies Related to Assistive Environments, PETRA '13, pages 2:1--2:6, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Corradini. Dynamic time warping for off-line recognition of a small gesture vocabulary. In Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, 2001. Proceedings. IEEE ICCV Workshop on, pages 82--89. IEEE, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 1, pages 886--893. IEEE, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. R. Eddy. Hidden markov models. Current opinion in structural biology, 6(3):361--365, 1996.Google ScholarGoogle Scholar
  7. A. Kar. Skeletal tracking using microsoft kinect. Methodology, 1: 1--11, 2010.Google ScholarGoogle Scholar
  8. L. Karlinsky, M. Dinerstein, D. Harari, and S. Ullman. The chains model for detecting parts by their context. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 25--32, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  9. J. Lafferty, A. McCallum, and F. C. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Mittal, A. Zisserman, and P. H. S. Torr. Hand detection using multiple proposals. In British Machine Vision Conference, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  11. T. Starner and A. Pentland. Real-time american sign language recognition from video using hidden markov models. In Motion-Based Recognition, pages 227--243. Springer, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  12. G. Ten Holt, M. Reinders, and E. Hendriks. Multi-dimensional dynamic time warping for gesture recognition. In Thirteenth annual conference of the Advanced School for Computing and Imaging, volume 300, 2007.Google ScholarGoogle Scholar
  13. e. a. Vondrick Carl. Hoggles: Visualizing object detection features, Jan. 2013.Google ScholarGoogle Scholar
  14. H. Wang, R. Stefan, S. Moradi, V. Athitsos, C. Neidle, and F. Kamangar. A system for large vocabulary sign search.Google ScholarGoogle Scholar
  15. S. B. Wang, A. Quattoni, L.-P. Morency, D. Demirdjian, and T. Darrell. Hidden conditional random fields for gesture recognition. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, volume 2, pages 1521--1527. IEEE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. D. Wilson and A. F. Bobick. Parametric hidden markov models for gesture recognition. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 21(9):884--900, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Yang and S. Sarkar. Gesture recognition using hidden markov models from fragmented observations. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, volume 1, pages 766--773. IEEE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. V. M. Z. Zhang, W. H. Liu and V. Athitsos. A viewpoint-independent statistical method for fall detection. In International Conference on Pattern Recognition, pages 3626--3630, Nov 2012.Google ScholarGoogle Scholar
  19. Z. Zhang, R. Alonzo, and V. Athitsos. Experiments with computer vision methods for hand detection. In Proceedings of the 4th International Conference on PErvasive Technologies Related to Assistive Environments, PETRA '11, pages 21:1--21:6, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Z. Zhang, R. Alonzo, and V. Athitsos. Experiments with computer vision methods for hand detection. In Proceedings of the 4th International Conference on PErvasive Technologies Related to Assistive Environments, pages 21:1--21:6, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Sign language recognition using dynamic time warping and hand shape distance based on histogram of oriented gradient features

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      PETRA '14: Proceedings of the 7th International Conference on PErvasive Technologies Related to Assistive Environments
      May 2014
      408 pages
      ISBN:9781450327466
      DOI:10.1145/2674396

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 May 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader