Skip to main content
Top

2015 | OriginalPaper | Chapter

Compressed-Domain Based Camera Motion Estimation for Realtime Action Recognition

Authors : Huafeng Chen, Jun Chen, Hongyang Li, Zengmin Xu, Ruimin Hu

Published in: Advances in Multimedia Information Processing -- PCM 2015

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Camera motions seriously affect the accuracy of action recognition. Traditional methods address this issue through estimating and compensating camera motions based on optical flow in pixel-domain. But the high computational complexity of optical flow hinders these methods from applying to realtime scenarios. In this paper, we advance an efficient camera motion estimation and compensation method for realtime action recognition by exploiting motion vectors in video compressed-domain (a.k.a. compressed-domain global motion estimation, CGME). Taking advantage of geometric symmetry and differential theory of motion vectors, we estimate the parameters of camera affine transformation. These parameters are then used to compensate the initial motion vectors to retain crucial object motions. Finally, we extract video features for action recognition based on compensated motion vectors. Experimental results show that our method improves the speed of camera motion estimation by over 100 times with a minor reduction of about \(4\,\%\) in recognition accuracy compared with iDT.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Wang, H., Schmid, C.: Action recognition with improved trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2013) Wang, H., Schmid, C.: Action recognition with improved trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2013)
2.
go back to reference Wu, S., Oreifej, O., Shah, M.: Action recognition in videos acquired by a moving camera using motion decomposition of lagrangian particle trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2011) Wu, S., Oreifej, O., Shah, M.: Action recognition in videos acquired by a moving camera using motion decomposition of lagrangian particle trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2011)
3.
go back to reference Park, D., Zitnick, C.L., Ramanan, D., Dollr, P.: Exploring weak stabilization for motion feature extraction. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013) Park, D., Zitnick, C.L., Ramanan, D., Dollr, P.: Exploring weak stabilization for motion feature extraction. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
4.
go back to reference Jain, M., Jgou, H., Bouthemy, P.: Better exploiting motion for better action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013) Jain, M., Jgou, H., Bouthemy, P.: Better exploiting motion for better action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
5.
go back to reference Kantorov, V., Laptev, I.: Efficient feature extraction, encoding, and classification for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014) Kantorov, V., Laptev, I.: Efficient feature extraction, encoding, and classification for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014)
6.
go back to reference Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACM Computing Surveys (CSUR) (2011) Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACM Computing Surveys (CSUR) (2011)
7.
go back to reference Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Action recognition by dense trajectories. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011) Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Action recognition by dense trajectories. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011)
8.
go back to reference Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Dense trajectories and motion boundary descriptors for action recognition. International Journal of Computer Vision (IJCV) (2013) Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Dense trajectories and motion boundary descriptors for action recognition. International Journal of Computer Vision (IJCV) (2013)
9.
go back to reference Laptev, I.: On space-time interest points. International Journal of Computer Vision (IJCV) (2005) Laptev, I.: On space-time interest points. International Journal of Computer Vision (IJCV) (2005)
10.
go back to reference Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: ACM International Conference on Multimedia (ACM MM) (2007) Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: ACM International Conference on Multimedia (ACM MM) (2007)
11.
go back to reference Willems, G., Tuytelaars, T., Van Gool, L.: An efficient dense and scale-invariant spatio-temporal interest point detector. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 650–663. Springer, Heidelberg (2008) CrossRef Willems, G., Tuytelaars, T., Van Gool, L.: An efficient dense and scale-invariant spatio-temporal interest point detector. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 650–663. Springer, Heidelberg (2008) CrossRef
12.
go back to reference Klaser, A., Marszalek, M.: A spatio-temporal descriptor based on 3d-gradients. In: British Machine Vision Conference (BMVC) (2008) Klaser, A., Marszalek, M.: A spatio-temporal descriptor based on 3d-gradients. In: British Machine Vision Conference (BMVC) (2008)
13.
go back to reference Yeffet, L., Wolf, L.: Local trinary patterns for human action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2009) Yeffet, L., Wolf, L.: Local trinary patterns for human action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2009)
14.
go back to reference Chen, M., Hauptmann, A.: Mosift: Recognizing human actions in surveillance videos (2009) Chen, M., Hauptmann, A.: Mosift: Recognizing human actions in surveillance videos (2009)
15.
go back to reference Zheng, Y., Tian, X., Chen, Y.: Fast global motion estimation based on symmetry elimination and difference of motion vectors. Journal of Electronics & Information Technology (2009) Zheng, Y., Tian, X., Chen, Y.: Fast global motion estimation based on symmetry elimination and difference of motion vectors. Journal of Electronics & Information Technology (2009)
16.
go back to reference Reddy, K., Shah, M.: Recognizing 50 human action categories of web videos. In: Machine Vision and Applications (MVA) (2012) Reddy, K., Shah, M.: Recognizing 50 human action categories of web videos. In: Machine Vision and Applications (MVA) (2012)
17.
go back to reference Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: Hmdb: A large video database for human motion recognition. In: IEEE International Conference on Computer Vision (ICCV) (2011) Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: Hmdb: A large video database for human motion recognition. In: IEEE International Conference on Computer Vision (ICCV) (2011)
18.
go back to reference Chen, H., Liang, C., Peng, Y., Chang, H.: Integration of digital stabilizer with video codec for digital video cameras. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) (2007) Chen, H., Liang, C., Peng, Y., Chang, H.: Integration of digital stabilizer with video codec for digital video cameras. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) (2007)
Metadata
Title
Compressed-Domain Based Camera Motion Estimation for Realtime Action Recognition
Authors
Huafeng Chen
Jun Chen
Hongyang Li
Zengmin Xu
Ruimin Hu
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-24075-6_9