Skip to main content

2015 | OriginalPaper | Buchkapitel

Compressed-Domain Based Camera Motion Estimation for Realtime Action Recognition

verfasst von : Huafeng Chen, Jun Chen, Hongyang Li, Zengmin Xu, Ruimin Hu

Erschienen in: Advances in Multimedia Information Processing -- PCM 2015

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Camera motions seriously affect the accuracy of action recognition. Traditional methods address this issue through estimating and compensating camera motions based on optical flow in pixel-domain. But the high computational complexity of optical flow hinders these methods from applying to realtime scenarios. In this paper, we advance an efficient camera motion estimation and compensation method for realtime action recognition by exploiting motion vectors in video compressed-domain (a.k.a. compressed-domain global motion estimation, CGME). Taking advantage of geometric symmetry and differential theory of motion vectors, we estimate the parameters of camera affine transformation. These parameters are then used to compensate the initial motion vectors to retain crucial object motions. Finally, we extract video features for action recognition based on compensated motion vectors. Experimental results show that our method improves the speed of camera motion estimation by over 100 times with a minor reduction of about \(4\,\%\) in recognition accuracy compared with iDT.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Wang, H., Schmid, C.: Action recognition with improved trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2013) Wang, H., Schmid, C.: Action recognition with improved trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2013)
2.
Zurück zum Zitat Wu, S., Oreifej, O., Shah, M.: Action recognition in videos acquired by a moving camera using motion decomposition of lagrangian particle trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2011) Wu, S., Oreifej, O., Shah, M.: Action recognition in videos acquired by a moving camera using motion decomposition of lagrangian particle trajectories. In: IEEE International Conference on Computer Vision (ICCV) (2011)
3.
Zurück zum Zitat Park, D., Zitnick, C.L., Ramanan, D., Dollr, P.: Exploring weak stabilization for motion feature extraction. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013) Park, D., Zitnick, C.L., Ramanan, D., Dollr, P.: Exploring weak stabilization for motion feature extraction. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
4.
Zurück zum Zitat Jain, M., Jgou, H., Bouthemy, P.: Better exploiting motion for better action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013) Jain, M., Jgou, H., Bouthemy, P.: Better exploiting motion for better action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
5.
Zurück zum Zitat Kantorov, V., Laptev, I.: Efficient feature extraction, encoding, and classification for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014) Kantorov, V., Laptev, I.: Efficient feature extraction, encoding, and classification for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014)
6.
Zurück zum Zitat Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACM Computing Surveys (CSUR) (2011) Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACM Computing Surveys (CSUR) (2011)
7.
Zurück zum Zitat Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Action recognition by dense trajectories. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011) Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Action recognition by dense trajectories. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011)
8.
Zurück zum Zitat Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Dense trajectories and motion boundary descriptors for action recognition. International Journal of Computer Vision (IJCV) (2013) Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Dense trajectories and motion boundary descriptors for action recognition. International Journal of Computer Vision (IJCV) (2013)
9.
Zurück zum Zitat Laptev, I.: On space-time interest points. International Journal of Computer Vision (IJCV) (2005) Laptev, I.: On space-time interest points. International Journal of Computer Vision (IJCV) (2005)
10.
Zurück zum Zitat Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: ACM International Conference on Multimedia (ACM MM) (2007) Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: ACM International Conference on Multimedia (ACM MM) (2007)
11.
Zurück zum Zitat Willems, G., Tuytelaars, T., Van Gool, L.: An efficient dense and scale-invariant spatio-temporal interest point detector. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 650–663. Springer, Heidelberg (2008) CrossRef Willems, G., Tuytelaars, T., Van Gool, L.: An efficient dense and scale-invariant spatio-temporal interest point detector. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 650–663. Springer, Heidelberg (2008) CrossRef
12.
Zurück zum Zitat Klaser, A., Marszalek, M.: A spatio-temporal descriptor based on 3d-gradients. In: British Machine Vision Conference (BMVC) (2008) Klaser, A., Marszalek, M.: A spatio-temporal descriptor based on 3d-gradients. In: British Machine Vision Conference (BMVC) (2008)
13.
Zurück zum Zitat Yeffet, L., Wolf, L.: Local trinary patterns for human action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2009) Yeffet, L., Wolf, L.: Local trinary patterns for human action recognition. In: IEEE International Conference on Computer Vision (ICCV) (2009)
14.
Zurück zum Zitat Chen, M., Hauptmann, A.: Mosift: Recognizing human actions in surveillance videos (2009) Chen, M., Hauptmann, A.: Mosift: Recognizing human actions in surveillance videos (2009)
15.
Zurück zum Zitat Zheng, Y., Tian, X., Chen, Y.: Fast global motion estimation based on symmetry elimination and difference of motion vectors. Journal of Electronics & Information Technology (2009) Zheng, Y., Tian, X., Chen, Y.: Fast global motion estimation based on symmetry elimination and difference of motion vectors. Journal of Electronics & Information Technology (2009)
16.
Zurück zum Zitat Reddy, K., Shah, M.: Recognizing 50 human action categories of web videos. In: Machine Vision and Applications (MVA) (2012) Reddy, K., Shah, M.: Recognizing 50 human action categories of web videos. In: Machine Vision and Applications (MVA) (2012)
17.
Zurück zum Zitat Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: Hmdb: A large video database for human motion recognition. In: IEEE International Conference on Computer Vision (ICCV) (2011) Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: Hmdb: A large video database for human motion recognition. In: IEEE International Conference on Computer Vision (ICCV) (2011)
18.
Zurück zum Zitat Chen, H., Liang, C., Peng, Y., Chang, H.: Integration of digital stabilizer with video codec for digital video cameras. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) (2007) Chen, H., Liang, C., Peng, Y., Chang, H.: Integration of digital stabilizer with video codec for digital video cameras. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) (2007)
Metadaten
Titel
Compressed-Domain Based Camera Motion Estimation for Realtime Action Recognition
verfasst von
Huafeng Chen
Jun Chen
Hongyang Li
Zengmin Xu
Ruimin Hu
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-24075-6_9

Neuer Inhalt