Skip to main content
Erschienen in: Machine Vision and Applications 5/2013

01.07.2013 | Original Paper

Fast spatiotemporal MACH filter for action recognition

verfasst von: Javed Ahmed, Sadaf Abbasi, M. Zakir Shaikh

Erschienen in: Machine Vision and Applications | Ausgabe 5/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Human action recognition has been an active field of research in computer vision community for the last decade. The spatiotemporal MACH (maximum average correlation height) filter approach has proved to be a very efficient method to solve the problem. It captures the intra-class variability and produces a very high response at the spatiotemporal location \((x,y,t)\) where the action is present in a video. Its computation cost is significantly lower than any other action recognition approach. However, faster algorithm is always needed to perform a computer vision task in real-time. Therefore, we propose a very efficient algorithm for normalized spatiotemporal MACH filtering for action recognition. It is based on the computations performed both in the frequency domain as well as the spatiotemporal domain exploiting integral video. We compare its speed with that of the relevant traditional algorithms and show that our approach drastically outperforms all of them.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Ahmed, J., Jafri, M.N., Shah, M., Akbar, M.: Real-time edge-enhanced dynamic correlation and predictive open-loop car-following control for robust tracking. Mach. Vision Apll. J. 19(1), 1–25 (2008)CrossRef Ahmed, J., Jafri, M.N., Shah, M., Akbar, M.: Real-time edge-enhanced dynamic correlation and predictive open-loop car-following control for robust tracking. Mach. Vision Apll. J. 19(1), 1–25 (2008)CrossRef
2.
Zurück zum Zitat Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: International Conference on Computer Vision, pp. 1395–1402 (2005) Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: International Conference on Computer Vision, pp. 1395–1402 (2005)
3.
Zurück zum Zitat Bobick, A., Davis, J.: The recognition of human movement using temporal templates. IEEE Trans. Pattern Anal. Mach. Intell. 23(3), 257–267 (2001)CrossRef Bobick, A., Davis, J.: The recognition of human movement using temporal templates. IEEE Trans. Pattern Anal. Mach. Intell. 23(3), 257–267 (2001)CrossRef
4.
Zurück zum Zitat Cohn, J.F., Zlochower, A.J., Lien, J., Kanade, T., Analysis, A.F.: Automated face analysis by feature point tracking has high concurrent validity with manual FACS coding. Psychophysiology 36, 35–43 (1999) Cohn, J.F., Zlochower, A.J., Lien, J., Kanade, T., Analysis, A.F.: Automated face analysis by feature point tracking has high concurrent validity with manual FACS coding. Psychophysiology 36, 35–43 (1999)
5.
Zurück zum Zitat Crow, F.: Summed-area tables for texture mapping. Comp Graph 18, 207–211 (1984)CrossRef Crow, F.: Summed-area tables for texture mapping. Comp Graph 18, 207–211 (1984)CrossRef
6.
Zurück zum Zitat Efros, A., Berg, A., Mori, G., Malik, J.: Recognizing action at a distance. IEEE Int Conf Comp Vision 2, 726–733 (2003)CrossRef Efros, A., Berg, A., Mori, G., Malik, J.: Recognizing action at a distance. IEEE Int Conf Comp Vision 2, 726–733 (2003)CrossRef
7.
Zurück zum Zitat Essa, I., Pentland, A.: Coding, analysis, interpretation, and recognition of facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 757–763 (1997)CrossRef Essa, I., Pentland, A.: Coding, analysis, interpretation, and recognition of facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 757–763 (1997)CrossRef
8.
Zurück zum Zitat Frigo, M., Johnson, S.: FFTW: an adaptive software architecture for the FFT. IEEE Int. Conf. Acoust. Speech Signal Process. 3, 1381–1384 (1998) Frigo, M., Johnson, S.: FFTW: an adaptive software architecture for the FFT. IEEE Int. Conf. Acoust. Speech Signal Process. 3, 1381–1384 (1998)
9.
Zurück zum Zitat Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Pearson Prentice Hall, Delhi (2008) Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Pearson Prentice Hall, Delhi (2008)
10.
Zurück zum Zitat Gorelick, L., Blank, M., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. Trans. Pattern Anal. Mach. Intell. 29(12), 2247–2253 (2007)CrossRef Gorelick, L., Blank, M., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. Trans. Pattern Anal. Mach. Intell. 29(12), 2247–2253 (2007)CrossRef
11.
Zurück zum Zitat Lewis, J.P.: Fast normalized cross-correlation. Industrial Light and Magic (1995) Lewis, J.P.: Fast normalized cross-correlation. Industrial Light and Magic (1995)
12.
Zurück zum Zitat Mahalanobis, A., Kumar, B.V.K.V., Song, S., Sims, S., Epperson, J.: Unconstrained correlation filters. Applied Optics 33(17), 3751–3759 (1994) Mahalanobis, A., Kumar, B.V.K.V., Song, S., Sims, S., Epperson, J.: Unconstrained correlation filters. Applied Optics 33(17), 3751–3759 (1994)
13.
Zurück zum Zitat Polana, R., Nelson, R.: Low level recognition of human motion (or how to get your man without finding his body parts). In: IEEE Workshop on Motion of Non-Rigid and Articulated Objects, pp. 77–82 (1994) Polana, R., Nelson, R.: Low level recognition of human motion (or how to get your man without finding his body parts). In: IEEE Workshop on Motion of Non-Rigid and Articulated Objects, pp. 77–82 (1994)
14.
Zurück zum Zitat Porikli, F.: Integral histogram: a fast way to extract histograms in cartesian spaces. In: IEEE Conference on Computer Vision and Pattern Recognition (1995) Porikli, F.: Integral histogram: a fast way to extract histograms in cartesian spaces. In: IEEE Conference on Computer Vision and Pattern Recognition (1995)
15.
Zurück zum Zitat Refregier, P.: Optimal trade-off filters for noise robustness, sharpness of the correlation peak, and horner efficiency. Optics Lett 16(11), 829–831 (1991)CrossRef Refregier, P.: Optimal trade-off filters for noise robustness, sharpness of the correlation peak, and horner efficiency. Optics Lett 16(11), 829–831 (1991)CrossRef
16.
Zurück zum Zitat Rodriguez, M., Ahmed, J., Shah, M.: Action MACH: A spatio-temporal maximum average correlation height filter for action recognition. In: IEEE Conference on Computer Vision and, Pattern Recognition, pp. 1–8 (2008) Rodriguez, M., Ahmed, J., Shah, M.: Action MACH: A spatio-temporal maximum average correlation height filter for action recognition. In: IEEE Conference on Computer Vision and, Pattern Recognition, pp. 1–8 (2008)
17.
Zurück zum Zitat Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local SVM approach. In: International Conference on, Pattern Recognition, pp. 32–36 (2004) Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local SVM approach. In: International Conference on, Pattern Recognition, pp. 32–36 (2004)
18.
Zurück zum Zitat Shechtman, E., Irani, M.: Space-time behavior based correlation. In: IEEE Conference on Computer Vision and Pattern Recognition (2005) Shechtman, E., Irani, M.: Space-time behavior based correlation. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)
19.
Zurück zum Zitat Tapia, E.: A note on the computation of high dimensional integral images. Pattern Recognit Lett 32, 197–201 (2011)CrossRef Tapia, E.: A note on the computation of high dimensional integral images. Pattern Recognit Lett 32, 197–201 (2011)CrossRef
20.
Zurück zum Zitat li Tian, Y., Kanade, T., Cohn, T.F.: Recognizing action units for facial expression analysis. IEEE Trans. Pattern Anal. Mach. Intell. 23, 97–115 (1999) li Tian, Y., Kanade, T., Cohn, T.F.: Recognizing action units for facial expression analysis. IEEE Trans. Pattern Anal. Mach. Intell. 23, 97–115 (1999)
21.
Zurück zum Zitat Viola, P., Jones, M.: Robust real-time object detection. Int. J. Comput. Vision 57(2), 137–154 (2004)CrossRef Viola, P., Jones, M.: Robust real-time object detection. Int. J. Comput. Vision 57(2), 137–154 (2004)CrossRef
Metadaten
Titel
Fast spatiotemporal MACH filter for action recognition
verfasst von
Javed Ahmed
Sadaf Abbasi
M. Zakir Shaikh
Publikationsdatum
01.07.2013
Verlag
Springer-Verlag
Erschienen in
Machine Vision and Applications / Ausgabe 5/2013
Print ISSN: 0932-8092
Elektronische ISSN: 1432-1769
DOI
https://doi.org/10.1007/s00138-013-0484-2

Weitere Artikel der Ausgabe 5/2013

Machine Vision and Applications 5/2013 Zur Ausgabe