Skip to main content

2018 | OriginalPaper | Buchkapitel

A New Large Scale Dynamic Texture Dataset with Application to ConvNet Understanding

verfasst von : Isma Hadji, Richard P. Wildes

Erschienen in: Computer Vision – ECCV 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We introduce a new large scale dynamic texture dataset. With over 10,000 videos, our Dynamic Texture DataBase (DTDB) is two orders of magnitude larger than any previously available dynamic texture dataset. DTDB comes with two complementary organizations, one based on dynamics independent of spatial appearance and one based on spatial appearance independent of dynamics. The complementary organizations allow for uniquely insightful experiments regarding the abilities of major classes of spatiotemporal ConvNet architectures to exploit appearance vs. dynamic information. We also present a new two-stream ConvNet that provides an alternative to the standard optical-flow-based motion stream to broaden the range of dynamic patterns that can be encompassed. The resulting motion stream is shown to outperform the traditional optical flow stream by considerable margins. Finally, the utility of DTDB as a pretraining substrate is demonstrated via transfer learning on a different dynamic texture dataset as well as the companion task of dynamic scene recognition resulting in a new state-of-the-art.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
3.
Zurück zum Zitat Carreira, J., Zisserman, A.: Quo vadis, action recognition? A new model and the kinetics dataset. In: CVPR (2017) Carreira, J., Zisserman, A.: Quo vadis, action recognition? A new model and the kinetics dataset. In: CVPR (2017)
4.
Zurück zum Zitat Chetverikov, D., Peteri, R.: A brief survey of dynamic texture description and recognition. In: CORES (2005) Chetverikov, D., Peteri, R.: A brief survey of dynamic texture description and recognition. In: CORES (2005)
5.
Zurück zum Zitat Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., Vedaldi, A.: Describing textures in the wild. In: CVPR (2014) Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., Vedaldi, A.: Describing textures in the wild. In: CVPR (2014)
6.
Zurück zum Zitat Cimpoi, M., Maji, S., Vedaldi, A.: Deep filter banks for texture recognition and segmentation. In: CVPR (2015) Cimpoi, M., Maji, S., Vedaldi, A.: Deep filter banks for texture recognition and segmentation. In: CVPR (2015)
7.
Zurück zum Zitat Dai, D., Riemenschneider, H., Gool, L.: The synthesizability of texture examples. In: CVPR (2014) Dai, D., Riemenschneider, H., Gool, L.: The synthesizability of texture examples. In: CVPR (2014)
8.
Zurück zum Zitat Derpanis, K., Wildes, R.P.: Spacetime texture representation and recognition based on spatiotemporal orientation analysis. PAMI 34, 1193–1205 (2012)CrossRef Derpanis, K., Wildes, R.P.: Spacetime texture representation and recognition based on spatiotemporal orientation analysis. PAMI 34, 1193–1205 (2012)CrossRef
9.
Zurück zum Zitat Derpanis, K.G., Wildes, R.P.: Dynamic texture recognition based on distributions of spacetime oriented structure. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 191–198, June 2010 Derpanis, K.G., Wildes, R.P.: Dynamic texture recognition based on distributions of spacetime oriented structure. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 191–198, June 2010
10.
Zurück zum Zitat Doretto, G., Chiuso, A., Wu, Y., Soatto, S.: Dynamic textures. IJCV 51, 91–109 (2003)CrossRef Doretto, G., Chiuso, A., Wu, Y., Soatto, S.: Dynamic textures. IJCV 51, 91–109 (2003)CrossRef
11.
Zurück zum Zitat Dubois, S., Peteri, R., Michel, M.: Characterization and recognition of dynamic textures based on the 2D+T curvelet. Sig. Im. Vid. Proc. 9, 819–830 (2013) Dubois, S., Peteri, R., Michel, M.: Characterization and recognition of dynamic textures based on the 2D+T curvelet. Sig. Im. Vid. Proc. 9, 819–830 (2013)
12.
Zurück zum Zitat Feichtenhofer, C., Pinz, A., Wildes., R.P.: Spatiotemporal residual networks for video action recognition. In: NIPS (2016) Feichtenhofer, C., Pinz, A., Wildes., R.P.: Spatiotemporal residual networks for video action recognition. In: NIPS (2016)
13.
Zurück zum Zitat Feichtenhofer, C., Pinz, A., Wildes., R.P.: Temporal residual networks for dynamic scene recognition. In: CVPR (2017) Feichtenhofer, C., Pinz, A., Wildes., R.P.: Temporal residual networks for dynamic scene recognition. In: CVPR (2017)
15.
Zurück zum Zitat Hadji, I., Wildes, R.P.: A spatiotemporal oriented energy network for dynamic texture recognition. In: ICCV (2017) Hadji, I., Wildes, R.P.: A spatiotemporal oriented energy network for dynamic texture recognition. In: ICCV (2017)
16.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun., J.: Deep residual learning for image recognition. In: CVPR (2016) He, K., Zhang, X., Ren, S., Sun., J.: Deep residual learning for image recognition. In: CVPR (2016)
17.
Zurück zum Zitat Ji, S., Xu, W., Yang, M., Yu, K.: 3D convolutional neural networks for human action recognition. PAMI 35, 1915–1929 (2013)CrossRef Ji, S., Xu, W., Yang, M., Yu, K.: 3D convolutional neural networks for human action recognition. PAMI 35, 1915–1929 (2013)CrossRef
18.
Zurück zum Zitat Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-scale video classification with convolutional neural networks. In: CVPR (2014) Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-scale video classification with convolutional neural networks. In: CVPR (2014)
19.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
20.
21.
Zurück zum Zitat Lin, T.Y., Maji, S.: Visualizing and understanding deep texture representations. In: CVPR (2016) Lin, T.Y., Maji, S.: Visualizing and understanding deep texture representations. In: CVPR (2016)
22.
Zurück zum Zitat Ng, J., Hausknecht, M., Vijayanarasimhan, S., Vinyals, O., Monga, R., Toderici., G.: Beyond short snippets: deep networks for video classification. In: CVPR (2015) Ng, J., Hausknecht, M., Vijayanarasimhan, S., Vinyals, O., Monga, R., Toderici., G.: Beyond short snippets: deep networks for video classification. In: CVPR (2015)
24.
Zurück zum Zitat Peteri, R., Sandor, F., Huiskes, M.: DynTex: a comprehensive database of dynamic textures. PRL 31, 1627–1632 (2010)CrossRef Peteri, R., Sandor, F., Huiskes, M.: DynTex: a comprehensive database of dynamic textures. PRL 31, 1627–1632 (2010)CrossRef
26.
Zurück zum Zitat Quan, Y., Bao, C., Ji, H.: Equiangular kernel dicitionary learning with applications to dynamic textures analysis. In: CVPR (2016) Quan, Y., Bao, C., Ji, H.: Equiangular kernel dicitionary learning with applications to dynamic textures analysis. In: CVPR (2016)
27.
Zurück zum Zitat Quan, Y., Huang, Y., Ji, H.: Dynamic texture recognition via orthogonal tensor dictionary learning. In: ICCV (2015) Quan, Y., Huang, Y., Ji, H.: Dynamic texture recognition via orthogonal tensor dictionary learning. In: ICCV (2015)
28.
Zurück zum Zitat Ravichandran, A., Chaudhry, R., R. Vidal, R.: View-invariant dynamic texture recognition using a bag of dynamical systems. In: CVPR (2009) Ravichandran, A., Chaudhry, R., R. Vidal, R.: View-invariant dynamic texture recognition using a bag of dynamical systems. In: CVPR (2009)
29.
30.
Zurück zum Zitat Saisan, P., Doretto, G., Wu, Y., Soatto, S.: Dynamic texture recognition. In: CVPR (2001) Saisan, P., Doretto, G., Wu, Y., Soatto, S.: Dynamic texture recognition. In: CVPR (2001)
31.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Two-stream convolutional networks for action recognition in videos. In: NIPS (2014) Simonyan, K., Zisserman, A.: Two-stream convolutional networks for action recognition in videos. In: NIPS (2014)
32.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2015)
33.
Zurück zum Zitat Soomro, K., Zamir, A.R., Shah, M.: UCF101: A dataset of 101 human actions classes from videos in the wild. Technical report. CRCV-TR-12-01, University of Central Florida (2012) Soomro, K., Zamir, A.R., Shah, M.: UCF101: A dataset of 101 human actions classes from videos in the wild. Technical report. CRCV-TR-12-01, University of Central Florida (2012)
34.
Zurück zum Zitat Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: ICCV (2015) Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: ICCV (2015)
35.
Zurück zum Zitat Varma, M., Zisserman, A.: Texture classification: are filter banks necessary? In: CVPR (2003) Varma, M., Zisserman, A.: Texture classification: are filter banks necessary? In: CVPR (2003)
36.
Zurück zum Zitat Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. IJCV 62, 61–81 (2005)CrossRef Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. IJCV 62, 61–81 (2005)CrossRef
38.
Zurück zum Zitat Yang, F., Xia, G., Liu, G., Zhang, L., Huang, X.: Dynamic texture recognition by aggregating spatial and temporal features via SVMs. Neurocomp. 173, 1310–1321 (2016)CrossRef Yang, F., Xia, G., Liu, G., Zhang, L., Huang, X.: Dynamic texture recognition by aggregating spatial and temporal features via SVMs. Neurocomp. 173, 1310–1321 (2016)CrossRef
41.
Zurück zum Zitat Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., Oliva, A.: Learning deep features for scene recognition using places database. In: NIPS (2014) Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., Oliva, A.: Learning deep features for scene recognition using places database. In: NIPS (2014)
Metadaten
Titel
A New Large Scale Dynamic Texture Dataset with Application to ConvNet Understanding
verfasst von
Isma Hadji
Richard P. Wildes
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-01264-9_20

Premium Partner