Skip to main content

2019 | OriginalPaper | Buchkapitel

Spatiotemporal CNNs for Pornography Detection in Videos

verfasst von : Murilo Varges da Silva, Aparecido Nilceu Marana

Erschienen in: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With the increasing use of social networks and mobile devices, the number of videos posted on the Internet is growing exponentially. Among the inappropriate contents published on the Internet, pornography is one of the most worrying as it can be accessed by teens and children. Two spatiotemporal CNNs, VGG-C3D CNN and ResNet R\((2+1)\)D CNN, were assessed for pornography detection in videos in the present study. Experimental results using the Pornography-800 dataset showed that these spatiotemporal CNNs performed better than some state-of-the-art methods based on bag of visual words and are competitive with other CNN-based approaches, reaching accuracy of \(95.1\%\).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Avila, S., Thome, N., Cord, M., Valle, E., Araújo, A.D.A.: Bossa: extended bow formalism for image classification. In: 18th IEEE ICIP, pp. 2909–2912 (2011) Avila, S., Thome, N., Cord, M., Valle, E., Araújo, A.D.A.: Bossa: extended bow formalism for image classification. In: 18th IEEE ICIP, pp. 2909–2912 (2011)
2.
Zurück zum Zitat Avila, S., Thome, N., Cord, M., Valle, E., Araújo, A.D.A.: Pooling in image representation: the visual codeword point of view. Comput. Vis. Image Underst. 117(5), 453–465 (2013) Avila, S., Thome, N., Cord, M., Valle, E., Araújo, A.D.A.: Pooling in image representation: the visual codeword point of view. Comput. Vis. Image Underst. 117(5), 453–465 (2013)
3.
Zurück zum Zitat Caetano, C., Avila, S., Guimarães, S., Araújo, A.D.A.: Pornography detection using BossaNova video descriptor. In: 2014 22nd (EUSIPCO), pp. 1681–1685 (2014) Caetano, C., Avila, S., Guimarães, S., Araújo, A.D.A.: Pornography detection using BossaNova video descriptor. In: 2014 22nd (EUSIPCO), pp. 1681–1685 (2014)
4.
Zurück zum Zitat Caetano, C., Avila, S., Schwartz, W.R., Guimarães, S.J.F., Araújo, A.D.A.: A mid-level video representation based on binary descriptors: a case study for pornography detection. CoRR abs/1605.03804 (2016) Caetano, C., Avila, S., Schwartz, W.R., Guimarães, S.J.F., Araújo, A.D.A.: A mid-level video representation based on binary descriptors: a case study for pornography detection. CoRR abs/1605.03804 (2016)
5.
Zurück zum Zitat Carreira, J., Zisserman, A.: Quo vadis, action recognition? A new model and the kinetics dataset. CoRR abs/1705.07750 (2017) Carreira, J., Zisserman, A.: Quo vadis, action recognition? A new model and the kinetics dataset. CoRR abs/1705.07750 (2017)
7.
Zurück zum Zitat Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp. 65–72 (2005) Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp. 65–72 (2005)
8.
Zurück zum Zitat Ji, S., Xu, W., Yang, M., Yu, K.: 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 221–231 (2013)CrossRef Ji, S., Xu, W., Yang, M., Yu, K.: 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 221–231 (2013)CrossRef
9.
10.
Zurück zum Zitat Klaser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3D-gradients. In: Everingham, M., Needham, C., Fraile, R. (eds.) BMVC 2008–19th British Machine Vision Conference, pp. 275:1–10. British Machine Vision Association, Leeds, United Kingdom (2008) Klaser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3D-gradients. In: Everingham, M., Needham, C., Fraile, R. (eds.) BMVC 2008–19th British Machine Vision Conference, pp. 275:1–10. British Machine Vision Association, Leeds, United Kingdom (2008)
11.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1, NIPS 2012, pp. 1097–1105. Curran Associates Inc., USA (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1, NIPS 2012, pp. 1097–1105. Curran Associates Inc., USA (2012)
12.
Zurück zum Zitat Laptev, I., Lindeberg, T.: Space-time interest points. In: Proceedings Ninth IEEE International Conference on Computer Vision, vol. 1, pp. 432–439 (2003) Laptev, I., Lindeberg, T.: Space-time interest points. In: Proceedings Ninth IEEE International Conference on Computer Vision, vol. 1, pp. 432–439 (2003)
13.
Zurück zum Zitat van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)MATH van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)MATH
14.
Zurück zum Zitat Moreira, D., et al.: Pornography classification: the hidden clues invideo spacetime. Forensic Sci. Int. 268, 46–61 (2016)CrossRef Moreira, D., et al.: Pornography classification: the hidden clues invideo spacetime. Forensic Sci. Int. 268, 46–61 (2016)CrossRef
15.
Zurück zum Zitat Moustafa, M.: Applying deep learning to classify pornographic images and videos. CoRR abs/1511.08899 (2015) Moustafa, M.: Applying deep learning to classify pornographic images and videos. CoRR abs/1511.08899 (2015)
16.
Zurück zum Zitat Perez, M., et al.: Video pornography detection through deep learning techniques and motion information. Neurocomputing 230, 279–293 (2017)CrossRef Perez, M., et al.: Video pornography detection through deep learning techniques and motion information. Neurocomputing 230, 279–293 (2017)CrossRef
17.
Zurück zum Zitat Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: Proceedings of the 15th ACM International Conference on Multimedia, MM 2007, pp. 357–360. ACM, New York (2007) Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. In: Proceedings of the 15th ACM International Conference on Multimedia, MM 2007, pp. 357–360. ACM, New York (2007)
18.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Two-stream convolutional networks for action recognition in videos. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 568–576. Curran Associates, Inc. (2014) Simonyan, K., Zisserman, A.: Two-stream convolutional networks for action recognition in videos. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 568–576. Curran Associates, Inc. (2014)
19.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014)
20.
Zurück zum Zitat de Souza, F.D.M., Valle, E., Cámara-Chávez, G., Araújo, A.: An evaluation on color invariant based local spatiotemporal features for action recognition. In: IEEE SIBGRAPI (2012) de Souza, F.D.M., Valle, E., Cámara-Chávez, G., Araújo, A.: An evaluation on color invariant based local spatiotemporal features for action recognition. In: IEEE SIBGRAPI (2012)
21.
Zurück zum Zitat Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3d convolutional networks. In: IEEE ICCV, pp. 4489–4497. Washington, DC, USA (2015) Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3d convolutional networks. In: IEEE ICCV, pp. 4489–4497. Washington, DC, USA (2015)
22.
Zurück zum Zitat Tran, D., Wang, H., Torresani, L., Ray, J., LeCun, Y., Paluri, M.: A closer look at spatiotemporal convolutions for action recognition. CoRR abs/1711.11248 (2017) Tran, D., Wang, H., Torresani, L., Ray, J., LeCun, Y., Paluri, M.: A closer look at spatiotemporal convolutions for action recognition. CoRR abs/1711.11248 (2017)
23.
Zurück zum Zitat Valle, E., de Avila, S., da Luz Jr., A., de Souza, F., Coelho, M., Araújo, A.: Content-based filtering for video sharing social networks. CoRR abs/1101.2427 (2011) Valle, E., de Avila, S., da Luz Jr., A., de Souza, F., Coelho, M., Araújo, A.: Content-based filtering for video sharing social networks. CoRR abs/1101.2427 (2011)
24.
Zurück zum Zitat Wang, H., Schmid, C.: Action recognition with improved trajectories. In: 2013 IEEE International Conference on Computer Vision, pp. 3551–3558 (2013) Wang, H., Schmid, C.: Action recognition with improved trajectories. In: 2013 IEEE International Conference on Computer Vision, pp. 3551–3558 (2013)
25.
Zurück zum Zitat Xie, S., Sun, C., Huang, J., Tu, Z., Murphy, K.: Rethinking spatiotemporal feature learning for video understanding. CoRR abs/1712.04851 (2017) Xie, S., Sun, C., Huang, J., Tu, Z., Murphy, K.: Rethinking spatiotemporal feature learning for video understanding. CoRR abs/1712.04851 (2017)
Metadaten
Titel
Spatiotemporal CNNs for Pornography Detection in Videos
verfasst von
Murilo Varges da Silva
Aparecido Nilceu Marana
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-13469-3_64