Skip to main content

2021 | OriginalPaper | Buchkapitel

Pickpocketing Recognition in Still Images

verfasst von : Prisa Damrongsiri, Hossein Malekmohamadi

Erschienen in: Pattern Recognition. ICPR International Workshops and Challenges

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Human activity recognition (HAR) is a challenging topic in the computer vision field. Pickpocketing is a type of human criminal actions. It needs extensive research and development for detection. This paper researches how it’s possible of pickpocketing recognition in still images. This paper takes consideration both of classification and detection. We develop our models from state-of-art pre-trained models: VGG16, ResNet50, ResNet101, and ResNet152. Moreover, we also include a convolutional block attention module (CBAM [27]) in the model. The attention mechanism enhances model performances by focusing on informative features. For classification, the highest accuracy (89%) is ResNet152 with CBAM [27] (ResNet152+CBAM). We also examine pickpocketing detection on RetinaNet [14] and YOLOv.3 [34]. The mean average precision (mAP) of pickpocketing detection is consistent with Redmon et al. [34]. RetinaNet’s precision (80 mAP) is higher than YOLOv.3 (78 mAP), but YOLOv.3 is much faster detection. ResNet152+CBAM model detection on RetinaNet approach provides the highest mAP. However, it is much slower detection than YOLOv.3 (only 10 ms). This paper proves that It is possible to implement pickpocketing on still images in a reliable time and with outstanding accuracy. This proposed model possibly apply to the other HAR tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bastidas, A., Tang, H.: Channel attention networks (2019) Bastidas, A., Tang, H.: Channel attention networks (2019)
3.
Zurück zum Zitat Elgendy, M.: Deep Learning For Vision Systems, 1st edn. (n.d.) Elgendy, M.: Deep Learning For Vision Systems, 1st edn. (n.d.)
4.
5.
Zurück zum Zitat Guo, G., Lai, A.: A survey on still image based human action recognition. Pattern Recogn.47(10), 3343–3361 (2014) Guo, G., Lai, A.: A survey on still image based human action recognition. Pattern Recogn.47(10), 3343–3361 (2014)
6.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition (2015) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition (2015)
8.
Zurück zum Zitat Hidalgo, G., et al.: OpenPose: whole-body pose estimation (2019) Hidalgo, G., et al.: OpenPose: whole-body pose estimation (2019)
9.
Zurück zum Zitat Hochreiter, S.: The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. Uncertain. Fuzziness Knowl. Based Syst. 6(2), 107–116 (1998)MathSciNetCrossRef Hochreiter, S.: The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. Uncertain. Fuzziness Knowl. Based Syst. 6(2), 107–116 (1998)MathSciNetCrossRef
10.
Zurück zum Zitat Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-excitation networks (2019) Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-excitation networks (2019)
12.
Zurück zum Zitat Khandelwal, S., Sigal, L.: AttentionRNN: a structured spatial attention mechanism (2019) Khandelwal, S., Sigal, L.: AttentionRNN: a structured spatial attention mechanism (2019)
13.
Zurück zum Zitat Lin, T., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection (2016) Lin, T., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection (2016)
14.
Zurück zum Zitat Lin, T., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2020)CrossRef Lin, T., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2020)CrossRef
15.
Zurück zum Zitat Liu, W., et al.: SSD: single shot MultiBox detector (2015) Liu, W., et al.: SSD: single shot MultiBox detector (2015)
17.
Zurück zum Zitat Mohammadi, S., Ghofrani Majelan, S., Shokouhi, S.B.: Ensembles of deep neural networks for action recognition in still images, pp. 1–4 (2020) Mohammadi, S., Ghofrani Majelan, S., Shokouhi, S.B.: Ensembles of deep neural networks for action recognition in still images, pp. 1–4 (2020)
18.
Zurück zum Zitat Park, J., Woo, S., Lee, J.-Y., Kweon, I.S.: BAM: bottleneck attention module (2018) Park, J., Woo, S., Lee, J.-Y., Kweon, I.S.: BAM: bottleneck attention module (2018)
19.
Zurück zum Zitat Peng, X., Wang, L., Wang, X., Qiao, Y.: Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput. Vis. Image Underst. 150, 109–125 (2016) Peng, X., Wang, L., Wang, X., Qiao, Y.: Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput. Vis. Image Underst. 150, 109–125 (2016)
20.
Zurück zum Zitat Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection (2015) Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection (2015)
21.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014)
22.
Zurück zum Zitat Soomro, K., Zamir, A., Shah, M.: UCF101: a dataset of 101 human actions classes from videos in the wild (2012) Soomro, K., Zamir, A., Shah, M.: UCF101: a dataset of 101 human actions classes from videos in the wild (2012)
23.
Zurück zum Zitat Sreela, S., Idicula, S.: Action recognition in still images using residual neural network features. Proc. Comput. Sci. 143, 563–569 (2018) Sreela, S., Idicula, S.: Action recognition in still images using residual neural network features. Proc. Comput. Sci. 143, 563–569 (2018)
24.
Zurück zum Zitat Sultani, W., Chen, C., Shah, M.: Real-world anomaly detection in surveillance videos (2018) Sultani, W., Chen, C., Shah, M.: Real-world anomaly detection in surveillance videos (2018)
26.
Zurück zum Zitat Wang, F., et al.: Residual attention network for image classification. In: CVPR (2015) Wang, F., et al.: Residual attention network for image classification. In: CVPR (2015)
27.
Zurück zum Zitat Woo, S., Park, J., Lee, J., Kweon, I.: CBAM: convolutional block attention module (2018) Woo, S., Park, J., Lee, J., Kweon, I.: CBAM: convolutional block attention module (2018)
28.
Zurück zum Zitat Yan, S., Smith, J., Lu, W., Zhang, B.: Multibranch attention networks for action recognition in still images. IEEE Trans. Cogn. Dev. Syst. 10(4), 1116–1125 (2018) Yan, S., Smith, J., Lu, W., Zhang, B.: Multibranch attention networks for action recognition in still images. IEEE Trans. Cogn. Dev. Syst. 10(4), 1116–1125 (2018)
32.
Zurück zum Zitat Lim, J.: Gun detection in surveillance videos using deep neural networks. In: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1998–2002. IEEE, Lanzhou (2019) Lim, J.: Gun detection in surveillance videos using deep neural networks. In: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1998–2002. IEEE, Lanzhou (2019)
33.
Zurück zum Zitat Kanehisa, R., Neto, A.: Firearm detection using convolutional neural networks. In: Proceedings of the 11th International Conference on Agents and Artificial Intelligence, pp. 707–714. SCITEPRESS - Science and Technology Publications, Prague (2019) Kanehisa, R., Neto, A.: Firearm detection using convolutional neural networks. In: Proceedings of the 11th International Conference on Agents and Artificial Intelligence, pp. 707–714. SCITEPRESS - Science and Technology Publications, Prague (2019)
34.
Zurück zum Zitat Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement, p. 6 (2018) Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement, p. 6 (2018)
35.
Metadaten
Titel
Pickpocketing Recognition in Still Images
verfasst von
Prisa Damrongsiri
Hossein Malekmohamadi
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-68799-1_11

Premium Partner