nach oben

Erschienen in:

2021 | OriginalPaper | Buchkapitel

Pickpocketing Recognition in Still Images

verfasst von : Prisa Damrongsiri, Hossein Malekmohamadi

Erschienen in: Pattern Recognition. ICPR International Workshops and Challenges

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Human activity recognition (HAR) is a challenging topic in the computer vision field. Pickpocketing is a type of human criminal actions. It needs extensive research and development for detection. This paper researches how it’s possible of pickpocketing recognition in still images. This paper takes consideration both of classification and detection. We develop our models from state-of-art pre-trained models: VGG16, ResNet50, ResNet101, and ResNet152. Moreover, we also include a convolutional block attention module (CBAM [27]) in the model. The attention mechanism enhances model performances by focusing on informative features. For classification, the highest accuracy (89%) is ResNet152 with CBAM [27] (ResNet152+CBAM). We also examine pickpocketing detection on RetinaNet [14] and YOLOv.3 [34]. The mean average precision (mAP) of pickpocketing detection is consistent with Redmon et al. [34]. RetinaNet’s precision (80 mAP) is higher than YOLOv.3 (78 mAP), but YOLOv.3 is much faster detection. ResNet152+CBAM model detection on RetinaNet approach provides the highest mAP. However, it is much slower detection than YOLOv.3 (only 10 ms). This paper proves that It is possible to implement pickpocketing on still images in a reliable time and with outstanding accuracy. This proposed model possibly apply to the other HAR tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Social Modeling Meets Virtual Reality: An Immersive Implication

Nächstes Kapitel t-EVA: Time-Efficient t-SNE Video Annotation

Bastidas, A., Tang, H.: Channel attention networks (2019)

Koshti, D., Kamoji, S., Kalnad, N., Sreekumar, S., Bhujbal, S.: Video anomaly detection using inflated 3D convolution network. In: 2020 International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, pp. 729–733 (2020). https://doi.org/10.1109/ICICT48043.2020.9112552

Elgendy, M.: Deep Learning For Vision Systems, 1st edn. (n.d.)

Guillermo, A., José, R., José, C.: Suspicious behavior detection on shoplifting cases for crime prevention by using 3D convolutional neural networks, pp. 1–7 (2020). https://arxiv.org/abs/2005.02142. Accessed 27 July 2020

Guo, G., Lai, A.: A survey on still image based human action recognition. Pattern Recogn.47(10), 3343–3361 (2014)

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition (2015)

Henrys, S.: Pickpocketing Statistics (2019). Safes International. https://www.safesinternational.com/items/Safes-International-News/pickpocketing-statistics

Hidalgo, G., et al.: OpenPose: whole-body pose estimation (2019)

Hochreiter, S.: The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. Uncertain. Fuzziness Knowl. Based Syst. 6(2), 107–116 (1998)MathSciNetCrossRef

10.

Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-excitation networks (2019)

11.

Image-net.org: 2010 ImageNet Large Scale Visual Recognition Competition, ILSVRC 2010 (2010). http://www.image-net.org/challenges/LSVRC/2010

12.

Khandelwal, S., Sigal, L.: AttentionRNN: a structured spatial attention mechanism (2019)

13.

Lin, T., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection (2016)

14.

Lin, T., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2020)CrossRef

15.

Liu, W., et al.: SSD: single shot MultiBox detector (2015)

16.

Marcelino, P.: Transfer learning from pre-trained models (2020). https://towardsdatascience.com/transfer-learning-from-pre-trained-models-f2393f124751. Accessed 2 Jul 2020

17.

Mohammadi, S., Ghofrani Majelan, S., Shokouhi, S.B.: Ensembles of deep neural networks for action recognition in still images, pp. 1–4 (2020)

18.

Park, J., Woo, S., Lee, J.-Y., Kweon, I.S.: BAM: bottleneck attention module (2018)

19.

Peng, X., Wang, L., Wang, X., Qiao, Y.: Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput. Vis. Image Underst. 150, 109–125 (2016)

20.

Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection (2015)

21.

Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014)

22.

Soomro, K., Zamir, A., Shah, M.: UCF101: a dataset of 101 human actions classes from videos in the wild (2012)

23.

Sreela, S., Idicula, S.: Action recognition in still images using residual neural network features. Proc. Comput. Sci. 143, 563–569 (2018)

24.

Sultani, W., Chen, C., Shah, M.: Real-world anomaly detection in surveillance videos (2018)

25.

The Verge: This Japanese AI security camera shows the future of surveillance will be automated (2018). https://www.theverge.com/2018/6/26/17479068/ai-guardman-security-camera-shoplifter-japan-automated-surveillance. Accessed 27 Jul 2020

26.

Wang, F., et al.: Residual attention network for image classification. In: CVPR (2015)

27.

Woo, S., Park, J., Lee, J., Kweon, I.: CBAM: convolutional block attention module (2018)

28.

Yan, S., Smith, J., Lu, W., Zhang, B.: Multibranch attention networks for action recognition in still images. IEEE Trans. Cogn. Dev. Syst. 10(4), 1116–1125 (2018)

29.

Sultani, W., Chen, C., Shah, M.: UCF-Crime (2018). https://www.crcv.ucf.edu/projects/real-world/

30.

Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015). https://doi.org/10.1007/s11263-015-0816-yMathSciNetCrossRef

31.

Majhi, S., Dash, R., Sa, P.K.: Two-stream CNN architecture for anomalous event detection in real world scenarios. In: Nain, N., Vipparthi, S.K., Raman, B. (eds.) CVIP 2019. CCIS, vol. 1148, pp. 343–353. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-4018-9_31CrossRef

32.

Lim, J.: Gun detection in surveillance videos using deep neural networks. In: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1998–2002. IEEE, Lanzhou (2019)

33.

Kanehisa, R., Neto, A.: Firearm detection using convolutional neural networks. In: Proceedings of the 11th International Conference on Agents and Artificial Intelligence, pp. 707–714. SCITEPRESS - Science and Technology Publications, Prague (2019)

34.

Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement, p. 6 (2018)

35.

Olmos, R., Tabik, S., Herrera, F.: Automatic handgun detection alarm in videos using deep learning. arXiv:1702.05147 (2017)

36.

tzutalin/labelImg (2018). https://github.com/tzutalin/labelImg

Titel: Pickpocketing Recognition in Still Images
verfasst von: Prisa Damrongsiri
Hossein Malekmohamadi
Verlag: Springer International Publishing
Buch: Pattern Recognition. ICPR International Workshops and Challenges
Print ISBN: 978-3-030-68798-4

Electronic ISBN: 978-3-030-68799-1

Copyright-Jahr: 2021
DOI: https://doi.org/10.1007/978-3-030-68799-1_11

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner