nach oben

Erschienen in:

2018 | OriginalPaper | Buchkapitel

Face Detection for Crowd Analysis Using Deep Convolutional Neural Networks

verfasst von : Bryan Kneis

Erschienen in: Engineering Applications of Neural Networks

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Crowd analysis is a challenging topic within computer vision, current state of the art methods for face detection in crowds suffer from poor results due to visual occlusions, scene semantics and overlapping subjects. In this work, we propose a novel approach of utilizing existing semantic segmentation methods to detect and segment faces in obscured images. We use an implementation of Mask RCNN trained on the popular Labelled Faces in the Wild (LFW) database to compare performance with Viola Jones, histogram of orientated gradients and max-margin object detection using a synthetically generated occluded subset of LFW. Results show that when images contain fair sized occlusions, Mask RCNN outperforms the current state of the art method. State of the art performance was achieved on this dataset and context specific improvements are suggested for further work. The contribution of this paper is not to regurgitate the finding from the original paper on Mask RCNN but provide results on the efficiency of using the method in the context of face detection for crowd analysis. Additionally, exploration of suitable hyper parameters for this context has been performed and described. Code has been made publicly available.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel RR-FCN: Rotational Region-Based Fully Convolutional Networks for Object Detection

Nächstes Kapitel Smoothing Regularized Extreme Learning Machine

Leonardi, F., Marcii, D.: An uncertainty model for people counters based on video sensors. In: Advanced Methods for Uncertainty Estimation in Measurement, Italy, pp. 62–66 (2008)

Liu, C., Shum, H.Y., Freeman, W.: Face hallucination: theory and practice. Int. J. Comput. Vision 75(1), 115–134 (2007)CrossRef

Eshed, O.B., Trivedi, M.: To boost or not to boost? On the limits of boosted trees for object detection. In: 23rd International Conference on Pattern Recognition (ICPR), pp. 3350–3355, Mexico (2016)

Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, pp. 5325–5334 (2015)

Vaillant, R., Monrocq, C., Le Cun, Y.: Original approach for the localisation of objects in images. In: IEE Proceedings-Vision, Image and Signal Processing, pp. 245–250 (1994)CrossRef

Girshick, R.: Fast R-CNN. arXiv preprint (2015). arXiv:1504.08083

Everingham, M., Van Gool, L., Williams, C., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results. http://host.robots.ox.ac.uk/pascal/VOC/voc2012

He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8691, pp. 346–361. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10578-9_23CrossRef

Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)

10.

He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: IEEE International Conference on Computer Vision (ICCV), Venice, pp. 2980–2988 (2017)

11.

Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Conference on Computer Vision and Pattern Recognition (CVPR). vol. 1, no. 2, pp. 4–13, Hawaii (2017)

12.

Gary, B., Marwan, M., Honglak, L.: Erik Learned-Miller: Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments. http://vis-www.cs.umass.edu/lfw/#reference

13.

Amos, B., Bartosz, L., Satyanarayanan, M.: OpenFace: A general-purpose face recognition library with mobile applications. https://cmusatyalab.github.io/openface

14.

Vidit, J., Erik, LM.: FDDB: A Benchmark for Face Detection in Unconstrained Settings. http://vis-www.cs.umass.edu/fddb

15.

Kinjal, J., Safvan, V.: Crowd behavior analysis. Int. J. Sci. Res. (IJSR), 3(12) (2014)

16.

Garnier, S., Gautrais, J., Theraulaz, G.: The biological principles of swarm intelligence. Swarm Intell. 1(1), 3–31 (2007)CrossRef

17.

Junior, J.C.S.J., Musse, S.R., Jung, C.R.: Crowd analysis using computer vision techniques. IEEE Signal Process. Mag. 27(5), 66–77 (2010)

18.

Moussaïd, M., Perozo, N., Garnier, S., Helbing, D., Theraulaz, G.: The walking behaviour of pedestrian social groups and its impact on crowd dynamics. PloS one. 5(4) (2010)CrossRef

19.

Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Ohio, pp. 580–587 (2014)

20.

Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint (2013). arXiv:1312.6229

21.

Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: IEEE 12th International Conference on Computer Vision, pp. 32–29, Kyoto (2009)

22.

Maji, S., Malik, J.: Object detection using a max-margin hough transform. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1038–1045, Miami (2009)

23.

Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779–788, Las Vegas (2016)

24.

Zhang, W., Zelinsky, G., Samaras, D.: Real-time accurate object detection using multiple resolutions. In: 11th International Conference on Computer Vision (ICCV), pp. 1–8 (2007)

25.

Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893 (2005)

26.

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778, Las Vegas (2016)

27.

King, D.E.: Max-margin object detection. arXiv preprint (2015) arXiv:1502.00046

28.

Barr, J.R., Bowyer, K.W., Flynn, P.J.: The effectiveness of face detection algorithms in unconstrained crowd scenes. In: IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1020–1027, Hayden (2014)

29.

King D.: Easily Create High Quality Object Detectors with Deep Learning. http://blog.dlib.net/2016/10/easily-create-high-quality-object.html

30.

Waleed, K.: Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow. https://github.com/matterport/Mask_RCNN

Titel: Face Detection for Crowd Analysis Using Deep Convolutional Neural Networks
verfasst von: Bryan Kneis
Verlag: Springer International Publishing
Buch: Engineering Applications of Neural Networks
Print ISBN: 978-3-319-98203-8

Electronic ISBN: 978-3-319-98204-5

Copyright-Jahr: 2018
DOI: https://doi.org/10.1007/978-3-319-98204-5_6

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner