Skip to main content

2018 | OriginalPaper | Buchkapitel

Face Detection for Crowd Analysis Using Deep Convolutional Neural Networks

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Crowd analysis is a challenging topic within computer vision, current state of the art methods for face detection in crowds suffer from poor results due to visual occlusions, scene semantics and overlapping subjects. In this work, we propose a novel approach of utilizing existing semantic segmentation methods to detect and segment faces in obscured images. We use an implementation of Mask RCNN trained on the popular Labelled Faces in the Wild (LFW) database to compare performance with Viola Jones, histogram of orientated gradients and max-margin object detection using a synthetically generated occluded subset of LFW. Results show that when images contain fair sized occlusions, Mask RCNN outperforms the current state of the art method. State of the art performance was achieved on this dataset and context specific improvements are suggested for further work. The contribution of this paper is not to regurgitate the finding from the original paper on Mask RCNN but provide results on the efficiency of using the method in the context of face detection for crowd analysis. Additionally, exploration of suitable hyper parameters for this context has been performed and described. Code has been made publicly available.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Leonardi, F., Marcii, D.: An uncertainty model for people counters based on video sensors. In: Advanced Methods for Uncertainty Estimation in Measurement, Italy, pp. 62–66 (2008) Leonardi, F., Marcii, D.: An uncertainty model for people counters based on video sensors. In: Advanced Methods for Uncertainty Estimation in Measurement, Italy, pp. 62–66 (2008)
2.
Zurück zum Zitat Liu, C., Shum, H.Y., Freeman, W.: Face hallucination: theory and practice. Int. J. Comput. Vision 75(1), 115–134 (2007)CrossRef Liu, C., Shum, H.Y., Freeman, W.: Face hallucination: theory and practice. Int. J. Comput. Vision 75(1), 115–134 (2007)CrossRef
3.
Zurück zum Zitat Eshed, O.B., Trivedi, M.: To boost or not to boost? On the limits of boosted trees for object detection. In: 23rd International Conference on Pattern Recognition (ICPR), pp. 3350–3355, Mexico (2016) Eshed, O.B., Trivedi, M.: To boost or not to boost? On the limits of boosted trees for object detection. In: 23rd International Conference on Pattern Recognition (ICPR), pp. 3350–3355, Mexico (2016)
4.
Zurück zum Zitat Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, pp. 5325–5334 (2015) Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, pp. 5325–5334 (2015)
5.
Zurück zum Zitat Vaillant, R., Monrocq, C., Le Cun, Y.: Original approach for the localisation of objects in images. In: IEE Proceedings-Vision, Image and Signal Processing, pp. 245–250 (1994)CrossRef Vaillant, R., Monrocq, C., Le Cun, Y.: Original approach for the localisation of objects in images. In: IEE Proceedings-Vision, Image and Signal Processing, pp. 245–250 (1994)CrossRef
9.
Zurück zum Zitat Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015) Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
10.
Zurück zum Zitat He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: IEEE International Conference on Computer Vision (ICCV), Venice, pp. 2980–2988 (2017) He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: IEEE International Conference on Computer Vision (ICCV), Venice, pp. 2980–2988 (2017)
11.
Zurück zum Zitat Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Conference on Computer Vision and Pattern Recognition (CVPR). vol. 1, no. 2, pp. 4–13, Hawaii (2017) Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Conference on Computer Vision and Pattern Recognition (CVPR). vol. 1, no. 2, pp. 4–13, Hawaii (2017)
15.
Zurück zum Zitat Kinjal, J., Safvan, V.: Crowd behavior analysis. Int. J. Sci. Res. (IJSR), 3(12) (2014) Kinjal, J., Safvan, V.: Crowd behavior analysis. Int. J. Sci. Res. (IJSR), 3(12) (2014)
16.
Zurück zum Zitat Garnier, S., Gautrais, J., Theraulaz, G.: The biological principles of swarm intelligence. Swarm Intell. 1(1), 3–31 (2007)CrossRef Garnier, S., Gautrais, J., Theraulaz, G.: The biological principles of swarm intelligence. Swarm Intell. 1(1), 3–31 (2007)CrossRef
17.
Zurück zum Zitat Junior, J.C.S.J., Musse, S.R., Jung, C.R.: Crowd analysis using computer vision techniques. IEEE Signal Process. Mag. 27(5), 66–77 (2010) Junior, J.C.S.J., Musse, S.R., Jung, C.R.: Crowd analysis using computer vision techniques. IEEE Signal Process. Mag. 27(5), 66–77 (2010)
18.
Zurück zum Zitat Moussaïd, M., Perozo, N., Garnier, S., Helbing, D., Theraulaz, G.: The walking behaviour of pedestrian social groups and its impact on crowd dynamics. PloS one. 5(4) (2010)CrossRef Moussaïd, M., Perozo, N., Garnier, S., Helbing, D., Theraulaz, G.: The walking behaviour of pedestrian social groups and its impact on crowd dynamics. PloS one. 5(4) (2010)CrossRef
19.
Zurück zum Zitat Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Ohio, pp. 580–587 (2014) Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Ohio, pp. 580–587 (2014)
20.
Zurück zum Zitat Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint (2013). arXiv:1312.6229 Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint (2013). arXiv:​1312.​6229
21.
Zurück zum Zitat Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: IEEE 12th International Conference on Computer Vision, pp. 32–29, Kyoto (2009) Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: IEEE 12th International Conference on Computer Vision, pp. 32–29, Kyoto (2009)
22.
Zurück zum Zitat Maji, S., Malik, J.: Object detection using a max-margin hough transform. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1038–1045, Miami (2009) Maji, S., Malik, J.: Object detection using a max-margin hough transform. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1038–1045, Miami (2009)
23.
Zurück zum Zitat Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779–788, Las Vegas (2016) Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779–788, Las Vegas (2016)
24.
Zurück zum Zitat Zhang, W., Zelinsky, G., Samaras, D.: Real-time accurate object detection using multiple resolutions. In: 11th International Conference on Computer Vision (ICCV), pp. 1–8 (2007) Zhang, W., Zelinsky, G., Samaras, D.: Real-time accurate object detection using multiple resolutions. In: 11th International Conference on Computer Vision (ICCV), pp. 1–8 (2007)
25.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893 (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 886–893 (2005)
26.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778, Las Vegas (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778, Las Vegas (2016)
28.
Zurück zum Zitat Barr, J.R., Bowyer, K.W., Flynn, P.J.: The effectiveness of face detection algorithms in unconstrained crowd scenes. In: IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1020–1027, Hayden (2014) Barr, J.R., Bowyer, K.W., Flynn, P.J.: The effectiveness of face detection algorithms in unconstrained crowd scenes. In: IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1020–1027, Hayden (2014)
Metadaten
Titel
Face Detection for Crowd Analysis Using Deep Convolutional Neural Networks
verfasst von
Bryan Kneis
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-98204-5_6

Premium Partner