Skip to main content
Erschienen in: International Journal of Social Robotics 5/2016

01.11.2016

Learning Saliency Features for Face Detection and Recognition Using Multi-task Network

verfasst von: Qian Zhao, Shuzhi Sam Ge, Mao Ye, Sibang Liu, Wei He

Erschienen in: International Journal of Social Robotics | Ausgabe 5/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work, we have proposed a method to learn a type of saliency features, which merely makes response in face regions. Based on the saliency features, a joint pipeline is designed to detect and recognize faces as a part of human–robot interaction (HRI) system of SRU robot. The characteristics of the architecture can be described as follows: (i) In the network, detectors can only be activated by face regions. By convoluting the input image, the detectors can produce a group of saliency feature maps, which indicate the location of faces. (ii) The face representations are achieved by pooling on these high response regions. They enjoy discriminative ability to face identification. Hence, classification and detection can be blended using a single network. (iii) To enhance the saliency of features, false responses are suppressed by introducing a saliency term in loss function, which forces the feature detector to ignore non-face inputs. It also can be seen as a branch of multi-task network to learn background. By restricting false responses, the performance of face verification can be improved, especially when the training and testing are implemented on different dataset. In experiments, the effects of saliency term on face verification and benchmark discriminative ability of saliency features on LFW are analyzed. And the effectiveness of this method in face detection is verified by the experimental results on FDDB.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ahonen T, Member S, Hadid A, Pietikainen M, Member S (2006) Face description with local binary patterns: Application to face recognition. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, pp 2037–2041 Ahonen T, Member S, Hadid A, Pietikainen M, Member S (2006) Face description with local binary patterns: Application to face recognition. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, pp 2037–2041
2.
Zurück zum Zitat Benezeth Y, Emile B, Laurent H, Rosenberger C (2010) Vision-based system for human detection and tracking in indoor environment. Int J Soc Robot 2(1):41–52CrossRef Benezeth Y, Emile B, Laurent H, Rosenberger C (2010) Vision-based system for human detection and tracking in indoor environment. Int J Soc Robot 2(1):41–52CrossRef
3.
Zurück zum Zitat Bengio Y, Lamblin P, Popovici D, Larochelle H et al (2007) Greedy layer-wise training of deep networks. Adv Neural Inf Process Syst 19:153 Bengio Y, Lamblin P, Popovici D, Larochelle H et al (2007) Greedy layer-wise training of deep networks. Adv Neural Inf Process Syst 19:153
4.
Zurück zum Zitat Berg T, Belhumeur PN (2012) Tom-vs-pete classifiers and identity-preserving alignment for face verification. In: BMVC, Citeseer, vol. 2, p 7 Berg T, Belhumeur PN (2012) Tom-vs-pete classifiers and identity-preserving alignment for face verification. In: BMVC, Citeseer, vol. 2, p 7
5.
Zurück zum Zitat Chen D, Cao X, Wang L, Wen F, Sun J (2012) Bayesian face revisited: a joint formulation. In: ECCV 2012, Springer, pp 566–579 Chen D, Cao X, Wang L, Wen F, Sun J (2012) Bayesian face revisited: a joint formulation. In: ECCV 2012, Springer, pp 566–579
6.
Zurück zum Zitat Chen D, Cao X, Wen F, Sun J (2013) Blessing of dimensionality: high-dimensional feature and its efficient compression for face verification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2013, pp 3025 – 3032 Chen D, Cao X, Wen F, Sun J (2013) Blessing of dimensionality: high-dimensional feature and its efficient compression for face verification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2013, pp 3025 – 3032
8.
Zurück zum Zitat Hadsell R, Chopra S, Lecun Y (2006) Dimensionality reduction by learning an invariant mapping. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2006, pp 1735–1742 Hadsell R, Chopra S, Lecun Y (2006) Dimensionality reduction by learning an invariant mapping. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2006, pp 1735–1742
9.
Zurück zum Zitat He H, Ge SS, Zhang Z (2011) Visual attention prediction using saliency determination of scene understanding for social robots. Int J Soc Robot 3(4):457–468MathSciNetCrossRef He H, Ge SS, Zhang Z (2011) Visual attention prediction using saliency determination of scene understanding for social robots. Int J Soc Robot 3(4):457–468MathSciNetCrossRef
10.
Zurück zum Zitat He W, Chen Y, Yin Z (2015a) Adaptive neural network control of an uncertain robot with full-state constraints. IEEE Trans Cybern, in press He W, Chen Y, Yin Z (2015a) Adaptive neural network control of an uncertain robot with full-state constraints. IEEE Trans Cybern, in press
11.
Zurück zum Zitat He W, Ge SS, Li Y, Chew E, Ng YS (2015b) Neural network control of a rehabilitation robot by state and output feedback. J Intell Robot Syst 80(1):15–31CrossRef He W, Ge SS, Li Y, Chew E, Ng YS (2015b) Neural network control of a rehabilitation robot by state and output feedback. J Intell Robot Syst 80(1):15–31CrossRef
12.
Zurück zum Zitat Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science (New York, NY) 313(5786):504–547MathSciNetCrossRefMATH Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science (New York, NY) 313(5786):504–547MathSciNetCrossRefMATH
13.
Zurück zum Zitat Huang C, Zhu S, Yu K (2012) Large scale strongly supervised ensemble metric learning, with applications to face verification and retrieval. arXiv preprint arXiv:1212.6094 Huang C, Zhu S, Yu K (2012) Large scale strongly supervised ensemble metric learning, with applications to face verification and retrieval. arXiv preprint arXiv:​1212.​6094
14.
Zurück zum Zitat Huang GB, Learned-Miller E (2014) Labeled faces in the wild: Updates and new reporting procedures. Dept Comput Sci, Univ Massachusetts Amherst, Amherst, MA, USA, Technical Report pp 14–003 Huang GB, Learned-Miller E (2014) Labeled faces in the wild: Updates and new reporting procedures. Dept Comput Sci, Univ Massachusetts Amherst, Amherst, MA, USA, Technical Report pp 14–003
15.
Zurück zum Zitat Huang GB, Ramesh M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst Huang GB, Ramesh M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst
16.
Zurück zum Zitat Jain V, Learned-Miller E (2010) Fddb: A benchmark for face detection in unconstrained settings. Technical Report UM-CS-2010-009, University of Massachusetts, Amherst Jain V, Learned-Miller E (2010) Fddb: A benchmark for face detection in unconstrained settings. Technical Report UM-CS-2010-009, University of Massachusetts, Amherst
17.
Zurück zum Zitat Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE conference on computer vision and pattern recognition (CVPR) 2006, IEEE, 2, pp 2169–2178 Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE conference on computer vision and pattern recognition (CVPR) 2006, IEEE, 2, pp 2169–2178
18.
Zurück zum Zitat Lin D, Lu C, Liao R, Jia J (2014a) Learning important spatial pooling regions for scene classification. In: IEEE conference on computer vision and pattern recognition (CVPR) 2014, pp 3726–3733 Lin D, Lu C, Liao R, Jia J (2014a) Learning important spatial pooling regions for scene classification. In: IEEE conference on computer vision and pattern recognition (CVPR) 2014, pp 3726–3733
19.
Zurück zum Zitat Lin M, Chen Q, Yan S (2014b) Network in network. In: International conference on learning representations (ICLR) 2014 Lin M, Chen Q, Yan S (2014b) Network in network. In: International conference on learning representations (ICLR) 2014
20.
Zurück zum Zitat Liu C, Wechsler H (2002) Gabor feature based classification using the enhanced fisher linear discriminant model. IEEE Trans Image Process 11:467–476CrossRef Liu C, Wechsler H (2002) Gabor feature based classification using the enhanced fisher linear discriminant model. IEEE Trans Image Process 11:467–476CrossRef
21.
Zurück zum Zitat Liu Z, Luo P, Wang X, Tang X (2014) Deep learning face attributes in the wild. Eprint Arxiv Liu Z, Luo P, Wang X, Tang X (2014) Deep learning face attributes in the wild. Eprint Arxiv
22.
Zurück zum Zitat Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comp Vision 60(2):91–110CrossRef Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comp Vision 60(2):91–110CrossRef
23.
Zurück zum Zitat Mozos OM, Kurazume R, Hasegawa T (2010) Multi-part people detection using 2d range data. Int J Soc Robot 2(1):31–40CrossRef Mozos OM, Kurazume R, Hasegawa T (2010) Multi-part people detection using 2d range data. Int J Soc Robot 2(1):31–40CrossRef
24.
Zurück zum Zitat Simonyan K, Parkhi O, Vedaldi A, Zisserman A, Simonyan K, Parkhi O, Vedaldi A, Zisserman A (2013) Fisher vector faces in the wild. In Proceedings of the BMVC pp 8.1–8.11 Simonyan K, Parkhi O, Vedaldi A, Zisserman A, Simonyan K, Parkhi O, Vedaldi A, Zisserman A (2013) Fisher vector faces in the wild. In Proceedings of the BMVC pp 8.1–8.11
25.
Zurück zum Zitat Sun Y, Wang X, Tang X (2013a) Deep convolutional network cascade for facial point detection. In: IEEE conference on computer vision and pattern recognition (CVPR) 2013, pp 3476–3483 Sun Y, Wang X, Tang X (2013a) Deep convolutional network cascade for facial point detection. In: IEEE conference on computer vision and pattern recognition (CVPR) 2013, pp 3476–3483
26.
Zurück zum Zitat Sun Y, Wang X, Tang X (2013b) Hybrid deep learning for face verification. In: IEEE international conference on computer vision (ICCV) 2013, pp 1489–1496 Sun Y, Wang X, Tang X (2013b) Hybrid deep learning for face verification. In: IEEE international conference on computer vision (ICCV) 2013, pp 1489–1496
27.
Zurück zum Zitat Sun Y, Wang X, Tang X (2014a) Deep learning face representation by joint identification-verification. Proceedings of neural information processing systems conference (NIPS) 2014 Sun Y, Wang X, Tang X (2014a) Deep learning face representation by joint identification-verification. Proceedings of neural information processing systems conference (NIPS) 2014
28.
Zurück zum Zitat Sun Y, Wang X, Tang X (2014b) Deep learning face representation from predicting 10,000 classes. In: IEEE conference on computer vision and pattern recognition (CVPR) 2014, pp 1891–1898 Sun Y, Wang X, Tang X (2014b) Deep learning face representation from predicting 10,000 classes. In: IEEE conference on computer vision and pattern recognition (CVPR) 2014, pp 1891–1898
29.
Zurück zum Zitat Taigman Y, Yang M, Ranzato M, Wolf L (2014) Deepface: closing the gap to human-level performance in face verification. In: IEEE conference on computer vision and pattern recognition (CVPR) 2014, pp 1701–1708 Taigman Y, Yang M, Ranzato M, Wolf L (2014) Deepface: closing the gap to human-level performance in face verification. In: IEEE conference on computer vision and pattern recognition (CVPR) 2014, pp 1701–1708
30.
Zurück zum Zitat Yi Sun XT Xiaogang Wang (2014) Deeply learned face representations are sparse, selective, and robust. In: Proceedings of neural information processing systems conference (NIPS) 2014 Yi Sun XT Xiaogang Wang (2014) Deeply learned face representations are sparse, selective, and robust. In: Proceedings of neural information processing systems conference (NIPS) 2014
31.
Zurück zum Zitat Yi Sun XWXT Ding Liang (2015) DeepID3: Face recognition with very deep neural networks. In: Proceedings of neural information processing systems conference (NIPS) 2014 Yi Sun XWXT Ding Liang (2015) DeepID3: Face recognition with very deep neural networks. In: Proceedings of neural information processing systems conference (NIPS) 2014
32.
Zurück zum Zitat Z Zhang, P Luo, Chen CL, Tang X (2014) Facial landmark detection by deep multi-task learning. Springer International Publishing, New York Z Zhang, P Luo, Chen CL, Tang X (2014) Facial landmark detection by deep multi-task learning. Springer International Publishing, New York
Metadaten
Titel
Learning Saliency Features for Face Detection and Recognition Using Multi-task Network
verfasst von
Qian Zhao
Shuzhi Sam Ge
Mao Ye
Sibang Liu
Wei He
Publikationsdatum
01.11.2016
Verlag
Springer Netherlands
Erschienen in
International Journal of Social Robotics / Ausgabe 5/2016
Print ISSN: 1875-4791
Elektronische ISSN: 1875-4805
DOI
https://doi.org/10.1007/s12369-016-0347-x

Weitere Artikel der Ausgabe 5/2016

International Journal of Social Robotics 5/2016 Zur Ausgabe

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.