Skip to main content
Top

2016 | OriginalPaper | Chapter

Do We Really Need to Collect Millions of Faces for Effective Face Recognition?

Authors : Iacopo Masi, Anh Tuấn Trần, Tal Hassner, Jatuporn Toy Leksut, Gérard Medioni

Published in: Computer Vision – ECCV 2016

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Face recognition capabilities have recently made extraordinary leaps. Though this progress is at least partially due to ballooning training set sizes – huge numbers of face images downloaded and labeled for identity – it is not clear if the formidable task of collecting so many images is truly necessary. We propose a far more accessible means of increasing training data sizes for face recognition systems: Domain specific data augmentation. We describe novel methods of enriching an existing dataset with important facial appearance variations by manipulating the faces it contains. This synthesis is also used when matching query images represented by standard convolutional neural networks. The effect of training and testing with synthesized images is tested on the LFW and IJB-A (verification and identification) benchmarks and Janus CS2. The performances obtained by our approach match state of the art results reported by systems trained on millions of downloaded images.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
MegaFace [14] is larger than CASIA, but was designed as a testing set and so provides few images per subject. It was consequently never used for training CNN systems.
 
Literature
1.
go back to reference AbdAlmageed, W., Wu, Y., Rawls, S., Harel, S., Hassner, T., Masi, I., Choi, J., Leksut, J., Kim, J., Natarajan, P., Nevatia, R., Medioni, G.: Face recognition using deep multi-pose representations. In: Winter Conference on Applications of Computer Vision (2016) AbdAlmageed, W., Wu, Y., Rawls, S., Harel, S., Hassner, T., Masi, I., Choi, J., Leksut, J., Kim, J., Natarajan, P., Nevatia, R., Medioni, G.: Face recognition using deep multi-pose representations. In: Winter Conference on Applications of Computer Vision (2016)
2.
go back to reference Baltrusaitis, T., Robinson, P., Morency, L.P.: Constrained local neural fields for robust facial landmark detection in the wild. In: Proceedings of the International Conference on Computer Vision Workshops (2013) Baltrusaitis, T., Robinson, P., Morency, L.P.: Constrained local neural fields for robust facial landmark detection in the wild. In: Proceedings of the International Conference on Computer Vision Workshops (2013)
3.
go back to reference Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: Delving deep into convolutional nets. In: Proceedings of the British Machine Vision Conference (2014) Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: Delving deep into convolutional nets. In: Proceedings of the British Machine Vision Conference (2014)
4.
go back to reference Chen, J.C., Sankaranarayanan, S., Patel, V.M., Chellappa, R.: Unconstrained face verification using fisher vectors computed from frontalized faces. In: International Conference on Biometrics: Theory, Applications and Systems (2015) Chen, J.C., Sankaranarayanan, S., Patel, V.M., Chellappa, R.: Unconstrained face verification using fisher vectors computed from frontalized faces. In: International Conference on Biometrics: Theory, Applications and Systems (2015)
5.
go back to reference Chen, J.C., Patel, V.M., Chellappa, R.: Unconstrained face verification using deep cnn features. In: Winter Conference on Applications of Computer Vision (2016) Chen, J.C., Patel, V.M., Chellappa, R.: Unconstrained face verification using deep cnn features. In: Winter Conference on Applications of Computer Vision (2016)
6.
go back to reference Ding, C., Xu, C., Tao, D.: Multi-task pose-invariant face recognition. Trans. Image Process. 24(3), 980–993 (2015)MathSciNetCrossRef Ding, C., Xu, C., Tao, D.: Multi-task pose-invariant face recognition. Trans. Image Process. 24(3), 980–993 (2015)MathSciNetCrossRef
7.
go back to reference Eigen, D., Fergus, R.: Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In: Proceedings of the International Conference on Computer Vision, pp. 2650–2658 (2015) Eigen, D., Fergus, R.: Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In: Proceedings of the International Conference on Computer Vision, pp. 2650–2658 (2015)
8.
go back to reference Ferrari, C., Lisanti, G., Berretti, S., Del Bimbo, A.: Dictionary learning based 3D morphable model construction for face recognition with varying expression and pose. In: 3DV (2015) Ferrari, C., Lisanti, G., Berretti, S., Del Bimbo, A.: Dictionary learning based 3D morphable model construction for face recognition with varying expression and pose. In: 3DV (2015)
9.
go back to reference Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2003)MATH Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2003)MATH
10.
go back to reference Hassner, T.: Viewing real-world faces in 3d. In: Proceedings of the International Conference on Computer Vision, pp. 3607–3614 (2013) Hassner, T.: Viewing real-world faces in 3d. In: Proceedings of the International Conference on Computer Vision, pp. 3607–3614 (2013)
11.
go back to reference Hassner, T., Harel, S., Paz, E., Enbar, R.: Effective face frontalization in unconstrained images. In: Proceedings of the International Conference on Computer Vision Pattern Recognition (2015) Hassner, T., Harel, S., Paz, E., Enbar, R.: Effective face frontalization in unconstrained images. In: Proceedings of the International Conference on Computer Vision Pattern Recognition (2015)
12.
go back to reference Hassner, T., Masi, I., Kim, J., Choi, J., Harel, S., Natarajan, P., Medioni, G.: Pooling faces: template based face recognition with pooled face images. In: Proceedings of the International Conference on Computer Vision Pattern Recognition Workshops, June 2016 Hassner, T., Masi, I., Kim, J., Choi, J., Harel, S., Natarajan, P., Medioni, G.: Pooling faces: template based face recognition with pooled face images. In: Proceedings of the International Conference on Computer Vision Pattern Recognition Workshops, June 2016
13.
go back to reference Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report 07-49, UMass, Amherst, October 2007 Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report 07-49, UMass, Amherst, October 2007
14.
go back to reference Kemelmacher-Shlizerman, I., Seitz, S.M., Miller, D., Brossard, E.: The MegaFace benchmark: 1 million faces for recognition at scale. In: Proceedings of the International Conference on Computer Vision Pattern Recognition (2016) Kemelmacher-Shlizerman, I., Seitz, S.M., Miller, D., Brossard, E.: The MegaFace benchmark: 1 million faces for recognition at scale. In: Proceedings of the International Conference on Computer Vision Pattern Recognition (2016)
15.
go back to reference Kemelmacher-Shlizerman, I., Suwajanakorn, S., Seitz, S.M.: Illumination-aware age progression. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 3334–3341. IEEE (2014) Kemelmacher-Shlizerman, I., Suwajanakorn, S., Seitz, S.M.: Illumination-aware age progression. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 3334–3341. IEEE (2014)
16.
go back to reference Klare, B.F., Klein, B., Taborsky, E., Blanton, A., Cheney, J., Allen, K., Grother, P., Mah, A., Burge, M., Jain, A.K.: Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus benchmark A. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 1931–1939 (2015) Klare, B.F., Klein, B., Taborsky, E., Blanton, A., Cheney, J., Allen, K., Grother, P., Mah, A., Burge, M., Jain, A.K.: Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus benchmark A. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 1931–1939 (2015)
17.
go back to reference Klontz, J., Klare, B., Klum, S., Taborsky, E., Burge, M., Jain, A.K.: Open source biometric recognition. In: International Conference on Biometrics: Theory, Applications and Systems (2013) Klontz, J., Klare, B., Klum, S., Taborsky, E., Burge, M., Jain, A.K.: Open source biometric recognition. In: International Conference on Biometrics: Theory, Applications and Systems (2013)
18.
go back to reference Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp. 1097–1105 (2012)
19.
go back to reference Lawrence, S., Giles, C.L., Tsoi, A.C., Back, A.D.: Face recognition: a convolutional neural-network approach. Trans. Neural Netw. 8(1), 98–113 (1997)CrossRef Lawrence, S., Giles, C.L., Tsoi, A.C., Back, A.D.: Face recognition: a convolutional neural-network approach. Trans. Neural Netw. 8(1), 98–113 (1997)CrossRef
21.
go back to reference Lewis, J.P., Anjyo, K., Rhee, T., Zhang, M., Pighin, F., Deng, Z.: Practice and theory of blendshape facial models. In: Eurographics 2014 (2014) Lewis, J.P., Anjyo, K., Rhee, T., Zhang, M., Pighin, F., Deng, Z.: Practice and theory of blendshape facial models. In: Eurographics 2014 (2014)
22.
go back to reference Li, H., Hua, G., Lin, Z., Brandt, J., Yang, J.: Probabilistic elastic matching for pose variant face verification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3499–3506 (2013) Li, H., Hua, G., Lin, Z., Brandt, J., Yang, J.: Probabilistic elastic matching for pose variant face verification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3499–3506 (2013)
23.
go back to reference Masi, I., Ferrari, C., Del Bimbo, A., Medioni, G.: Pose independent face recognition by localizing local binary patterns via deformation components. In: International Conference on Pattern Recognition (2014) Masi, I., Ferrari, C., Del Bimbo, A., Medioni, G.: Pose independent face recognition by localizing local binary patterns via deformation components. In: International Conference on Pattern Recognition (2014)
24.
go back to reference Masi, I., Rawls, S., Medioni, G., Natarajan, P.: Pose-aware face recognition in the wild. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2016) Masi, I., Rawls, S., Medioni, G., Natarajan, P.: Pose-aware face recognition in the wild. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2016)
25.
go back to reference McLaughlin, N., Martinez Del Rincon, J., Miller, P.: Data-augmentation for reducing dataset bias in person re-identification. In: International Conference on Advanced Video and Signal Based Surveillance. IEEE (2015) McLaughlin, N., Martinez Del Rincon, J., Miller, P.: Data-augmentation for reducing dataset bias in person re-identification. In: International Conference on Advanced Video and Signal Based Surveillance. IEEE (2015)
26.
go back to reference Nguyen, M.H., Lalonde, J.F., Efros, A.A., De la Torre, F.: Image-based shaving. Comput. Graph. Forum 27(2), 627–635 (2008)CrossRef Nguyen, M.H., Lalonde, J.F., Efros, A.A., De la Torre, F.: Image-based shaving. Comput. Graph. Forum 27(2), 627–635 (2008)CrossRef
27.
go back to reference Parkhi, O.M., Simonyan, K., Vedaldi, A., Zisserman, A.: A compact and discriminative face track descriptor. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2014) Parkhi, O.M., Simonyan, K., Vedaldi, A., Zisserman, A.: A compact and discriminative face track descriptor. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2014)
28.
go back to reference Parkhi, O.M., Vedaldi, A., Zisserman, A.: Deep face recognition. In: Proceedings of the British Machine Vision Conference (2015) Parkhi, O.M., Vedaldi, A., Zisserman, A.: Deep face recognition. In: Proceedings of the British Machine Vision Conference (2015)
29.
go back to reference Paysan, P., Knothe, R., Amberg, B., Romdhani, S., Vetter, T.: A 3d face model for pose and illumination invariant face recognition. In: Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2009, pp. 296–301, September 2009 Paysan, P., Knothe, R., Amberg, B., Romdhani, S., Vetter, T.: A 3d face model for pose and illumination invariant face recognition. In: Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2009, pp. 296–301, September 2009
30.
go back to reference Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al.: Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)MathSciNetCrossRef Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al.: Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)MathSciNetCrossRef
31.
go back to reference Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)MathSciNetCrossRefMATH Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)MathSciNetCrossRefMATH
32.
go back to reference Sankaranarayanan, S., Alavi, A., Chellappa, R.: Triplet similarity embedding for face verification (2016). arXiv preprint: arXiv:1602.03418 Sankaranarayanan, S., Alavi, A., Chellappa, R.: Triplet similarity embedding for face verification (2016). arXiv preprint: arXiv:​1602.​03418
33.
go back to reference Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015) Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)
34.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
35.
go back to reference Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.: Multi-view convolutional neural networks for 3d shape recognition. In: Proceedings of the International Conference on Computer Vision, pp. 945–953 (2015) Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.: Multi-view convolutional neural networks for 3d shape recognition. In: Proceedings of the International Conference on Computer Vision, pp. 945–953 (2015)
36.
go back to reference Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. In: Neural Information Processing System, pp. 1988–1996 (2014) Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. In: Neural Information Processing System, pp. 1988–1996 (2014)
37.
go back to reference Sun, Y., Liang, D., Wang, X., Tang, X.: Deepid3: face recognition with very deep neural networks (2015). arXiv preprint: arXiv:1502.00873 Sun, Y., Liang, D., Wang, X., Tang, X.: Deepid3: face recognition with very deep neural networks (2015). arXiv preprint: arXiv:​1502.​00873
38.
go back to reference Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Proceedings of the International Conference on Computer Vision Pattern Recognition. IEEE (2014) Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Proceedings of the International Conference on Computer Vision Pattern Recognition. IEEE (2014)
39.
go back to reference Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: Closing the gap to human-level performance in face verification. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 1701–1708. IEEE (2014) Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: Closing the gap to human-level performance in face verification. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 1701–1708. IEEE (2014)
40.
go back to reference Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Web-scale training for face identification. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2015) Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Web-scale training for face identification. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2015)
42.
go back to reference Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 529–534. IEEE (2011) Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 529–534. IEEE (2011)
43.
go back to reference Xie, S., Tu, Z.: Holistically-nested edge detection. In: Proceedings of the International Conference on Computer Vision (2015) Xie, S., Tu, Z.: Holistically-nested edge detection. In: Proceedings of the International Conference on Computer Vision (2015)
44.
go back to reference Yang, H., Patras, I.: Mirror, mirror on the wall, tell me, is the error small? In: Proceedings of the International Conference on Computer Vision Pattern Recognition (2015) Yang, H., Patras, I.: Mirror, mirror on the wall, tell me, is the error small? In: Proceedings of the International Conference on Computer Vision Pattern Recognition (2015)
45.
go back to reference Yi, D., Lei, Z., Li, S.: Towards pose robust face recognition. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 3539–3545 (2013) Yi, D., Lei, Z., Li, S.: Towards pose robust face recognition. In: Proceedings of the International Conference on Computer Vision Pattern Recognition, pp. 3539–3545 (2013)
47.
go back to reference Zhou, E., Cao, Z., Yin, Q.: Naive-deep face recognition: touching the limit of LFW benchmark or not? (2015). arXiv preprint: arXiv:1501.04690 Zhou, E., Cao, Z., Yin, Q.: Naive-deep face recognition: touching the limit of LFW benchmark or not? (2015). arXiv preprint: arXiv:​1501.​04690
Metadata
Title
Do We Really Need to Collect Millions of Faces for Effective Face Recognition?
Authors
Iacopo Masi
Anh Tuấn Trần
Tal Hassner
Jatuporn Toy Leksut
Gérard Medioni
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-46454-1_35

Premium Partner