Skip to main content
Erschienen in: Multimedia Systems 5/2019

13.06.2017 | Special Issue Paper

Salient-points-guided face alignment

verfasst von: Yangyang Hao, Hengliang Zhu, Kai Wu, Xiao Lin, Lizhuang Ma

Erschienen in: Multimedia Systems | Ausgabe 5/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Regression-based face alignment approach is fast and accurate but is always limited by the initial face. Aim at limitation of initialization in regression methods, this paper presents a novel two-stage framework named salient-points-guided face alignment. In first stage, we use cascade regression framework to train a salient points (eye centers, nose, mouth corners) localization model. Then the salient points information is used as a guidance for searching the similar faces from training set. In second stage, leveraging the similar faces to generate the initial face for all points regression. In order to give more comprehensive comparison, a new evaluation metric is proposed. Considering the global distance between estimated face and ground-truth, the new evaluation metric is defined as sum of the global distance and the widely used average point-to-point distance. The results show that our approach can achieve state-of-the-art performance (12% higher than the human performance on COFW) and the new evaluation metric is more reasonable.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ashraf, A.B., Lucey, S., Cohn, J.F., Chen, T.: The painful face: pain expression recognition using active appearance models. Image Vis Comput 27(12), 1788–1796 (2007)CrossRef Ashraf, A.B., Lucey, S., Cohn, J.F., Chen, T.: The painful face: pain expression recognition using active appearance models. Image Vis Comput 27(12), 1788–1796 (2007)CrossRef
2.
Zurück zum Zitat Asthana, A., Zafeiriou, S., Cheng, S., Pantic, M.: Robust discriminative response map fitting with constrained local models. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3444–3451 (2013) Asthana, A., Zafeiriou, S., Cheng, S., Pantic, M.: Robust discriminative response map fitting with constrained local models. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3444–3451 (2013)
3.
Zurück zum Zitat Asthana, A., Zafeiriou, S., Cheng, S., Pantic, M.: Incremental face alignment in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1859–1866 (2014) Asthana, A., Zafeiriou, S., Cheng, S., Pantic, M.: Incremental face alignment in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1859–1866 (2014)
4.
Zurück zum Zitat Belhumeur, P.N., Jacobs, D.W., Kriegman, D.J., Kumar, N.: Localizing parts of faces using a consensus of exemplars. IEEE Trans. Pattern Anal. Mach. Intell. 35(12), 545–552 (2013)CrossRef Belhumeur, P.N., Jacobs, D.W., Kriegman, D.J., Kumar, N.: Localizing parts of faces using a consensus of exemplars. IEEE Trans. Pattern Anal. Mach. Intell. 35(12), 545–552 (2013)CrossRef
5.
Zurück zum Zitat Burgos-Artizzu, X.P., Perona, P., Dollr, P.: Robust face landmark estimation under occlusion. In: IEEE International Conference on Computer Vision (ICCV), pp. 1513–1520 (2013) Burgos-Artizzu, X.P., Perona, P., Dollr, P.: Robust face landmark estimation under occlusion. In: IEEE International Conference on Computer Vision (ICCV), pp. 1513–1520 (2013)
6.
Zurück zum Zitat Cao, X., Wei, Y., Wen, F., Sun, J.: Face alignment by explicit shape regression. Int. J. Comput. Vis. 107(2), 117–190 (2012)MathSciNet Cao, X., Wei, Y., Wen, F., Sun, J.: Face alignment by explicit shape regression. Int. J. Comput. Vis. 107(2), 117–190 (2012)MathSciNet
7.
Zurück zum Zitat Chen, C., Dantcheva, A., Ross, A.: Automatic facial makeup detection with application in face recognition. In: International Conference on Biometrics (ICB), pp. 1–8 (2013) Chen, C., Dantcheva, A., Ross, A.: Automatic facial makeup detection with application in face recognition. In: International Conference on Biometrics (ICB), pp. 1–8 (2013)
8.
Zurück zum Zitat Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. The IEEE Transactions on Pattern Analysis and Machine Intelligence pp. 681–685 (2001)CrossRef Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. The IEEE Transactions on Pattern Analysis and Machine Intelligence pp. 681–685 (2001)CrossRef
9.
Zurück zum Zitat Cootes, T.F., Taylor, C.J., Cooper, D.H., Graham, J.: Active shape models-their training and application. Comput. Vis. Image Underst. 61(1), 38–59 (1995)CrossRef Cootes, T.F., Taylor, C.J., Cooper, D.H., Graham, J.: Active shape models-their training and application. Comput. Vis. Image Underst. 61(1), 38–59 (1995)CrossRef
10.
Zurück zum Zitat Cristinacce, D., Cootes, T.F.: Feature detection and tracking with constrained local models. BMVC 41, 929–938 (2006)MATH Cristinacce, D., Cootes, T.F.: Feature detection and tracking with constrained local models. BMVC 41, 929–938 (2006)MATH
11.
Zurück zum Zitat Dantone, M., Gall, J., Fanelli, G., Van Gool, L.: Real-time facial feature detection using conditional regression forests. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2578–2585 (2012) Dantone, M., Gall, J., Fanelli, G., Van Gool, L.: Real-time facial feature detection using conditional regression forests. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2578–2585 (2012)
12.
Zurück zum Zitat Guo, D., Sim, T.: Digital face makeup by example. In: IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 73–79 (2009) Guo, D., Sim, T.: Digital face makeup by example. In: IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 73–79 (2009)
13.
Zurück zum Zitat Dollar, P., Welinder, P., Perona, P.: Cascaded pose regression. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1078–1085 (2010) Dollar, P., Welinder, P., Perona, P.: Cascaded pose regression. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1078–1085 (2010)
14.
Zurück zum Zitat Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)MathSciNetCrossRef Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)MathSciNetCrossRef
15.
Zurück zum Zitat Joseph Tan, D., Holzer, S., Navab, N., Ilic, S.: Deformable template tracking in 1ms. In: Proceedings of the British Machine Vision Conference(BMVC), p. 43C56 (2014) Joseph Tan, D., Holzer, S., Navab, N., Ilic, S.: Deformable template tracking in 1ms. In: Proceedings of the British Machine Vision Conference(BMVC), p. 43C56 (2014)
16.
Zurück zum Zitat Kazemi, V., Sullivan, J.: One millisecond face alignment with an ensemble of regression trees. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1867–1874 (2014) Kazemi, V., Sullivan, J.: One millisecond face alignment with an ensemble of regression trees. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1867–1874 (2014)
17.
Zurück zum Zitat Le, V., Brandt, J., Lin, Z., Bourdev, L., Huang, T.: Interactive facial feature localization. ECCV 7574, 679–692 (2012) Le, V., Brandt, J., Lin, Z., Bourdev, L., Huang, T.: Interactive facial feature localization. ECCV 7574, 679–692 (2012)
18.
Zurück zum Zitat Lee, D., Park, H., Yoo, C.: Face alignment using cascade gaussian process regression trees. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4204–4212 (2015) Lee, D., Park, H., Yoo, C.: Face alignment using cascade gaussian process regression trees. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4204–4212 (2015)
19.
Zurück zum Zitat Liang, L., Xiao, R., Wen, F., Sun, J.: Face alignment via component-based discriminative search. ECCV 5303, 72–85 (2008) Liang, L., Xiao, R., Wen, F., Sun, J.: Face alignment via component-based discriminative search. ECCV 5303, 72–85 (2008)
20.
Zurück zum Zitat Lucey, S., Wang, Y., Cox, M., Sridharan, S., Cohn, J.F.: Efficient constrained local model fitting for non-rigid face alignment. Image Vis. Comput. 27(12), 1804–1813 (2009)CrossRef Lucey, S., Wang, Y., Cox, M., Sridharan, S., Cohn, J.F.: Efficient constrained local model fitting for non-rigid face alignment. Image Vis. Comput. 27(12), 1804–1813 (2009)CrossRef
21.
Zurück zum Zitat Matthews, I., Baker, S.: Active appearance models revisited. Int. J. Comput. Vis. 60(2), 135–164 (2004)CrossRef Matthews, I., Baker, S.: Active appearance models revisited. Int. J. Comput. Vis. 60(2), 135–164 (2004)CrossRef
22.
Zurück zum Zitat Messer, K., Matas, J., Kittler, J., Jonsson, K.: Xm2vtsdb: The extended m2vts database. In: Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA), pp. 72–77 (2000) Messer, K., Matas, J., Kittler, J., Jonsson, K.: Xm2vtsdb: The extended m2vts database. In: Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA), pp. 72–77 (2000)
23.
Zurück zum Zitat Ozuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. IEEE Trans. Pattern Anal. Mach. Intell. 32(3), 448–461 (2010)CrossRef Ozuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. IEEE Trans. Pattern Anal. Mach. Intell. 32(3), 448–461 (2010)CrossRef
24.
Zurück zum Zitat Ren, S., Cao, X., Wei, Y., Sun, J.: Face alignment at 3000 fps via regressing local binary features. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1685–1692 (2014) Ren, S., Cao, X., Wei, Y., Sun, J.: Face alignment at 3000 fps via regressing local binary features. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1685–1692 (2014)
25.
Zurück zum Zitat Saragih, J.M., Lucey, S., Cohn, J.F.: Deformable model fitting by regularized landmark mean-shift. Int. J. Comput. Vis. 91(2), 200–215 (2011)MathSciNetCrossRef Saragih, J.M., Lucey, S., Cohn, J.F.: Deformable model fitting by regularized landmark mean-shift. Int. J. Comput. Vis. 91(2), 200–215 (2011)MathSciNetCrossRef
26.
Zurück zum Zitat Smith, B.M., Brandt, J., Lin, Z., Zhang, L.: Nonparametric context modeling of local appearance for pose- and expression-robust facial landmark localization. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1741–1748 (2014) Smith, B.M., Brandt, J., Lin, Z., Zhang, L.: Nonparametric context modeling of local appearance for pose- and expression-robust facial landmark localization. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1741–1748 (2014)
27.
Zurück zum Zitat Smith, B.M., Zhang, L.: Joint face alignment with non-parametric shape models. Computer Vision–ECCV, pp. 43–56. Springer, New York (2012) Smith, B.M., Zhang, L.: Joint face alignment with non-parametric shape models. Computer Vision–ECCV, pp. 43–56. Springer, New York (2012)
28.
Zurück zum Zitat Sun, Y., Wang, X., Tang, X.: Deep convolutional network cascade for facial point detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3476–3483 (2013) Sun, Y., Wang, X., Tang, X.: Deep convolutional network cascade for facial point detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3476–3483 (2013)
29.
Zurück zum Zitat Tzimiropoulos, G.: Project-out cascaded regression with an application to face alignment. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015) Tzimiropoulos, G.: Project-out cascaded regression with an application to face alignment. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
30.
Zurück zum Zitat Wang, Y., Lucey, S., Cohn, J.F.: Enforcing convexity for improved alignment with constrained local models. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008) Wang, Y., Lucey, S., Cohn, J.F.: Enforcing convexity for improved alignment with constrained local models. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
31.
Zurück zum Zitat Wu, T., Turaga, P., Chellappa, R.: Age estimation and face verification across aging using landmarks. IEEE Trans. Inf. Forensics Secur. 7(6), 1780–1788 (2012)CrossRef Wu, T., Turaga, P., Chellappa, R.: Age estimation and face verification across aging using landmarks. IEEE Trans. Inf. Forensics Secur. 7(6), 1780–1788 (2012)CrossRef
32.
Zurück zum Zitat Xiong, X., De la Torre, F.: Global supervised descent method. In: Computer Vision and Pattern Recognition, pp. 2664–2673 (2015) Xiong, X., De la Torre, F.: Global supervised descent method. In: Computer Vision and Pattern Recognition, pp. 2664–2673 (2015)
33.
Zurück zum Zitat Xiong, X., De la Torre, F.: Supervised descent method and its applications to face alignment. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 532–539 (2013) Xiong, X., De la Torre, F.: Supervised descent method and its applications to face alignment. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 532–539 (2013)
34.
Zurück zum Zitat Yang, H., Mou, W., Zhang, Y., Patras, I., Gunes, H., Robinson, P.: Face alignment assisted by head pose estimation. In: Proceedings of the British Machine Vision Conference (BMVC), pp. 130.1–130.13 (2015) Yang, H., Mou, W., Zhang, Y., Patras, I., Gunes, H., Robinson, P.: Face alignment assisted by head pose estimation. In: Proceedings of the British Machine Vision Conference (BMVC), pp. 130.1–130.13 (2015)
35.
Zurück zum Zitat Yang, H., Patras, I.: Sieving regression forest votes for facial feature detection in the wild. In: IEEE International Conference on Computer Vision (ICCV), pp. 1936–1943 (2013) Yang, H., Patras, I.: Sieving regression forest votes for facial feature detection in the wild. In: IEEE International Conference on Computer Vision (ICCV), pp. 1936–1943 (2013)
36.
Zurück zum Zitat Zhang, J., Shan, S., Kan, M., Chen, X.: Coarse-to-fine auto-encoder networks (cfan) for real-time face alignment. In: ECCV, pp. 1–16 (2014) Zhang, J., Shan, S., Kan, M., Chen, X.: Coarse-to-fine auto-encoder networks (cfan) for real-time face alignment. In: ECCV, pp. 1–16 (2014)
37.
Zurück zum Zitat Zhang, Z., Luo, P., Chen, C.L., Tang, X.: Facial landmark detection by deep multi-task learning. In: ECCV, pp. 94–108 (2014)CrossRef Zhang, Z., Luo, P., Chen, C.L., Tang, X.: Facial landmark detection by deep multi-task learning. In: ECCV, pp. 94–108 (2014)CrossRef
38.
Zurück zum Zitat Zhu, S., Li, C., Loy, C.C., Tang, X.: Face alignment by coarse-to-fine shape searching. In: IEEE Conference on Computer Vision and Pattern Recognition (2015) Zhu, S., Li, C., Loy, C.C., Tang, X.: Face alignment by coarse-to-fine shape searching. In: IEEE Conference on Computer Vision and Pattern Recognition (2015)
39.
Zurück zum Zitat Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2879–2886 (2012) Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2879–2886 (2012)
Metadaten
Titel
Salient-points-guided face alignment
verfasst von
Yangyang Hao
Hengliang Zhu
Kai Wu
Xiao Lin
Lizhuang Ma
Publikationsdatum
13.06.2017
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 5/2019
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-017-0555-8

Weitere Artikel der Ausgabe 5/2019

Multimedia Systems 5/2019 Zur Ausgabe