Skip to main content
Top
Published in: Neural Computing and Applications 13/2022

28-05-2022 | Review

Frontal face reconstruction based on detail identification, variable scale self-attention and flexible skip connection

Authors: Haokun Luo, Shengcai Cen, Qichen Ding, Xueyun Chen

Published in: Neural Computing and Applications | Issue 13/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Reconstruction of the frontal face from the profile is of great significance for face recognition in complex scenes. The existing mainstream methods of face reconstruction, such as FF-GAN, CAPG-GAN, TP-GAN, etc., have made good progresses on improving the generator network, but fewer considerations on the identification of face details and the extraction of spatial context features. To address the problem, we propose the frontal face reconstruction based on the detail discrimination, variable scale self attention, and flexible skip connection (FR-DVF): designing a group of discriminators for multi-scale detail region identification, a novel encoder-decoder generator structure with a variable scale type of self-attention module, which inserts a max-pooling layer into the pathways of the traditional module to reduce its feature-dimension and computing-cost, and a flexible type of the skip-connections to alleviate the stiff property of the traditional connections between the encoder and decoder layers. After adding detail discrimination, variable scale self attention module, and flexible skip connection structure, the rank-1 recognition rate (\(\%\)) of DVF-FR in the database of M2FPA increased by 2.94, 1.93 and 1.67\(\%\), respectively, as well as that occurred in FERET.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Akshay A, Marks Tim K, Jones Michael J, Tieu Kinh H, Rohith MV (2011) Fully automatic pose-invariant face recognition via 3d pose normalization. In: 2011 international conference on computer vision, pp 937–944 Akshay A, Marks Tim K, Jones Michael J, Tieu Kinh H, Rohith MV (2011) Fully automatic pose-invariant face recognition via 3d pose normalization. In: 2011 international conference on computer vision, pp 937–944
2.
go back to reference Feng GC, Yuen PC (2000) Recognition of head-and-shoulder face image using virtual frontal-view image. IEEE Trans Syst Man Cybern Part A Syst Humans 30(6):871–882CrossRef Feng GC, Yuen PC (2000) Recognition of head-and-shoulder face image using virtual frontal-view image. IEEE Trans Syst Man Cybern Part A Syst Humans 30(6):871–882CrossRef
3.
go back to reference Guo Y, Juyong Z, Jianfei C, Boyi J, Jianmin J (2019) Cnn-based real-time dense face reconstruction with inverse-rendered photo-realistic face images. IEEE Trans Pattern Anal Mach Intell 41(6):1294–1307CrossRef Guo Y, Juyong Z, Jianfei C, Boyi J, Jianmin J (2019) Cnn-based real-time dense face reconstruction with inverse-rendered photo-realistic face images. IEEE Trans Pattern Anal Mach Intell 41(6):1294–1307CrossRef
4.
go back to reference Liang S, Xiaoning S, Tao Z, Yuquan Z (2019) Histogram-based crc for 3d-aided pose-invariant face recognition. Sensors, 19(4) Liang S, Xiaoning S, Tao Z, Yuquan Z (2019) Histogram-based crc for 3d-aided pose-invariant face recognition. Sensors, 19(4)
5.
go back to reference Hang Z, Jihao L, Ziwei L, Yu L, Xiaogang W (2020) Rotate-and-render: Unsupervised photorealistic face rotation from single-view images. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), June Hang Z, Jihao L, Ziwei L, Yu L, Xiaogang W (2020) Rotate-and-render: Unsupervised photorealistic face rotation from single-view images. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), June
6.
go back to reference Xi Y, Xiang Y, Kihyuk S, Xiaoming L, Manmohan C (2017) Towards large-pose face frontalization in the wild. In: Proceeding of international conference on computer vision, Venice, Italy, October Xi Y, Xiang Y, Kihyuk S, Xiaoming L, Manmohan C (2017) Towards large-pose face frontalization in the wild. In: Proceeding of international conference on computer vision, Venice, Italy, October
7.
go back to reference Meina K, Shiguang S, Hong C, Xilin C (2014) Stacked progressive auto-encoders (spae) for face recognition across poses. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), June Meina K, Shiguang S, Hong C, Xilin C (2014) Stacked progressive auto-encoders (spae) for face recognition across poses. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), June
8.
go back to reference Forrester C, David B, Dilip K, Aaron S, Inbar M, Freeman William T (2017) Synthesizing normalized faces from facial identity features. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), July Forrester C, David B, Dilip K, Aaron S, Inbar M, Freeman William T (2017) Synthesizing normalized faces from facial identity features. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), July
9.
go back to reference Xin Yu, Porikli F, Fernando B, Hartley R (2020) Hallucinating unaligned face images by multiscale transformative discriminative networks. Int J Comput Vision 128(2):500–526CrossRef Xin Yu, Porikli F, Fernando B, Hartley R (2020) Hallucinating unaligned face images by multiscale transformative discriminative networks. Int J Comput Vision 128(2):500–526CrossRef
10.
go back to reference Junho Y, Heechul J, ByungIn Y, Changkyu C, Dusik P, Junmo K (2015) Rotating your face using multi-task deep neural network. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), June Junho Y, Heechul J, ByungIn Y, Changkyu C, Dusik P, Junmo K (2015) Rotating your face using multi-task deep neural network. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), June
11.
go back to reference Zhihong Zhang X, Chen BW, Guosheng H, Zuo W, Hancock ER (2019) Face frontalization using an appearance-flow-based convolutional neural network. IEEE Trans Image Process 28(5):2187–2199MathSciNetCrossRef Zhihong Zhang X, Chen BW, Guosheng H, Zuo W, Hancock ER (2019) Face frontalization using an appearance-flow-based convolutional neural network. IEEE Trans Image Process 28(5):2187–2199MathSciNetCrossRef
12.
go back to reference Luan T, Xi Y, Xiaoming L (2017) Disentangled representation learning gan for pose-invariant face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1415–1424 Luan T, Xi Y, Xiaoming L (2017) Disentangled representation learning gan for pose-invariant face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1415–1424
13.
go back to reference Rui H, Shu Z, Tianyu L, Ran H (2017) Beyond face rotation: Global and local perception gan for photorealistic and identity preserving frontal view synthesis. In: Proceedings of the IEEE international conference on computer vision (ICCV), Oct Rui H, Shu Z, Tianyu L, Ran H (2017) Beyond face rotation: Global and local perception gan for photorealistic and identity preserving frontal view synthesis. In: Proceedings of the IEEE international conference on computer vision (ICCV), Oct
14.
go back to reference Yibo H, Xiang W, Bing Y, Ran H, Zhenan S (2018) Pose-guided photorealistic face rotation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June Yibo H, Xiang W, Bing Y, Ran H, Zhenan S (2018) Pose-guided photorealistic face rotation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June
15.
go back to reference Dzmitry B, Kyunghyun C, Yoshua B (2016) Neural machine translation by jointly learning to align and translate Dzmitry B, Kyunghyun C, Yoshua B (2016) Neural machine translation by jointly learning to align and translate
16.
go back to reference Wei S, Tianfu W (2019) Learning spatial pyramid attentive pooling in image synthesis and image-to-image translation Wei S, Tianfu W (2019) Learning spatial pyramid attentive pooling in image synthesis and image-to-image translation
17.
go back to reference He Z, Kan M, Zhang J and Shan S (2020) Progressive attention generative adversarial network for facial attribute editing, Pa-gan He Z, Kan M, Zhang J and Shan S (2020) Progressive attention generative adversarial network for facial attribute editing, Pa-gan
18.
go back to reference Yu Y, Songyao J, Robinson Joseph P, Yun F (2020) Dual-attention gan for large-pose face frontalization. In: 2020 15th IEEE international conference on automatic face and gesture recognition (FG 2020), pp 249–256 Yu Y, Songyao J, Robinson Joseph P, Yun F (2020) Dual-attention gan for large-pose face frontalization. In: 2020 15th IEEE international conference on automatic face and gesture recognition (FG 2020), pp 249–256
19.
go back to reference Yuhang L, Xuejin C, Feng W, Zheng Z (2019) Linestofacephoto: face photo generation from lines with conditional self-attention generative adversarial networks. MM ’19, pp 2323-2331, New York, NY, USA,. Association for Computing Machinery Yuhang L, Xuejin C, Feng W, Zheng Z (2019) Linestofacephoto: face photo generation from lines with conditional self-attention generative adversarial networks. MM ’19, pp 2323-2331, New York, NY, USA,. Association for Computing Machinery
20.
go back to reference He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR) He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR)
21.
go back to reference Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Springer, Cham Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Springer, Cham
22.
go back to reference Goodfellow IJ, Pouget-Abadie J, Mirza M, Bing X, Bengio Y (2014) Generative adversarial nets. MIT Press Goodfellow IJ, Pouget-Abadie J, Mirza M, Bing X, Bengio Y (2014) Generative adversarial nets. MIT Press
23.
go back to reference Jaderberg M, Simonyan K, Zisserman A et al (2015) Spatial transformer networks. Adv Neural Inf Process Syst 28:2017–2025 Jaderberg M, Simonyan K, Zisserman A et al (2015) Spatial transformer networks. Adv Neural Inf Process Syst 28:2017–2025
24.
go back to reference Xiaolong W, Ross G, Abhinav G, Kaiming H (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803 Xiaolong W, Ross G, Abhinav G, Kaiming H (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
25.
go back to reference Jie H, Li S, Gang S (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141 Jie H, Li S, Gang S (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
26.
go back to reference Xiang L, Wenhai W, Xiaolin H, Jian Y (2019) Selective kernel networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 510–519 Xiang L, Wenhai W, Xiaolin H, Jian Y (2019) Selective kernel networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 510–519
27.
go back to reference Sanghyun W, Jongchan P, Joon-Young L, So KI (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19 Sanghyun W, Jongchan P, Joon-Young L, So KI (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
28.
go back to reference Jun-Yan Z, Taesung P, Phillip I, Efros Alexei A (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2223–2232 Jun-Yan Z, Taesung P, Phillip I, Efros Alexei A (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2223–2232
29.
30.
go back to reference Alex K, Ilya S, Hinton Geoffrey E (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inform Process Syst 25:1097–1105 Alex K, Ilya S, Hinton Geoffrey E (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inform Process Syst 25:1097–1105
31.
go back to reference Peipei L, Xiang W, Yibo H, Ran H, Zhenan S (2019) M2fpa: a multi-yaw multi-pitch high-quality dataset and benchmark for facial pose analysis. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 10043–10051 Peipei L, Xiang W, Yibo H, Ran H, Zhenan S (2019) M2fpa: a multi-yaw multi-pitch high-quality dataset and benchmark for facial pose analysis. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 10043–10051
32.
go back to reference Jonathon PP, Harry W, Jeffery H, Rauss Patrick J (1998) The feret database and evaluation procedure for face-recognition algorithms. Image Vision Comput 16(5):295–306CrossRef Jonathon PP, Harry W, Jeffery H, Rauss Patrick J (1998) The feret database and evaluation procedure for face-recognition algorithms. Image Vision Comput 16(5):295–306CrossRef
Metadata
Title
Frontal face reconstruction based on detail identification, variable scale self-attention and flexible skip connection
Authors
Haokun Luo
Shengcai Cen
Qichen Ding
Xueyun Chen
Publication date
28-05-2022
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 13/2022
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-022-07124-5

Other articles of this Issue 13/2022

Neural Computing and Applications 13/2022 Go to the issue

Deep Learning for Biomedical and Healthcare Applications

Scene guided colorization using neural networks

Premium Partner