Elsevier

Pattern Recognition

Volume 54, June 2016, Pages 68-82
Pattern Recognition

Approximately symmetrical face images for image preprocessing in face recognition and sparse representation based classification

https://doi.org/10.1016/j.patcog.2015.12.017Get rights and content

Highlights

  • Developed a novel method to automatically produce approximately axis-symmetrical virtual face images.

  • Treated as an effective image preprocessing method.

  • Used as a virtual image dictionary learning method for image classification.

  • Extensive experiments on different face databases show its effectiveness as an image preprocessing algorithm.

  • The strong identification capability of our method is verified in comparison with state-of-the-art dictionary learning algorithms.

Abstract

Though most of the faces are axis-symmetrical objects, few real-world face images are axis-symmetrical images. In the past years, there are many studies on face recognition, but only little attention is paid to this issue and few studies to explore and exploit the axis-symmetrical property of faces for face recognition are conducted. In this paper, we take the axis-symmetrical nature of faces into consideration and design a framework to produce approximately axis-symmetrical virtual dictionary for enhancing the accuracy of face recognition. It is noteworthy that the novel algorithm to produce axis-symmetrically virtual face images is mathematically very tractable and quite easy to implement. Extensive experimental results demonstrate the superiority in face recognition of the virtual face images obtained using our method to the original face images. Moreover, experimental results on different databases also show that the proposed method can achieve satisfactory classification accuracy in comparison with state-of-the-art image preprocessing algorithms. The MATLAB code of the proposed method can be available at http://www.yongxu.org/lunwen.html.

Introduction

In the face recognition community, various algorithms have been proposed to remedy difficulties caused by the small sample size (SSS) problem and variations of poses, illuminations and facial expressions [1], [2]. It should be pointed out that the problem of varying poses, illuminations and facial expressions usually has the following characteristics. Each face usually has severe variations of poses, illuminations and facial expressions but a face has only a limited number of available face images [3]. This of course causes a great intra-subject difference and the difference may even be greater than the inter-subject difference. This may incur the result that different faces have similar patterns [4], [5]. As a consequence, the performance of face recognition algorithms will be seriously affected.

Many efforts have been made to address these difficulties from the viewpoint of methodology. For example, when linear discriminant analysis (LDA) [6], [8] is applied to face recognition, it usually suffers from the SSS problem. In other words, because the number of face samples for training is almost always smaller than the dimension of the face sample, LDA cannot be directly implemented. A number of variants of the original LDA algorithm have been proposed to overcome this problem. Moreover, various methods have been proposed to perform face recognition across poses, illuminations and facial expressions and to reduce disadvantageous effect on recognition of faces of variations of poses, illuminations and facial expressions [7].

Besides special algorithms are designed to overcome above difficulties, to obtain more virtual face samples is another important way to improve the performance of face recognition [9], [10]. In recent years researchers have made notable advances in this way. For example, symmetrical face images proposed in [11] are very beneficial to overcome the problem of varying poses and illuminations. The mirror face image based face recognition method proposed in [12] has the following merits. The used mirror face images not only can provide possible poses of faces that are not shown in the original face images but also offer us natural face images. Li et al. [13] proposed to enlarge the number of the training samples by exploiting the inter-class relationship and combined the original samples and obtained virtual samples to perform face recognition. The simultaneous use of these synthesized images and original face images can lead to very good performance. Besides the above symmetrical face images and mirror face images, noised or corrupted face images are also available virtual face samples [14], [15].

We see that a few methods take the facial geometry into account when generating more available face samples. However, when humans recognize faces, they often exploit the facial geometry information. A common property of most faces is that they have a symmetrically geometric structure. In other words, the left-half face is almost always the “mirror image” of the right-half face. This motivates us to explore the way to exploit the symmetrical structure of faces when designing automatic face recognition algorithms. It has been proved that the performance of some tasks such as face detection can be promoted by using the symmetrical structure of faces [16], [17], [18]. The facial symmetry can also be applied for shadow compensation and analysis [19], [20]. Moreover, the symmetry of faces has also been shown to be partially beneficial to 3D face recognition [21], [22], [23]. However, it seems that how to effectively exploit the facial symmetry to perform 2D face recognition is not studied in-depth. Especially, in the 2D face recognition community, there are only few studies on how to design an algorithm of automatically generating symmetrical face images. For 2D face recognition, the literature showed that if the 2D face image is completely an axis-symmetrical image, then we can use only one-half of the face image for the classification of faces [24]. However, most of the real 2D face images are not axis-symmetrical. Thus, if only one-half of the face image is used for matching and classification of faces, there will be much information loss and the classification performance will be degraded. Previous research also demonstrates that enhancing the symmetry of a face image is a meaningful and effective strategy [25], [26].

In the past, there were some studies on preprocessing of face images [29], [30]. For example, some methods to reduce effects on face recognition of illuminations have been proposed [27], [28]. With the development of the time–frequency theory, frequency domain features such as discrete cosine transform (DCT) feature [31] were extensively used for face recognition. Because the small-scale feature is more robust to illumination variations in recognition tasks, the logarithmic total variation (LTV) model [32] and the logarithmic wavelet transform (LWT) method [33] based on the multi-scale facial structure representation were proposed as image preprocessing methods for face recognition. In the light of the fact that each face image can be constructed by the integration of the large- and small-scale features, Xie et al. [34] proposed to integrate the large- and small-scale features (LSSF) of face images to perform normalization of face illumination. Zhang et al. [35] proposed to use gradientfaces (GRF) and Šrtuc and Pavešić [36] presented a non-local means based normalization method (NLM) as image preprocessing techniques for robust face recognition. Cyganek et al. [37] proposed a very impressive method, which used Support Vector Machines with specific tensor kernel and achieved high accuracy for processing multidimensional images (among which are face datasets) without any need for sophisticated pre-processing or posing restrictions on the face pose. In addition, the Lambertian reflectance model also has received much attention in remedying the problems caused by unbalanced illuminations [39]. However, a real face image is usually not completely subject to the assumption of this model. The illuminating light field method also seems to be applicable for face recognition across illuminations and poses [40]. However, this method is still established on the basis of the Lambertain reflectance model.

In this paper, we propose a novel face image preprocessing method and also use it to generate a virtual dictionary for image classification. The proposed method aims at producing virtual face images with an approximately symmetrical structure, which will be utilized in both image preprocessing and image classification. Our method has the following notable merits. (1) Our method can well reduce the negative effect on face images of heterogeneous illuminations and it can be used as a face image preprocessing method. (2) The proposed method can automatically produce an approximately axis-symmetrical virtual face image from an original face image. Because the virtual face image is approximately axis-symmetrical, it can effectively reduce negative effects on face recognition of varying face appearance. As a result, the use of the virtual face images enables the accuracy of a face recognition algorithm to be improved. Thus, this not only provides a good solution to the face image preprocessing problem, but also can generate geometrically symmetrical and more attractive face images. To evaluate the performance of our method in two different applications, i.e. face image preprocessing and generation of virtual dictionary, we design experiments composed of two parts. We first applied our method as an image preprocessing method to all images including training samples and test samples from different datasets, and then exploited the nearest neighbor classifier for classification. In addition, we also applied our method on different face databases to generate virtual training dictionaries and then employed different sparse presentation based classification algorithms for classification. It should be pointed out that in the second part of experiments, our method was applied only to the original training sample and the test samples were not used to generate virtual samples. This makes use of our method partially similar to the dictionary learning method in face recognition. In summary, the paper has the following important contributions:

  • It proposes an originally creative idea and algorithm to automatically produce approximately axis-symmetrical virtual face images.

  • The proposed method can be treated as an image preprocessing method for face recognition, and extensive experiments on different face databases show the effectiveness of our method.

  • The images synthesized by our method can be used as a virtual image dictionary, and the strong recognition capability of our method is verified in comparison with state-of-the-art dictionary learning algorithms in image classification.

  • As an unsupervised image classification and image preprocessing method, our method can work much better than the benchmark that uses the original image datasets for classification.

The rest of our paper is organized as follows. Section 2 presents the description of the proposed method in detail. The analysis and advantages of the proposed method and the obtained virtual images are shown in Section 3. In Section 4, we introduce the experimental evaluation of our method as the image preprocessing method on some widely used databases. In Section 5, the experimental evaluation of generating the virtual dictionary is reported. The analysis and discussion are explicitly presented in Section 6. Finally, we give our conclusions in Section 7.

Section snippets

The description of the proposed method

In this section we describe the main steps of the proposed method in detail. Let z1 and z2 denote the left vector and right vector of an original face image, respectively. In particular, both z1 and z2 are supposed to be column vectors. The procedure to obtain z1 and z2 is as follows. Let F stand for an original face image. Let Fj(j=1,2,,J) denote that j-th column of 2D image matrix F. The pixel at the i-th row and j-th column of F is denoted by Fij. Suppose that J is an even number. z1 and z2

Analysis and advantages of the proposed method

The proposed method has the following advantages. First, it can easily generate symmetrical virtual face images from original face images. This not only can reduce the negative effect on face recognition of facial pose and illumination variations but also may make the geometry of face images look more consistent with that of true faces. Second, the proposed method is simple and easy to implement. It produces approximately symmetrical face images at a low computational cost. In addition, we say

Experimental evaluation of our method as the image preprocessing method

In our experiments, the proposed method was evaluated by using the FERET [44], ORL [45], Georgia Tech (GT) [46] and Labeled Faces in the Wild (LFW) databases [47]. In order to evaluate the capability of our method in image preprocessing, we compared our method with state-of-the-art image preprocessing methods, i.e. the logarithmic wavelet transform (LWT) method [33], logarithmic total variation (LTV) [32], local binary pattern histogram Fourier (LBPHF) features [48], large- and small-scale

Experimental evaluation on the generated virtual dictionary

The goal of the experiments in this section is to test the effectiveness of the virtual dictionary generated using our method. Our method is first applied to the original training samples to obtain the virtual training samples, and then it constructs the virtual dictionary by stacking these virtual sample vectors to form a matrix. The original test sample is not processed such that our method is similar to the dictionary learning method in face recognition. Because the dictionary learning based

Limitations and discussion

In this paper, we proposed an efficient and effective method to perform face image preprocessing and to synthesize a virtual dictionary for image classification. Our method directly utilizes the geometrical midline of the face image to separate it into two-half images with the same size and exploits both of the left-half and right-half images of the face image to obtain a symmetrical face image for image preprocessing and classification. The general framework of generating an approximately

Conclusions

Our work brings a novel viewpoint and way to automatically produce approximately axis-symmetrical virtual face images and to explore them for better recognition of faces. It seems that no similar works are presented in the past. The designed method is not only novel and simple but also is very competent. Moreover, after approximately axis-symmetrical virtual face images are obtained, an arbitrary classification method or classifier can be applied for recognition of faces. This means that what

Conflict of interest

There is no conflict of interests.

Acknowledgments

This paper is partially supported by the National Natural Science Foundation of China (Nos. 61370163, 61233011, 61332011, 61271344), Shenzhen Municipal Science and Technology Innovation Council (Nos. JCYJ20130329151843309 and JCYJ20140904154630436), Shenzhen Fundamental Research Fund (JCYJ20140508160910917 and JCYJ20150403161923528).

Yong Xu was born in Sichuan, China, in 1972. He received his B.S. degree and M.S. degree at the Air Force Institute of Meteorology (China) in 1994 and 1997, respectively. He received the Ph.D. degree in Pattern recognition and Intelligence System at the Nanjing University of Science and Technology (NUST) in 2005. Now, he works at Shenzhen Graduate School, Harbin Institute of Technology. His current interests include pattern recognition, biometrics, machine learning and video analysis.

References (58)

  • H.T. Ho et al.

    Pose-invariant face recognition using Markov random fields

    IEEE Trans. Image Process.

    (2013)
  • Y. Xu et al.

    Data uncertainty in face recognition

    IEEE Trans. Cybern.

    (2014)
  • Y. Wang et al.

    Robust 3D face recognition by local shape difference boosting

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2010)
  • A.S. Georghiades et al.

    From few to manyillumination cone models for face recognition under variable lighting and pose

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2001)
  • J. Gui et al.

    Representative vector machinesa unified framework for classical classifiers

    IEEE Trans. Cybern.

    (2016)
  • W. Gao, S. Shan, X. Chai, X. Fu, Virtual face image generation for illumination and pose insensitive face recognition,...
  • Y. Xu et al.

    Integrating conventional and inverse representation for face recognition

    IEEE Trans. Cybern.

    (2014)
  • Q. Li et al.

    Enlarge the training set based on inter-class relationship for face recognition from one image per person

    Plos One

    (2013)
  • D. Tang et al.

    A novel sparse representation method based on virtual samples for face recognition

    Neural Comput. Appl.

    (2014)
  • M.C. Su, C.H. Chou, Application of associative memory in human face detection, in: Proceedings of the International...
  • S. Saha, S. Bandyopadhyay, A symmetry based face detection technique, in: Proceedings of the IEEE WIE National...
  • W.Y. Zhao, R. Chellappa, Illumination-insensitive face recognition using symmetric shape-from-shading, in: Proceedings...
  • G. Passalis et al.

    Using facial symmetry to handle pose variations in real-world 3D face recognition

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2011)
  • J. Harguess, J.K. Aggarwal, Is there a connection between face symmetry and face recognition? in: Proceedings of the...
  • L. Zhang et al.

    3D face authentication and recognition based on bilateral symmetry analysis

    Vis. Comput.

    (2006)
  • A.K. Singh, G.C. Nandi, Face recognition using facial symmetry, in: Proceedings of the Conference on Computer Science...
  • A.C. Little

    Domain specificity in human symmetry preferencessymmetry is most pleasant when looking at human faces

    Symmetry

    (2014)
  • Q. Liao et al.

    Enhancing the symmetry and proportion of 3D face geometry

    IEEE Trans. Vis. Comput. Graph.

    (2012)
  • X. Tan et al.

    Enhanced local texture feature sets for face recognition under difficult lighting conditions

    IEEE Trans. Image Process.

    (2010)
  • Cited by (0)

    Yong Xu was born in Sichuan, China, in 1972. He received his B.S. degree and M.S. degree at the Air Force Institute of Meteorology (China) in 1994 and 1997, respectively. He received the Ph.D. degree in Pattern recognition and Intelligence System at the Nanjing University of Science and Technology (NUST) in 2005. Now, he works at Shenzhen Graduate School, Harbin Institute of Technology. His current interests include pattern recognition, biometrics, machine learning and video analysis.

    Zheng Zhang received the B.S. degree from Henan University of Science and Technology and M.S. degree from Shenzhen Graduate School, Harbin Institute of Technology (HIT) in 2012 and 2014, respectively. Currently, he is pursuing the Ph.D. degree in computer science and technology at Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen, China. His current research interests include pattern recognition, machine learning and computer vision.

    Guangming Lu received the B.S. degree in electrical engineering, master׳s degree in control theory and control engineering, and the Ph.D. degree in computer science and engineering from the Harbin Institute of Technology (HIT), Harbin, China, in 1998, 2000, and 2005, respectively. He was a Post-Doctoral Fellow with Tsinghua University, Beijing, China, from 2005 to 2007. He is currently a Professor with the Biocomputing Research Center, Shenzhen Graduate School, HIT. His current research interests include pattern recognition, image processing, and automated biometric technologies and applications.

    Jian Yang received the B.S. degree in mathematics from the Xuzhou Normal University in 1995. He received the M.S. degree in applied mathematics from the Changsha Railway University in 1998 and the Ph.D. degree from the Nanjing University of Science and Technology (NUST), on the subject of pattern recognition and intelligence systems in 2002. In 2003, he was a postdoctoral researcher at the University of Zaragoza. From 2004 to 2006, he was a Postdoctoral Fellow at Biometrics Centre of Hong Kong Polytechnic University. From 2006 to 2007, he was a Postdoctoral Fellow at Department of Computer Science of New Jersey Institute of Technology. Now, he is a professor in the School of Computer Science and Technology of NUST. He is the author of more than 80 scientific papers in pattern recognition and computer vision. His journal papers have been cited more than 1600 times in the ISI Web of Science, and 2800 times in the Web of Scholar Google. His research interests include pattern recognition, computer vision and machine learning. Currently, he is an Associate Editor of Pattern Recognition Letters and IEEE Transaction On Neural Networks and Learning Systems, respectively.

    View full text