Skip to main content
Erschienen in: Soft Computing 6/2011

01.06.2011 | Focus

Automatic localization and annotation of facial features using machine learning techniques

verfasst von: Paul C. Conilione, Dianhui Wang

Erschienen in: Soft Computing | Ausgabe 6/2011

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Content-based image retrieval (CBIR) systems traditionally find images within a database that are similar to query image using low level features, such as colour histograms. However, this requires a user to provide an image to the system. It is easier for a user to query the CBIR system using search terms which requires the image content to be described by semantic labels. However, finding a relationship between the image features and semantic labels is a challenging problem to solve. This paper aims to discover semantic labels for facial features for use in a face image retrieval system. Face image retrieval traditionally uses global face-image information to determine similarity between images. However little has been done in the field of face image retrieval to use local face-features and semantic labelling. Our work aims to develop a clustering method for the discovery of semantic labels of face-features. We also present a machine learning based face-feature localization mechanism which we show has promise in providing accurate localization.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
The ffs and d notation is not included in the diagram for clarity.
 
2
Known as Stratified K-fold cross validation.
 
3
Detailed in Sect. 2.2.1.
 
Literatur
Zurück zum Zitat Ai H, Liang L, Xiao X, Xu G (2001) Face indexing and retrieval in personal digital album. In: Proceedings of 2nd IEEE Pacific rim conference on multimedia, vol 2195. Springer, London, pp 48–54 Ai H, Liang L, Xiao X, Xu G (2001) Face indexing and retrieval in personal digital album. In: Proceedings of 2nd IEEE Pacific rim conference on multimedia, vol 2195. Springer, London, pp 48–54
Zurück zum Zitat Asthana A, Goecke R, Quadrianto N, Gedeon T (2009) Learning based automatic face annotation for arbitrary poses and expressions from frontal images only. In: Proceedings of IEEE conference on computer vis and pattern recognit. pp 1635–1642 Asthana A, Goecke R, Quadrianto N, Gedeon T (2009) Learning based automatic face annotation for arbitrary poses and expressions from frontal images only. In: Proceedings of IEEE conference on computer vis and pattern recognit. pp 1635–1642
Zurück zum Zitat Belhumeur PN, Hespanha J, Kriegman DJ (1997) Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef Belhumeur PN, Hespanha J, Kriegman DJ (1997) Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef
Zurück zum Zitat Cai D, He X, Han J, Zhang H-J (2006) Orthogonal laplacianfaces for face recognition. IEEE Trans Image Process 15(11):3608–3614CrossRef Cai D, He X, Han J, Zhang H-J (2006) Orthogonal laplacianfaces for face recognition. IEEE Trans Image Process 15(11):3608–3614CrossRef
Zurück zum Zitat Carneiro G, Chan AB, Moreno PJ, Vasconcelos N (2007) Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans Pattern Anal Mach Intell 29(3):394–410CrossRef Carneiro G, Chan AB, Moreno PJ, Vasconcelos N (2007) Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans Pattern Anal Mach Intell 29(3):394–410CrossRef
Zurück zum Zitat Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60CrossRef Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60CrossRef
Zurück zum Zitat Gao Y, Qi Y (2005) Robust visual similarity retrieval in single model face databases. Pattern Recognit 38(7):1009–1020CrossRef Gao Y, Qi Y (2005) Robust visual similarity retrieval in single model face databases. Pattern Recognit 38(7):1009–1020CrossRef
Zurück zum Zitat Hanif SM, Prevost L, Belaroussi R, Milgram M (2008) Real-time facial feature localization by combining space displacement neural networks. Pattern Recognit Lett 29(8):1094–1104CrossRef Hanif SM, Prevost L, Belaroussi R, Milgram M (2008) Real-time facial feature localization by combining space displacement neural networks. Pattern Recognit Lett 29(8):1094–1104CrossRef
Zurück zum Zitat He X, Cai D, Han J (2008) Learning a maximum margin subspace for image retrieval. IEEE Trans Knowl Data Eng 20(2):189–201CrossRef He X, Cai D, Han J (2008) Learning a maximum margin subspace for image retrieval. IEEE Trans Knowl Data Eng 20(2):189–201CrossRef
Zurück zum Zitat Heisele B, Serre T, Poggio T (2007) A component-based framework for face detection and identification. Int J Comput Vis 74(2):167–181CrossRef Heisele B, Serre T, Poggio T (2007) A component-based framework for face detection and identification. Int J Comput Vis 74(2):167–181CrossRef
Zurück zum Zitat Heisele B, Serre T, Pontil M, Poggio T (2001) Component-based face detection. In: Proceedings of IEEE conference on computer visual and pattern recognition, vol 1. IEEE Computer Society, Los Alamitos, CA, USA, pp 657–662 Heisele B, Serre T, Pontil M, Poggio T (2001) Component-based face detection. In: Proceedings of IEEE conference on computer visual and pattern recognition, vol 1. IEEE Computer Society, Los Alamitos, CA, USA, pp 657–662
Zurück zum Zitat Hsu CW, Chang CC, Lin C-J (2005) A practical guide to support vector classification Hsu CW, Chang CC, Lin C-J (2005) A practical guide to support vector classification
Zurück zum Zitat Hsu RL, Abdel Mottaleb M, Jain AK (2002) Face detection in color images. IEEE Trans Pattern Anal Mach Intell 24(5):696–706CrossRef Hsu RL, Abdel Mottaleb M, Jain AK (2002) Face detection in color images. IEEE Trans Pattern Anal Mach Intell 24(5):696–706CrossRef
Zurück zum Zitat Hsu RL, Jain AK (2002) Semantic face matching. In: Proceedings of IEEE international conference on multimedia and expo, vol 2, pp 145–148 Hsu RL, Jain AK (2002) Semantic face matching. In: Proceedings of IEEE international conference on multimedia and expo, vol 2, pp 145–148
Zurück zum Zitat Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of international conference on neural networks, vol 2. IEEE, pp 985–990 Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of international conference on neural networks, vol 2. IEEE, pp 985–990
Zurück zum Zitat Ito H, Koshimizu H (2006) Face image retrieval and annotation based on two latent semantic spaces in FIARS. In:Proceedings of 8th IEEE international multimedia. IEEE Computer Society, Washington, DC, USA, pp 831–836 Ito H, Koshimizu H (2006) Face image retrieval and annotation based on two latent semantic spaces in FIARS. In:Proceedings of 8th IEEE international multimedia. IEEE Computer Society, Washington, DC, USA, pp 831–836
Zurück zum Zitat Lai PJ, Wang JH (2003) Facial image database for law enforcement application: an implementation. In: Proceedings of 37th IEEE international conference on security technology. Taipei, Taiwan, pp 285–289 Lai PJ, Wang JH (2003) Facial image database for law enforcement application: an implementation. In: Proceedings of 37th IEEE international conference on security technology. Taipei, Taiwan, pp 285–289
Zurück zum Zitat Levenberg K (1944) A method for the solution of certain problems in least squares. Quart Appl Math 2:164–168MathSciNetMATH Levenberg K (1944) A method for the solution of certain problems in least squares. Quart Appl Math 2:164–168MathSciNetMATH
Zurück zum Zitat Li CM, Li YS, Zhuang QD, Xiao ZZ (2004) The face localization and regional features extraction. In: Proceedings of international conference on machine learn and cybernetics, vol 6. pp 3835–3840 Li CM, Li YS, Zhuang QD, Xiao ZZ (2004) The face localization and regional features extraction. In: Proceedings of international conference on machine learn and cybernetics, vol 6. pp 3835–3840
Zurück zum Zitat Lin SH, Kung SY, Lin LJ (1997) Face recognition/detection by probabilistic decision-based neural network. IEEE Trans Neural Netw 8(1):114–132CrossRef Lin SH, Kung SY, Lin LJ (1997) Face recognition/detection by probabilistic decision-based neural network. IEEE Trans Neural Netw 8(1):114–132CrossRef
Zurück zum Zitat Lu Y, Guo H, Feldkamp L (1998) Robust neural learning from unbalanced data samples. In: Proceedings of IEEE international joint conference on neural networks, vol 3. Anchorage, Alaska, USA, pp 1816–1821 Lu Y, Guo H, Feldkamp L (1998) Robust neural learning from unbalanced data samples. In: Proceedings of IEEE international joint conference on neural networks, vol 3. Anchorage, Alaska, USA, pp 1816–1821
Zurück zum Zitat MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Cam LL, Neyman J (eds.) Proceedings of fifth Berkeley symposium on math statist and prob, vol 1. University of California, pp 281–297 MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Cam LL, Neyman J (eds.) Proceedings of fifth Berkeley symposium on math statist and prob, vol 1. University of California, pp 281–297
Zurück zum Zitat Nguyen D, Widrow B (1990) Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights. In: Proceedings of international joint conference on neural networks, vol 3, pp 21–26 Nguyen D, Widrow B (1990) Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights. In: Proceedings of international joint conference on neural networks, vol 3, pp 21–26
Zurück zum Zitat Rasiwasia N, Moreno PL, Vasconcelos N (2007) Bridging the gap: query by semantic example. IEEE Trans Multimed 9(5):923–938CrossRef Rasiwasia N, Moreno PL, Vasconcelos N (2007) Bridging the gap: query by semantic example. IEEE Trans Multimed 9(5):923–938CrossRef
Zurück zum Zitat Sahbi H (2008) A particular gaussian mixture model for clustering and its application to image retrieval. Soft Comput 12(7):667–676CrossRef Sahbi H (2008) A particular gaussian mixture model for clustering and its application to image retrieval. Soft Comput 12(7):667–676CrossRef
Zurück zum Zitat Sheikholeslami G, Chang W, Zhang A (2002) SemQuery: semantic clustering and querying on heterogeneous features for visual data. IEEE Trans Knowl Data Eng 14(5):988–1002CrossRef Sheikholeslami G, Chang W, Zhang A (2002) SemQuery: semantic clustering and querying on heterogeneous features for visual data. IEEE Trans Knowl Data Eng 14(5):988–1002CrossRef
Zurück zum Zitat Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380CrossRef Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380CrossRef
Zurück zum Zitat Sridharan K, Nayak S, Chikkerur S, Govindaraju V (2005) A probabilistic approach to semantic face retrieval system. In: Kanade T, Jain A, Ratha NK (eds) Audio- and video-based biometric person authentication, vol 3546 of Lecture Notes in Computer Science. Springer, Berlin, pp 977–986 Sridharan K, Nayak S, Chikkerur S, Govindaraju V (2005) A probabilistic approach to semantic face retrieval system. In: Kanade T, Jain A, Ratha NK (eds) Audio- and video-based biometric person authentication, vol 3546 of Lecture Notes in Computer Science. Springer, Berlin, pp 977–986
Zurück zum Zitat Tan SC, Rao MVC, Lim CP (2008) A hybrid neural network classifier combining ordered fuzzy artmap and the dynamic decay adjustment algorithm. Soft Comput 12(8):765–775CrossRef Tan SC, Rao MVC, Lim CP (2008) A hybrid neural network classifier combining ordered fuzzy artmap and the dynamic decay adjustment algorithm. Soft Comput 12(8):765–775CrossRef
Zurück zum Zitat Tesic J, Smith JR (2006) Semantic labeling of multimedia content clusters. In: Proceedings of IEEE international conference on multimedia and expo. pp 1493–1496 Tesic J, Smith JR (2006) Semantic labeling of multimedia content clusters. In: Proceedings of IEEE international conference on multimedia and expo. pp 1493–1496
Zurück zum Zitat Turk M, Pentland A (1991) Eigenfaces for recognition. Cogn Neurosci 3(1):71–86CrossRef Turk M, Pentland A (1991) Eigenfaces for recognition. Cogn Neurosci 3(1):71–86CrossRef
Zurück zum Zitat Wang DH (2006) ELM-based multiple classifier systems. In: Proceedings of international conference on control, sutom, robot and vis. pp 1–5 Wang DH (2006) ELM-based multiple classifier systems. In: Proceedings of international conference on control, sutom, robot and vis. pp 1–5
Zurück zum Zitat Wang DH, Kim Y-S, Park SC, Lee CS, Han YK (2007) Learning based neural similarity metrics for multimedia data mining. Soft Comput 11(4):335–340CrossRef Wang DH, Kim Y-S, Park SC, Lee CS, Han YK (2007) Learning based neural similarity metrics for multimedia data mining. Soft Comput 11(4):335–340CrossRef
Zurück zum Zitat Wang DH, Ma XH (2005) A hybrid image retrieval system with user’s relevance feedback using neurocomputing. Informatica 29:271–279 Wang DH, Ma XH (2005) A hybrid image retrieval system with user’s relevance feedback using neurocomputing. Informatica 29:271–279
Zurück zum Zitat Wu B, Ai H, Huang C (2004a) Facial Image Retrieval based on Demographic Classification. In: Proceedings of 17th international conference on pattern recognition, vol 3. pp 914–917 Wu B, Ai H, Huang C (2004a) Facial Image Retrieval based on Demographic Classification. In: Proceedings of 17th international conference on pattern recognition, vol 3. pp 914–917
Zurück zum Zitat Wu TF, Lin CJ, Weng RC (2004b) Probability estimates for multi-class classification by pairwise coupling. Machine Learn Res 5:975–1005MathSciNet Wu TF, Lin CJ, Weng RC (2004b) Probability estimates for multi-class classification by pairwise coupling. Machine Learn Res 5:975–1005MathSciNet
Zurück zum Zitat Xu R, Wunsch DI (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3):645–678CrossRef Xu R, Wunsch DI (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3):645–678CrossRef
Zurück zum Zitat Yang M-H, Kriegman DJ, Ahuja N (2002) Detecting Faces in Images: A Survey. IEEE Trans Pattern Anal Mach Intell 24(1):34–58CrossRef Yang M-H, Kriegman DJ, Ahuja N (2002) Detecting Faces in Images: A Survey. IEEE Trans Pattern Anal Mach Intell 24(1):34–58CrossRef
Zurück zum Zitat Zhou H, Yuan Y, Sadka AH (2008) Application of semantic features in face recognition. Pattern Recogn 41(10):3251–3256MATHCrossRef Zhou H, Yuan Y, Sadka AH (2008) Application of semantic features in face recognition. Pattern Recogn 41(10):3251–3256MATHCrossRef
Zurück zum Zitat Zuo F, de With PH (2008) Facial feature extraction by a cascade of model-based algorithms. Signal Process Image Commun 23(3):194–211CrossRef Zuo F, de With PH (2008) Facial feature extraction by a cascade of model-based algorithms. Signal Process Image Commun 23(3):194–211CrossRef
Metadaten
Titel
Automatic localization and annotation of facial features using machine learning techniques
verfasst von
Paul C. Conilione
Dianhui Wang
Publikationsdatum
01.06.2011
Verlag
Springer-Verlag
Erschienen in
Soft Computing / Ausgabe 6/2011
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-010-0586-y

Weitere Artikel der Ausgabe 6/2011

Soft Computing 6/2011 Zur Ausgabe