Skip to main content
Top
Published in: International Journal of Multimedia Information Retrieval 3/2019

16-11-2018 | Short Paper

DHFML: deep heterogeneous feature metric learning for matching photograph and cartoon pairs

Author: Anand Mishra

Published in: International Journal of Multimedia Information Retrieval | Issue 3/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We study the problem of retrieving cartoon faces of celebrities given their real face as a query. We refer to this problem as Photo2Cartoon. The Photo2Cartoon problem is challenging since (i) cartoons vary excessively in style and (ii) modality gap between real and cartoon faces is large. To address these challenges, we present a discriminative deep metric learning approach designed for matching cross-modal faces and showcase Photo2Cartoon. The proposed approach learns a nonlinear transformation to project real and cartoon face pairs into a common subspace where distance between positive pairs becomes smaller as compared to distance between negative pairs. We evaluate our method on two public benchmarks, namely IIIT-CFW and Viewed Sketch, and show superior retrieval results as compared to related methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Cartoon is typically non-realistic or semi-realistic artistic style of drawing or painting, or an image or series of images intended for satire, caricature or humor [1].
 
Literature
2.
3.
go back to reference Cao Q, Shen L, Xie W, Parkhi O.M, Zisserman A (2018) VGGFace2: a dataset for recognising faces across pose and age. In: FG Cao Q, Shen L, Xie W, Parkhi O.M, Zisserman A (2018) VGGFace2: a dataset for recognising faces across pose and age. In: FG
4.
go back to reference Crowley EJ, Parkhi OM, Zisserman A (2015) Face painting: querying art with photos. In: BMVC Crowley EJ, Parkhi OM, Zisserman A (2015) Face painting: querying art with photos. In: BMVC
6.
go back to reference Goldberger J, Hinton GE, Roweis ST, Salakhutdinov RR (2005) Neighbourhood components analysis. In: NIPS Goldberger J, Hinton GE, Roweis ST, Salakhutdinov RR (2005) Neighbourhood components analysis. In: NIPS
7.
go back to reference Härdle WK, Simar L (2015) Canonical correlation analysis. In: Applied multivariate statistical analysis. Springer, pp 443–454 Härdle WK, Simar L (2015) Canonical correlation analysis. In: Applied multivariate statistical analysis. Springer, pp 443–454
8.
go back to reference Hu P, Ramanan D (2017) Finding tiny faces. In: CVPR Hu P, Ramanan D (2017) Finding tiny faces. In: CVPR
9.
go back to reference Huang D, Wang Y.F (2013) Coupled dictionary and feature space learning with applications to cross-domain image synthesis and recognition. In: ICCV Huang D, Wang Y.F (2013) Coupled dictionary and feature space learning with applications to cross-domain image synthesis and recognition. In: ICCV
10.
go back to reference Huang GB, Ramesh M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report 07-49, University of Massachusetts, Amherst Huang GB, Ramesh M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report 07-49, University of Massachusetts, Amherst
12.
go back to reference Huo J, Gao Y, Shi Y, Yang W, Yin H (2016) Ensemble of sparse cross-modal metrics for heterogeneous face recognition. In: ACM-MM Huo J, Gao Y, Shi Y, Yang W, Yin H (2016) Ensemble of sparse cross-modal metrics for heterogeneous face recognition. In: ACM-MM
13.
go back to reference Kang C, Liao S, He Y, Wang J, Niu W, Xiang S, Pan C (2015) Cross-modal similarity learning: a low rank bilinear formulation. In: CIKM Kang C, Liao S, He Y, Wang J, Niu W, Xiang S, Pan C (2015) Cross-modal similarity learning: a low rank bilinear formulation. In: CIKM
14.
go back to reference Kemelmacher-Shlizerman I, Seitz SM, Miller D, Brossard E (2016) The megaface benchmark: 1 million faces for recognition at scale. In: CVPR Kemelmacher-Shlizerman I, Seitz SM, Miller D, Brossard E (2016) The megaface benchmark: 1 million faces for recognition at scale. In: CVPR
15.
go back to reference Klare B, Jain AK (2013) Heterogeneous face recognition using kernel prototype similarities. IEEE Trans Pattern Anal Mach Intell 35(6):1410–1422CrossRef Klare B, Jain AK (2013) Heterogeneous face recognition using kernel prototype similarities. IEEE Trans Pattern Anal Mach Intell 35(6):1410–1422CrossRef
16.
go back to reference Koch G, Zemel R, Salakhutdinov R (2015) Siamese neural networks for one-shot image recognition. In: ICML deep learning workshop, vol 2 Koch G, Zemel R, Salakhutdinov R (2015) Siamese neural networks for one-shot image recognition. In: ICML deep learning workshop, vol 2
17.
go back to reference Kumar N, Berg AC, Belhumeur PN, Nayar SK (2009) Attribute and simile classifiers for face verification. In: ICCV Kumar N, Berg AC, Belhumeur PN, Nayar SK (2009) Attribute and simile classifiers for face verification. In: ICCV
18.
go back to reference Liong VE, Lu J, Tan YP, Zhou J (2017) Deep coupled metric learning for cross-modal matching. IEEE Trans Multimed 19(6):1234–1244CrossRef Liong VE, Lu J, Tan YP, Zhou J (2017) Deep coupled metric learning for cross-modal matching. IEEE Trans Multimed 19(6):1234–1244CrossRef
19.
go back to reference Martinez AM (1998) The AR face database. CVC technical report Martinez AM (1998) The AR face database. CVC technical report
20.
go back to reference Mauro R, Kubovy M (1992) Caricature and face recognition. Mem Cogn 20(4):433–440CrossRef Mauro R, Kubovy M (1992) Caricature and face recognition. Mem Cogn 20(4):433–440CrossRef
21.
go back to reference Messer K, Matas J, Kittler J, Luettin J, Maitre G (1999) XM2VTSDB: the extended M2VTS database. In: Second international conference on audio and video-based biometric person authentication Messer K, Matas J, Kittler J, Luettin J, Maitre G (1999) XM2VTSDB: the extended M2VTS database. In: Second international conference on audio and video-based biometric person authentication
22.
go back to reference Mignon A, Jurie F (2012) CMML: a new metric learning approach for cross modal matching. In: ACCV Mignon A, Jurie F (2012) CMML: a new metric learning approach for cross modal matching. In: ACCV
23.
go back to reference Mishra A, Nandan Rai S, Mishra A, Jawahar C.V (2016) IIIT-CFW: a benchmark database of cartoon faces in the wild. In: VASE ECCVW Mishra A, Nandan Rai S, Mishra A, Jawahar C.V (2016) IIIT-CFW: a benchmark database of cartoon faces in the wild. In: VASE ECCVW
24.
go back to reference Ouyang S, Hospedales TM, Song Y, Li X (2014) Cross-modal face matching: beyond viewed sketches. In: ACCV Ouyang S, Hospedales TM, Song Y, Li X (2014) Cross-modal face matching: beyond viewed sketches. In: ACCV
25.
go back to reference Parkhi OM, Vedaldi A, Zisserman A (2015) Deep face recognition. In: BMVC Parkhi OM, Vedaldi A, Zisserman A (2015) Deep face recognition. In: BMVC
26.
go back to reference Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: a unified embedding for face recognition and clustering. In: CVPR Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: a unified embedding for face recognition and clustering. In: CVPR
27.
go back to reference Simonyan K, Parkhi OM, Vedaldi A, Zisserman A (2013) Fisher vector faces in the wild. In: BMVC Simonyan K, Parkhi OM, Vedaldi A, Zisserman A (2013) Fisher vector faces in the wild. In: BMVC
28.
29.
go back to reference Song HO, Xiang Y, Jegelka S, Savarese S (2016) Deep metric learning via lifted structured feature embedding. In: CVPR Song HO, Xiang Y, Jegelka S, Savarese S (2016) Deep metric learning via lifted structured feature embedding. In: CVPR
30.
go back to reference Sugiyama M (2006) Local fisher discriminant analysis for supervised dimensionality reduction. In: ICML Sugiyama M (2006) Local fisher discriminant analysis for supervised dimensionality reduction. In: ICML
31.
go back to reference Wang X, Tang X (2009) Face photo-sketch synthesis and recognition. IEEE Trans Pattern Anal Mach Intell 31(11):1955–1967CrossRef Wang X, Tang X (2009) Face photo-sketch synthesis and recognition. IEEE Trans Pattern Anal Mach Intell 31(11):1955–1967CrossRef
32.
go back to reference Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10:207–244MATH Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10:207–244MATH
33.
go back to reference Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett 23(10):1499–1503CrossRef Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett 23(10):1499–1503CrossRef
Metadata
Title
DHFML: deep heterogeneous feature metric learning for matching photograph and cartoon pairs
Author
Anand Mishra
Publication date
16-11-2018
Publisher
Springer London
Published in
International Journal of Multimedia Information Retrieval / Issue 3/2019
Print ISSN: 2192-6611
Electronic ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-018-0160-4

Other articles of this Issue 3/2019

International Journal of Multimedia Information Retrieval 3/2019 Go to the issue

Premium Partner