2018 | OriginalPaper | Book chapter

Hierarchical Tree Representation Based Face Clustering for Video Retrieval

Authors: Pengyi Hao, Edwin Manhando, Cong Bai, Yujiao Huang

Published in: Advances in Multimedia Information Processing – PCM 2017

Publisher: Springer International Publishing


Abstract

We represent a video as a set of people, where each person is a sequence of faces clustered by the proposed hierarchical tree representation. The goal is to find all occurrences of a person in the video without the help of any textual information. In the proposed method, faces in a video are first detected and tracked to form face-tracks, and each face-track is associated with one person. By leveraging temporal constraints, face-tracks that depict the same person in a video are connected. We then build undirected graphs for the video and extend discriminative histogram intersection metric learning to generate semantic distances, which are used to cut the undirected graphs into face clusters without predefining the number of clusters. When searching for videos that contain the queried person, it is effective to compare the faces of the query video with the sets of people summarized from the videos in the dataset. Experimental results show that the proposed face clustering can improve the mean Average Precision of video retrieval and decrease the query time compared with several state-of-the-art approaches.
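The graph-cutting step described in the abstract can be illustrated with a minimal sketch. This is not the authors' method: it uses the plain (unlearned) histogram intersection distance in place of the learned semantic distance, treats each face-track as a single hypothetical feature histogram, and cuts the graph with a hand-picked `threshold`. All function names and values here are assumptions made for illustration only. The number of clusters is not fixed in advance; it falls out of which edges survive the cut.

```python
from typing import Dict, List, Sequence


def histogram_intersection_distance(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Return 1 - intersection of two histograms after L1 normalisation."""
    s1, s2 = sum(h1), sum(h2)
    intersection = sum(min(a / s1, b / s2) for a, b in zip(h1, h2))
    return 1.0 - intersection


def cluster_by_graph_cut(histograms: List[Sequence[float]], threshold: float) -> List[List[int]]:
    """Group face-tracks by removing graph edges longer than `threshold`.

    Every pair of tracks is an edge of an undirected graph. Edges whose
    distance exceeds the (assumed, hand-picked) threshold are cut, and the
    remaining connected components are returned as clusters.
    """
    n = len(histograms)
    parent = list(range(n))  # union-find forest over track indices

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            d = histogram_intersection_distance(histograms[i], histograms[j])
            if d <= threshold:  # edge survives the cut: merge components
                parent[find(i)] = find(j)

    clusters: Dict[int, List[int]] = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy data: four hypothetical face-track histograms, two per person.
tracks = [[8, 1, 1], [7, 2, 1], [1, 1, 8], [1, 2, 7]]
print(cluster_by_graph_cut(tracks, threshold=0.3))
```

With a threshold of 0.3, the toy tracks split into two groups, while a threshold of 0 leaves every track in its own cluster. In the chapter, the distances come from a learned metric rather than a fixed formula, so a real system would substitute that metric for `histogram_intersection_distance`.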

Metadata
Title
Hierarchical Tree Representation Based Face Clustering for Video Retrieval
Authors
Pengyi Hao
Edwin Manhando
Cong Bai
Yujiao Huang
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-77383-4_34
