Skip to main content

2013 | OriginalPaper | Buchkapitel

Partitioned K-Means Clustering for Fast Construction of Unbiased Visual Vocabulary

verfasst von : Shikui Wei, Xinxiao Wu, Dong Xu

Erschienen in: The Era of Interactive Media

Verlag: Springer New York

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search area, in which a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform a flat or hierarchical clustering operation using a very large training set in their original description space. However, these methods usually suffer from two issues: (1) A large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) The generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Sequentially, we can build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat A. Gersho and R. Gray. Vector quantization and signal compression. Kulwer Academic Publishers, 1992. A. Gersho and R. Gray. Vector quantization and signal compression. Kulwer Academic Publishers, 1992.
4.
Zurück zum Zitat H. Jegou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. IJCV, pages 1–21. H. Jegou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. IJCV, pages 1–21.
5.
Zurück zum Zitat H. Jegou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In ECCV, pages 304–317, 2008. H. Jegou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In ECCV, pages 304–317, 2008.
6.
Zurück zum Zitat H. Jégou, M. Douze, and C. Schmid. Product quantization for nearest neighbor search. T-PAMI, pages 117–128, 2010. H. Jégou, M. Douze, and C. Schmid. Product quantization for nearest neighbor search. T-PAMI, pages 117–128, 2010.
7.
Zurück zum Zitat G. Motta, F. Rizzo, and J. Storer. Partitioned vector quantization: application to lossless compression of hyperspectral images. In ICASSP, pages 241–244. G. Motta, F. Rizzo, and J. Storer. Partitioned vector quantization: application to lossless compression of hyperspectral images. In ICASSP, pages 241–244.
8.
Zurück zum Zitat Y. Mu, J. Sun, T. Han, L. Cheong, and S. Yan. Randomized locality sensitive vocabularies for bag-of-features model. ECCV, pages 748–761, 2010. Y. Mu, J. Sun, T. Han, L. Cheong, and S. Yan. Randomized locality sensitive vocabularies for bag-of-features model. ECCV, pages 748–761, 2010.
9.
Zurück zum Zitat D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In CVPR, volume 5, 2006. D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In CVPR, volume 5, 2006.
10.
Zurück zum Zitat E. Nowak, F. Jurie, and B. Triggs. Sampling strategies for bag-of-features image classification. In LNCS, volume 3954, pages 490–503, 2006. E. Nowak, F. Jurie, and B. Triggs. Sampling strategies for bag-of-features image classification. In LNCS, volume 3954, pages 490–503, 2006.
11.
Zurück zum Zitat J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR, pages 1575–1589, 2007. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR, pages 1575–1589, 2007.
12.
Zurück zum Zitat J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In ICCV, volume 2, pages 1470–1477, 2003. J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In ICCV, volume 2, pages 1470–1477, 2003.
13.
Zurück zum Zitat S. Zhang, Q. Huang, G. Hua, S. Jiang, W. Gao, and Q. Tian. Building contextual visual vocabulary for large-scale image applications. In Proc. ACM Multimedia. S. Zhang, Q. Huang, G. Hua, S. Jiang, W. Gao, and Q. Tian. Building contextual visual vocabulary for large-scale image applications. In Proc. ACM Multimedia.
Metadaten
Titel
Partitioned K-Means Clustering for Fast Construction of Unbiased Visual Vocabulary
verfasst von
Shikui Wei
Xinxiao Wu
Dong Xu
Copyright-Jahr
2013
Verlag
Springer New York
DOI
https://doi.org/10.1007/978-1-4614-3501-3_40

Neuer Inhalt