Skip to main content

2013 | OriginalPaper | Buchkapitel

3. k-NN Boosting Prototype Learning for Object Classification

verfasst von : Paolo Piro, Michel Barlaud, Richard Nock, Frank Nielsen

Erschienen in: Analysis, Retrieval and Delivery of Multimedia Content

Verlag: Springer New York

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Image classification is a challenging task in computer vision. For example fully understanding real-world images may involve both scene and object recognition. Many approaches have been proposed to extract meaningful descriptors from images and classifying them in a supervised learning framework. In this chapter, we revisit the classic k-nearest neighbors (k-NN) classification rule, which has shown to be very effective when dealing with local image descriptors. However, k-NN still features some major drawbacks, mainly due to the uniform voting among the nearest prototypes in the feature space. In this chapter, we propose a generalization of the classic k-NN rule in a supervised learning (boosting) framework. Namely, we redefine the voting rule as a strong classifier that linearly combines predictions from the k closest prototypes. In order to induce this classifier, we propose a novel learning algorithm, MLNN (Multiclass Leveraged Nearest Neighbors), which gives a simple procedure for performing prototype selection very efficiently. We tested our method first on object classification using 12 categories of objects, then on scene recognition as well, using 15 real-world categories. Experiments show significant improvement over classic k-NN in terms of classification performances.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
We call surrogate a function that upperbounds the risk functional we should minimize, and thus can be used as a primer for its minimization.
 
Literatur
2.
Zurück zum Zitat Bosch A, Zisserman A, Muñoz X (2008) Scene classification using a hybrid generative/discriminative approach. IEEE Trans Pattern Anal Mach Intell 30(4):712–727CrossRef Bosch A, Zisserman A, Muñoz X (2008) Scene classification using a hybrid generative/discriminative approach. IEEE Trans Pattern Anal Mach Intell 30(4):712–727CrossRef
3.
Zurück zum Zitat Brighton H, Mellish C (2002) Advances in instance selection for instance-based learning algorithms. Data Min Knowl Disc 6:153–172MathSciNetMATHCrossRef Brighton H, Mellish C (2002) Advances in instance selection for instance-based learning algorithms. Data Min Knowl Disc 6:153–172MathSciNetMATHCrossRef
4.
Zurück zum Zitat Cha SH (2008) Taxonomy of nominal type histogram distance measures. In: Proceedings of the American conference on applied mathematics, pp 325–330 Cha SH (2008) Taxonomy of nominal type histogram distance measures. In: Proceedings of the American conference on applied mathematics, pp 325–330
5.
Zurück zum Zitat Csurka G, Bray C, Dance C, Fan L (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision (ECCV), pp 1–22 Csurka G, Bray C, Dance C, Fan L (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision (ECCV), pp 1–22
6.
Zurück zum Zitat Davis J, Kulis B, Jain P, Sra S, Dhillon I (2007) Information-theoretic metric learning. In: International conference on machine learning (ICML), pp 209–216 Davis J, Kulis B, Jain P, Sra S, Dhillon I (2007) Information-theoretic metric learning. In: International conference on machine learning (ICML), pp 209–216
7.
Zurück zum Zitat Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to Boosting. J Comput Syst Sci 55(1):119–139 Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to Boosting. J Comput Syst Sci 55(1):119–139
8.
Zurück zum Zitat Garcia V, Debreuve E, Barlaud M (2008) Fast k nearest neighbor search using gpu. In: CVPR workshop on computer vision on GPU (CVGPU), Anchorage, Alaska, USA Garcia V, Debreuve E, Barlaud M (2008) Fast k nearest neighbor search using gpu. In: CVPR workshop on computer vision on GPU (CVGPU), Anchorage, Alaska, USA
9.
Zurück zum Zitat Hastie T, Tibshirani R (1996) Discriminant adaptive nearest neighbor classification. IEEE Trans Pattern Anal Mach Intell 18(6):(6):607–616 Hastie T, Tibshirani R (1996) Discriminant adaptive nearest neighbor classification. IEEE Trans Pattern Anal Mach Intell 18(6):(6):607–616
10.
Zurück zum Zitat Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: European conference on computer vision (ECCV), vol I, pp 304–317 Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: European conference on computer vision (ECCV), vol I, pp 304–317
14.
Zurück zum Zitat Nock R, Nielsen F (2009) Bregman divergences and surrogates for learning. IEEE Trans. Pattern Anal. Mach. Intell. 31:2048–2059CrossRef Nock R, Nielsen F (2009) Bregman divergences and surrogates for learning. IEEE Trans. Pattern Anal. Mach. Intell. 31:2048–2059CrossRef
15.
Zurück zum Zitat Perronnin F, Dance C (2007) Fisher kernels on visual vocabularies for image categorization. In: Computer vision and pattern recognition (CVPR), pp 1–8 Perronnin F, Dance C (2007) Fisher kernels on visual vocabularies for image categorization. In: Computer vision and pattern recognition (CVPR), pp 1–8
16.
Zurück zum Zitat Perronnin F, Sánchez J, Liu Y (2010) Large-scale image categorization with explicit data embedding. In: Computer vision and pattern recognition (CVPR), pp 2297–2304 Perronnin F, Sánchez J, Liu Y (2010) Large-scale image categorization with explicit data embedding. In: Computer vision and pattern recognition (CVPR), pp 2297–2304
17.
Zurück zum Zitat Piro P, Nock R, Nielsen F, Barlaud M (2009) Boosting k-NN for categorization of natural scenes. ArXiv:1001.1221 Piro P, Nock R, Nielsen F, Barlaud M (2009) Boosting k-NN for categorization of natural scenes. ArXiv:1001.1221
18.
Zurück zum Zitat Piro P, Nock R, Nielsen F, Barlaud M (2010) Leveraging \(k\)-NN for generic classification boosting. In: Proceedings of the 2010 IEEE international workshop on machine learning for signal Processing (MLSP) Piro P, Nock R, Nielsen F, Barlaud M (2010) Leveraging \(k\)-NN for generic classification boosting. In: Proceedings of the 2010 IEEE international workshop on machine learning for signal Processing (MLSP)
19.
Zurück zum Zitat Schapire RE, Singer Y (1999) Improved boosting algorithms using confidence-rated predictions. J Mach Learn 37:297–336MATHCrossRef Schapire RE, Singer Y (1999) Improved boosting algorithms using confidence-rated predictions. J Mach Learn 37:297–336MATHCrossRef
20.
22.
Zurück zum Zitat Zou H, Zhu J, Hastie T (2008) New multicategory boosting algorithms based on multicategory fisher-consistent losses. Ann Appl Stat 2(4):1290–1306MathSciNetMATHCrossRef Zou H, Zhu J, Hastie T (2008) New multicategory boosting algorithms based on multicategory fisher-consistent losses. Ann Appl Stat 2(4):1290–1306MathSciNetMATHCrossRef
23.
Zurück zum Zitat Zhang H, Berg A, Maire M, Malik J (2006) SVM-KNN: discriminative nearest neighbor classification for visual category recognition. In: Computer vision and pattern recognition (CVPR’06), pp 2126–2136 Zhang H, Berg A, Maire M, Malik J (2006) SVM-KNN: discriminative nearest neighbor classification for visual category recognition. In: Computer vision and pattern recognition (CVPR’06), pp 2126–2136
Metadaten
Titel
k-NN Boosting Prototype Learning for Object Classification
verfasst von
Paolo Piro
Michel Barlaud
Richard Nock
Frank Nielsen
Copyright-Jahr
2013
Verlag
Springer New York
DOI
https://doi.org/10.1007/978-1-4614-3831-1_3

Neuer Inhalt