Published in: The Journal of Supercomputing 2/2014

1 May 2014

Effective naive Bayes nearest neighbor based image classification on GPU

Authors: Lei Zhu, Hai Jin, Ran Zheng, Xiaowen Feng

Abstract

Naive Bayes nearest neighbor (NBNN) is a non-parametric classifier that requires no training phase, yet it outperforms many well-trained learning-based image classifiers. Despite its high accuracy, it carries a heavy computational burden from distance computations in the local feature space. This paper explores acceleration strategies from the perspectives of both algorithm design and software development. Our approach combines the space decomposition capability of product quantization (PQ) with the parallel acceleration capability of the underlying computational platform, the graphics processing unit (GPU). PQ is used to compress the indexed local features and prune the search space. The GPU relieves most of the computational burden by processing tasks in parallel. To achieve good parallel efficiency, a new sequential classification process is first designed and then decomposed into independent components with high parallelism. Effective parallelization techniques are then presented to make use of the available computational resources. A parallel heap array is built to accelerate feature quantization, and a distance table lookup is built to speed up feature search. Comparative experiments on the UIUC-Sport dataset demonstrate that our integrated solution significantly outperforms other implementations on a quad-core Intel Core i7 950 CPU and an NVIDIA GeForce GTX 460 GPU. A scalability experiment on the 80 Million Tiny Images database shows that our approach still performs well on a large-scale image database.
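To make the interplay of NBNN and PQ distance table lookup more concrete, the following is a minimal sequential sketch in plain Python. It is not the authors' GPU implementation: the function names (`pq_encode`, `distance_table`, `nbnn_classify`), the toy codebooks, and the 4-dimensional features are all assumptions chosen for illustration. The sketch assumes the standard NBNN decision rule (sum, over the query's local features, of the distance to the nearest indexed feature of each class, then pick the class with the smallest total) and the usual asymmetric PQ distance, where a per-query table of sub-vector-to-centroid distances replaces full distance computations.

```python
# Illustrative sketch only: sequential NBNN with product quantization (PQ)
# distance table lookup. All names and toy values are assumptions, not
# taken from the paper.

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def split(vec, m):
    """Split a vector into m equal-length sub-vectors."""
    step = len(vec) // m
    return [vec[i * step:(i + 1) * step] for i in range(m)]

def pq_encode(vec, codebooks):
    """Compress a vector to one centroid index per sub-space."""
    subs = split(vec, len(codebooks))
    return tuple(
        min(range(len(cb)), key=lambda k: sq_dist(sub, cb[k]))
        for sub, cb in zip(subs, codebooks)
    )

def distance_table(query, codebooks):
    """table[j][k] = squared distance from query sub-vector j to centroid k."""
    subs = split(query, len(codebooks))
    return [[sq_dist(sub, c) for c in cb] for sub, cb in zip(subs, codebooks)]

def approx_nn_dist(table, codes):
    """Approximate nearest-neighbor distance using table lookups only."""
    return min(sum(table[j][k] for j, k in enumerate(code)) for code in codes)

def nbnn_classify(query_features, class_codes, codebooks):
    """NBNN rule: choose the class with the smallest summed NN distance."""
    totals = {label: 0.0 for label in class_codes}
    for q in query_features:
        table = distance_table(q, codebooks)  # built once per query feature
        for label, codes in class_codes.items():
            totals[label] += approx_nn_dist(table, codes)
    return min(totals, key=totals.get)

# Toy setup (hypothetical): 2 sub-spaces, 2 centroids each, 4-D features.
codebooks = [
    [(0.0, 0.0), (1.0, 1.0)],
    [(0.0, 0.0), (1.0, 1.0)],
]
indexed = {
    "a": [(0.1, 0.0, 0.0, 0.1), (0.0, 0.2, 0.1, 0.0)],
    "b": [(0.9, 1.0, 1.0, 0.8)],
}
class_codes = {
    label: [pq_encode(f, codebooks) for f in feats]
    for label, feats in indexed.items()
}
query = [(0.2, 0.1, 0.0, 0.0), (0.1, 0.0, 0.3, 0.1)]
predicted = nbnn_classify(query, class_codes, codebooks)
print(predicted)
```

In this sketch the loops over query features and over classes are independent of one another, which is the kind of structure that lends itself to parallel execution; how the paper actually maps these steps onto GPU threads is not described in the abstract.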

Metadata
Title
Effective naive Bayes nearest neighbor based image classification on GPU
Authors
Lei Zhu
Hai Jin
Ran Zheng
Xiaowen Feng
Publication date
1 May 2014
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 2/2014
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-013-1068-7
