Skip to main content
Top
Published in: International Journal of Multimedia Information Retrieval 2/2014

01-06-2014 | Regular Paper

Parallel incremental power mean SVM for the classification of large-scale image datasets

Authors: Thanh-Nghi Doan, Thanh-Nghi Do, François Poulet

Published in: International Journal of Multimedia Information Retrieval | Issue 2/2014

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The amount of image data becomes larger and larger, both image size (due the higher resolution) and image number. It is estimated for personal use only, an average single user will take 100,000 images during his life. The growth of image data is illustrated by the dataset size, for example ImageNet benchmark dataset is made of more than 14 million images and more than 21,000 classes. This is very challenging for classification algorithms. They have to deal with time and space complexity and very imbalanced data when using SVM algorithms. We present extensions of Power Mean SVM to deal with such data. The first one is an incremental version to deal with the space complexity, the second one is a parallel version of the incremental version to deal with time complexity and the last one is the use of a balanced bagging algorithm for training binary classifiers to deal with imbalanced data. We evaluate our parallel incremental version of balanced bagging PmSVM on the 1,000 classes of ImageNet (ILSVRC 2010). The results show that our algorithm can be run on standard PC (with eg. 2 or 4 GB RAM); it is 255 times faster than the original version and 1,276 times faster than state-of-the-art linear classifier, LIBLINEAR with 80 cores.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Chua TS, Tang J, Hong R, Li H, Luo Z, Zheng, YT (2009) Nus-wide: a real-world web image database from national university of Singapore. In: Proceedings of ACM Conference on Image and Video Retrieval (CIVR’09). Santorini, Greece Chua TS, Tang J, Hong R, Li H, Luo Z, Zheng, YT (2009) Nus-wide: a real-world web image database from national university of Singapore. In: Proceedings of ACM Conference on Image and Video Retrieval (CIVR’09). Santorini, Greece
3.
go back to reference Csurka G, Dance CR, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp 1–22 Csurka G, Dance CR, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp 1–22
4.
go back to reference Deng J, Berg AC, Li K, Li FF (2010) What does classifying more than 10, 000 image categories tell us? In: Daniilidis K, Maragos P, Paragios N (eds) ECCV, Part V. Lecture Notes in Computer Science, vol 6315. Springer pp 71–84 Deng J, Berg AC, Li K, Li FF (2010) What does classifying more than 10, 000 image categories tell us? In: Daniilidis K, Maragos P, Paragios N (eds) ECCV, Part V. Lecture Notes in Computer Science, vol 6315. Springer pp 71–84
5.
go back to reference Do TN, Nguyen VH, Poulet F (2008) Speed up SVM algorithm for massive classification tasks. In: Tang C, Ling CX, Zhou X, Cercone N, Li X (eds) ADMA. Lecture Notes in Computer Science, vol 5139. Springer Do TN, Nguyen VH, Poulet F (2008) Speed up SVM algorithm for massive classification tasks. In: Tang C, Ling CX, Zhou X, Cercone N, Li X (eds) ADMA. Lecture Notes in Computer Science, vol 5139. Springer
6.
go back to reference Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef
8.
go back to reference Guermeur Y (2007) SVM multiclasses, théorie et applications Guermeur Y (2007) SVM multiclasses, théorie et applications
9.
go back to reference Hsieh CJ, Chang KW, Lin CJ, Keerthi SS, Sundararajan S (2008) A dual coordinate descent method for large-scale linear SVM. In: International Conference on Machine Learning, pp 408–415 Hsieh CJ, Chang KW, Lin CJ, Keerthi SS, Sundararajan S (2008) A dual coordinate descent method for large-scale linear SVM. In: International Conference on Machine Learning, pp 408–415
11.
go back to reference Krebel UH-G (1999) Pairwise classification and support vector machines. In: Schölkopf B, Burges CJC, Smola AJ (eds) Advances in Kernel methods. MIT Press, Cambridge, pp 255–268 Krebel UH-G (1999) Pairwise classification and support vector machines. In: Schölkopf B, Burges CJC, Smola AJ (eds) Advances in Kernel methods. MIT Press, Cambridge, pp 255–268
12.
go back to reference Lenca P, Lallich S, Do TN, Pham NK (2008) A comparison of different off-centered entropies to deal with class imbalance for decision trees. In: The Pacific-Asia Conference on Knowledge Discovery and Data Mining, LNAI 5012. Springer, New York, pp 634–643 Lenca P, Lallich S, Do TN, Pham NK (2008) A comparison of different off-centered entropies to deal with class imbalance for decision trees. In: The Pacific-Asia Conference on Knowledge Discovery and Data Mining, LNAI 5012. Springer, New York, pp 634–643
13.
go back to reference Li FF, Fergus R, Perona P (2007) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70 Li FF, Fergus R, Perona P (2007) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70
14.
go back to reference Lin Y, Lv F, Zhu S, Yang M, Cour T, Yu K, Cao L, Huang TS (2011) Large-scale image classification: fast feature extraction and SVM training. In: CVPR. IEEE pp 1689–1696 Lin Y, Lv F, Zhu S, Yang M, Cour T, Yu K, Cao L, Huang TS (2011) Large-scale image classification: fast feature extraction and SVM training. In: CVPR. IEEE pp 1689–1696
18.
go back to reference Perronnin F, Sánchez J, Liu Y (2010) Large-scale image categorization with explicit data embedding. In: CVPR. IEEE, pp 2297–2304 Perronnin F, Sánchez J, Liu Y (2010) Large-scale image categorization with explicit data embedding. In: CVPR. IEEE, pp 2297–2304
19.
go back to reference Pham NK, Do TN, Lenca P, Lallich S (2008) Using local node information in decision trees: coupling a local decision rule with an off-centered entropy. In: International Conference on Data Mining. CSREA Press, Las Vegas, pp 117–123 Pham NK, Do TN, Lenca P, Lallich S (2008) Using local node information in decision trees: coupling a local decision rule with an off-centered entropy. In: International Conference on Data Mining. CSREA Press, Las Vegas, pp 117–123
20.
go back to reference Platt J, Cristianini N, Shawe-Taylor J (2000) Large margin dags for multiclass classification. Adv Neural Inf Process Syst 12:547–553 Platt J, Cristianini N, Shawe-Taylor J (2000) Large margin dags for multiclass classification. Adv Neural Inf Process Syst 12:547–553
22.
go back to reference Vedaldi A, Zisserman A (2012) Efficient additive kernels via explicit feature maps. IEEE Trans Pattern Anal Mach Intell 34(3):480–492CrossRef Vedaldi A, Zisserman A (2012) Efficient additive kernels via explicit feature maps. IEEE Trans Pattern Anal Mach Intell 34(3):480–492CrossRef
23.
go back to reference Visa S, Ralescu A (2005) Issues in mining imbalanced data sets—a review paper. In: Midwest Artificial Intelligence and Cognitive Science Conference. Dayton, USA pp 67–73 Visa S, Ralescu A (2005) Issues in mining imbalanced data sets—a review paper. In: Midwest Artificial Intelligence and Cognitive Science Conference. Dayton, USA pp 67–73
24.
go back to reference Weiss GM, Provost F (2003) Learning when training data are costly: the effect of class distribution on tree induction. J Artif Intell Res 19:315–354MATH Weiss GM, Provost F (2003) Learning when training data are costly: the effect of class distribution on tree induction. J Artif Intell Res 19:315–354MATH
25.
go back to reference Weston J, Watkins C (1999) Support vector machines for multi-class pattern recognition. In: Proceedings of the Seventh European Symposium on Artificial, Neural Networks, pp 219–224 Weston J, Watkins C (1999) Support vector machines for multi-class pattern recognition. In: Proceedings of the Seventh European Symposium on Artificial, Neural Networks, pp 219–224
26.
go back to reference Wu J (2010) A fast dual method for hik svm learning. In: Daniilidis K, Maragos P, Paragios N (eds) European Conference on Computer Vision, Lecture Notes in Computer Science. Springer, New York, vol 6312, pp 552–565 Wu J (2010) A fast dual method for hik svm learning. In: Daniilidis K, Maragos P, Paragios N (eds) European Conference on Computer Vision, Lecture Notes in Computer Science. Springer, New York, vol 6312, pp 552–565
27.
go back to reference Wu J (2012) Power mean svm for large scale visual classification. In: CVPR. IEEE pp 2344–2351 Wu J (2012) Power mean svm for large scale visual classification. In: CVPR. IEEE pp 2344–2351
28.
go back to reference Wu J, Tan WC, Rehg JM (2011) Efficient and effective visual codebook generation using additive kernels. J Mach Learn Res 12:3097–3118MATHMathSciNet Wu J, Tan WC, Rehg JM (2011) Efficient and effective visual codebook generation using additive kernels. J Mach Learn Res 12:3097–3118MATHMathSciNet
29.
go back to reference Yu HF, Hsieh CJ, Chang KW, Lin CJ (2012) Large linear classification when data cannot fit in memory. TKDD 5(4):23CrossRef Yu HF, Hsieh CJ, Chang KW, Lin CJ (2012) Large linear classification when data cannot fit in memory. TKDD 5(4):23CrossRef
30.
go back to reference Yuan GX, Ho CH, Lin CJ (2012) Recent advances of large-scale linear classification. Proc IEEE 100(9):2584–2603CrossRef Yuan GX, Ho CH, Lin CJ (2012) Recent advances of large-scale linear classification. Proc IEEE 100(9):2584–2603CrossRef
Metadata
Title
Parallel incremental power mean SVM for the classification of large-scale image datasets
Authors
Thanh-Nghi Doan
Thanh-Nghi Do
François Poulet
Publication date
01-06-2014
Publisher
Springer London
Published in
International Journal of Multimedia Information Retrieval / Issue 2/2014
Print ISSN: 2192-6611
Electronic ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-014-0053-0

Other articles of this Issue 2/2014

International Journal of Multimedia Information Retrieval 2/2014 Go to the issue

Premium Partner