Skip to main content
Erschienen in: Pattern Analysis and Applications 3/2018

06.04.2018 | Original Article

Analysis of single- and dual-dictionary strategies in pedestrian classification

verfasst von: V. Javier Traver, Carlos Serra-Toro

Erschienen in: Pattern Analysis and Applications | Ausgabe 3/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sparse coding has recently been a hot topic in visual tasks in image processing and computer vision. It has applications and brings benefits in reconstruction-like tasks and in classification-like tasks as well. However, regarding binary classification problems, there are several choices to learn and use dictionaries that have not been studied. In particular, how single-dictionary and dual-dictionary approaches compare in terms of classification performance is largely unexplored. We compare three single-dictionary strategies and two dual-dictionary strategies for the problem of pedestrian classification (“pedestrian” vs “background” images). In each of these five cases, images are represented as the sparse coefficients induced from the respective dictionaries, and these coefficients are the input to a regular classifier both for training and subsequent classification of novel unseen instances. Experimental results with the INRIA pedestrian dataset suggest, on the one hand, that dictionaries learned from only one of the classes, even from the background class, are enough for obtaining competitive good classification performance. On the other hand, while better performance is generally obtained when instances of both classes are used for dictionary learning, the representation induced by a single dictionary learned from a set of instances from both classes provides comparable or even superior performance over the representations induced by two dictionaries learned separately from the pedestrian and background classes.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alfaro A, Mery D, Soto A (2016) Action recognition in video using sparse coding and relative features. In: Computer vision and pattern recognition (CVPR), pp 2688–2697 Alfaro A, Mery D, Soto A (2016) Action recognition in video using sparse coding and relative features. In: Computer vision and pattern recognition (CVPR), pp 2688–2697
3.
Zurück zum Zitat Bryt O, Elad M (2008) Compression of facial images using the K-SVD algorithm. J Vis Commun Image Represent 19(4):270–282CrossRef Bryt O, Elad M (2008) Compression of facial images using the K-SVD algorithm. J Vis Commun Image Represent 19(4):270–282CrossRef
4.
Zurück zum Zitat Castrodad A, Sapiro G (2012) Sparse modeling of human actions from motion imagery. Int J Comput Vis (IJCV) 100(1):1–15CrossRef Castrodad A, Sapiro G (2012) Sparse modeling of human actions from motion imagery. Int J Comput Vis (IJCV) 100(1):1–15CrossRef
5.
Zurück zum Zitat Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer vision and pattern recognition (CVPR) Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer vision and pattern recognition (CVPR)
6.
Zurück zum Zitat Deng W, Hu J, Guo J (2012) Extended SRC: undersampled face recognition via intraclass variant dictionary. IEEE Trans Pattern Anal Mach Intell (PAMI) 34(9):1864–1870CrossRef Deng W, Hu J, Guo J (2012) Extended SRC: undersampled face recognition via intraclass variant dictionary. IEEE Trans Pattern Anal Mach Intell (PAMI) 34(9):1864–1870CrossRef
7.
Zurück zum Zitat Deng W, Hu J, Guo J (2013) In defense of sparsity based face recognition. In: Computer vision and pattern recognition (CVPR) Deng W, Hu J, Guo J (2013) In defense of sparsity based face recognition. In: Computer vision and pattern recognition (CVPR)
8.
Zurück zum Zitat Elad M (2010) Sparse and redundant representations: from theory to applications in signal and image processing. Springer, BerlinCrossRefMATH Elad M (2010) Sparse and redundant representations: from theory to applications in signal and image processing. Springer, BerlinCrossRefMATH
9.
Zurück zum Zitat Elad M, Aharon M (2006) Image denoising via learned dictionaries and sparse representation. In: Computer vision and pattern recognition (CVPR) Elad M, Aharon M (2006) Image denoising via learned dictionaries and sparse representation. In: Computer vision and pattern recognition (CVPR)
10.
Zurück zum Zitat Fadili MJ, Starck JL, Murtagh F (2009) Inpainting and zooming using sparse representations. Comput J 52:64–79CrossRef Fadili MJ, Starck JL, Murtagh F (2009) Inpainting and zooming using sparse representations. Comput J 52:64–79CrossRef
11.
Zurück zum Zitat Gao Y, Ma J, Yuille AL (2017) Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. IEEE Trans Image Process 26(5):2545–2560MathSciNetCrossRef Gao Y, Ma J, Yuille AL (2017) Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. IEEE Trans Image Process 26(5):2545–2560MathSciNetCrossRef
12.
Zurück zum Zitat Hawe S, Seibert M, Kleinsteuber M (2013) Separable dictionary learning. In: Computer vision and pattern recognition (CVPR), pp 438–445 Hawe S, Seibert M, Kleinsteuber M (2013) Separable dictionary learning. In: Computer vision and pattern recognition (CVPR), pp 438–445
13.
Zurück zum Zitat Howse J, Joshi P, Beyeler M (2016) OpenCV: Computer Vision Projects with Python. Packt Howse J, Joshi P, Beyeler M (2016) OpenCV: Computer Vision Projects with Python. Packt
14.
Zurück zum Zitat Hsieh SH, Lu CS, Pei SC (2014) 2D sparse dictionary learning via tensor decomposition. In: IEEE global conference on signal and information processing (GlobalSIP), pp 492–496 Hsieh SH, Lu CS, Pei SC (2014) 2D sparse dictionary learning via tensor decomposition. In: IEEE global conference on signal and information processing (GlobalSIP), pp 492–496
15.
Zurück zum Zitat Hunter JD (2007) Matplotlib: a 2D graphics environment. Comput Sci Eng 9(3):90–95CrossRef Hunter JD (2007) Matplotlib: a 2D graphics environment. Comput Sci Eng 9(3):90–95CrossRef
16.
Zurück zum Zitat Jiang Z, Lin Z, Davis LS (2013) Label consistent K-SVD: learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell (PAMI) 35(11):2651–2664CrossRef Jiang Z, Lin Z, Davis LS (2013) Label consistent K-SVD: learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell (PAMI) 35(11):2651–2664CrossRef
17.
Zurück zum Zitat Krishna Vinay G, Haque SM, Venkatesh Babu R, Ramakrishnan K (2012) Human detection using sparse representation. In: IEEE international conference on acoustics, speech and signal processing (ICASSP) Krishna Vinay G, Haque SM, Venkatesh Babu R, Ramakrishnan K (2012) Human detection using sparse representation. In: IEEE international conference on acoustics, speech and signal processing (ICASSP)
18.
Zurück zum Zitat Liang F, Tang S, Zhang Y, Xu Z, Li J (2014) Pedestrian detection based on sparse coding and transfer learning. Mach Vis Appl (MVA) 25(7):1697–1709CrossRef Liang F, Tang S, Zhang Y, Xu Z, Li J (2014) Pedestrian detection based on sparse coding and transfer learning. Mach Vis Appl (MVA) 25(7):1697–1709CrossRef
19.
Zurück zum Zitat Liu W, Tao D, Cheng J, Tang Y (2014) Multiview Hessian discriminative sparse coding for image annotation. Comput Vis Image Underst (CVIU) 118(Supplement C):50–60CrossRef Liu W, Tao D, Cheng J, Tang Y (2014) Multiview Hessian discriminative sparse coding for image annotation. Comput Vis Image Underst (CVIU) 118(Supplement C):50–60CrossRef
20.
Zurück zum Zitat Liu W, Liu H, Tao D, Wang Y, Lu K (2015) Multiview Hessian regularized logistic regression for action recognition. Sig Process 110:101–107CrossRef Liu W, Liu H, Tao D, Wang Y, Lu K (2015) Multiview Hessian regularized logistic regression for action recognition. Sig Process 110:101–107CrossRef
21.
Zurück zum Zitat Liu W, Zha ZJ, Wang Y, Lu K, Tao D (2016) \(p\)-Laplacian regularized sparse coding for human activity recognition. IEEE Trans Ind Electron 63(8):5120–5129 Liu W, Zha ZJ, Wang Y, Lu K, Tao D (2016) \(p\)-Laplacian regularized sparse coding for human activity recognition. IEEE Trans Ind Electron 63(8):5120–5129
22.
Zurück zum Zitat Liu Y, Lasang P, Siegel M, Sun Q (2016) Multi-sparse descriptor: a scale invariant feature for pedestrian detection. Neurocomputing 184:55–65CrossRef Liu Y, Lasang P, Siegel M, Sun Q (2016) Multi-sparse descriptor: a scale invariant feature for pedestrian detection. Neurocomputing 184:55–65CrossRef
24.
25.
Zurück zum Zitat Mairal J, Bach F, Ponce J, Sapiro G (2009) Online dictionary learning for sparse coding. In: International conference on machine learning (ICML) Mairal J, Bach F, Ponce J, Sapiro G (2009) Online dictionary learning for sparse coding. In: International conference on machine learning (ICML)
26.
Zurück zum Zitat Mairal J, Bach F, Ponce J, Sapiro G (2010) Online learning for matrix factorization and sparse coding. J Mach Learn Res 11:19–60MathSciNetMATH Mairal J, Bach F, Ponce J, Sapiro G (2010) Online learning for matrix factorization and sparse coding. J Mach Learn Res 11:19–60MathSciNetMATH
27.
Zurück zum Zitat Mairal J, Bach F, Ponce J (2012) Task-driven dictionary learning. IEEE Trans Pattern Anal Mach Intell (PAMI) 34(4):791–804CrossRef Mairal J, Bach F, Ponce J (2012) Task-driven dictionary learning. IEEE Trans Pattern Anal Mach Intell (PAMI) 34(4):791–804CrossRef
28.
Zurück zum Zitat Mairal J, Bach F, Ponce J (2014) Sparse modeling for image and vision processing. Found Trends Comput Graph Vis 8(2–3):85–283CrossRefMATH Mairal J, Bach F, Ponce J (2014) Sparse modeling for image and vision processing. Found Trends Comput Graph Vis 8(2–3):85–283CrossRefMATH
29.
Zurück zum Zitat Mallat S, Zhang Z (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415CrossRefMATH Mallat S, Zhang Z (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415CrossRefMATH
30.
Zurück zum Zitat Matthews BW (1975) Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta (BBA) Protein Struct 405(2):442–451CrossRef Matthews BW (1975) Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta (BBA) Protein Struct 405(2):442–451CrossRef
31.
Zurück zum Zitat Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830MathSciNetMATH
32.
Zurück zum Zitat Ren X, Ramanan D (2013) Histograms of sparse codes for object detection. In: Computer vision and pattern recognition (CVPR) Ren X, Ramanan D (2013) Histograms of sparse codes for object detection. In: Computer vision and pattern recognition (CVPR)
33.
Zurück zum Zitat Rigamonti R, Brown M, Lepetit V (2011) Are sparse representations really relevant for image classification? In: Computer vision and pattern recognition (CVPR) Rigamonti R, Brown M, Lepetit V (2011) Are sparse representations really relevant for image classification? In: Computer vision and pattern recognition (CVPR)
34.
Zurück zum Zitat Rubinstein R, Zibulevsky M, Elad M (2010) Double sparsity: learning sparse dictionaries for sparse signal approximation. IEEE Trans Signal Process 58(3):1553–1564MathSciNetCrossRefMATH Rubinstein R, Zibulevsky M, Elad M (2010) Double sparsity: learning sparse dictionaries for sparse signal approximation. IEEE Trans Signal Process 58(3):1553–1564MathSciNetCrossRefMATH
35.
Zurück zum Zitat Sahay A (2016) Data visualization, vol I. Business Expert Press, New York Sahay A (2016) Data visualization, vol I. Business Expert Press, New York
36.
Zurück zum Zitat Serra-Toro C, Hernández-Górriz Á, Traver VJ (2017) Strategies of dictionary usages for sparse representations for pedestrian classification. Pattern Recogn Image Anal IbPRIA 2017:96–103MathSciNetCrossRef Serra-Toro C, Hernández-Górriz Á, Traver VJ (2017) Strategies of dictionary usages for sparse representations for pedestrian classification. Pattern Recogn Image Anal IbPRIA 2017:96–103MathSciNetCrossRef
37.
Zurück zum Zitat Shekhar S, Patel VM, Nguyen HV, Chellappa R (2015) Coupled projections for adaptation of dictionaries. IEEE Trans Image Process 24(10):2941–2954MathSciNetCrossRef Shekhar S, Patel VM, Nguyen HV, Chellappa R (2015) Coupled projections for adaptation of dictionaries. IEEE Trans Image Process 24(10):2941–2954MathSciNetCrossRef
38.
Zurück zum Zitat Shi Q, Eriksson A, van den Hengel A, Shen C (2011) Is face recognition really a compressive sensing problem? In: Computer vision and pattern recognition (CVPR) Shi Q, Eriksson A, van den Hengel A, Shen C (2011) Is face recognition really a compressive sensing problem? In: Computer vision and pattern recognition (CVPR)
40.
Zurück zum Zitat Sironi A, Tekin B, Rigamonti R, Lepetit V, Fua P (2015) Learning separable filters. IEEE Trans Pattern Anal Mach Intell (PAMI) 37(1):94–106CrossRef Sironi A, Tekin B, Rigamonti R, Lepetit V, Fua P (2015) Learning separable filters. IEEE Trans Pattern Anal Mach Intell (PAMI) 37(1):94–106CrossRef
41.
Zurück zum Zitat Sivalingam R, Somasundaram G, Morellas V, Papanikolopoulos N, Lotfallah OA, Park Y (2010) Dictionary learning based object detection and counting in traffic scenes. In: International conference on distributed smart cameras Sivalingam R, Somasundaram G, Morellas V, Papanikolopoulos N, Lotfallah OA, Park Y (2010) Dictionary learning based object detection and counting in traffic scenes. In: International conference on distributed smart cameras
42.
Zurück zum Zitat Spratling MW (2014) Classification using sparse representations: a biologically plausible approach. Biol Cybern 108(1):61–73MathSciNetCrossRef Spratling MW (2014) Classification using sparse representations: a biologically plausible approach. Biol Cybern 108(1):61–73MathSciNetCrossRef
43.
Zurück zum Zitat Sulam J, Ophir B, Zibulevsky M, Elad M (2016) Trainlets: dictionary learning in high dimensions. IEEE Trans Signal Process 64(12):3180–3193MathSciNetCrossRef Sulam J, Ophir B, Zibulevsky M, Elad M (2016) Trainlets: dictionary learning in high dimensions. IEEE Trans Signal Process 64(12):3180–3193MathSciNetCrossRef
44.
Zurück zum Zitat Sun R, Zhang G, Yan X, Gao J (2016) Robust pedestrian classification based on hierarchical kernel sparse representation. Sensors 16(8):1296CrossRef Sun R, Zhang G, Yan X, Gao J (2016) Robust pedestrian classification based on hierarchical kernel sparse representation. Sensors 16(8):1296CrossRef
45.
Zurück zum Zitat Wang W, Yan Y, Zhang L, Hong R, Sebe N (2016) Collaborative sparse coding for multiview action recognition. IEEE Multimedia 23(4):80–87CrossRef Wang W, Yan Y, Zhang L, Hong R, Sebe N (2016) Collaborative sparse coding for multiview action recognition. IEEE Multimedia 23(4):80–87CrossRef
46.
Zurück zum Zitat Wilcoxon F (1945) Individual comparisons by ranking methods. Biom Bull 1(6):80–83CrossRef Wilcoxon F (1945) Individual comparisons by ranking methods. Biom Bull 1(6):80–83CrossRef
47.
Zurück zum Zitat Wright J et al (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell (PAMI) 31(2):210–227CrossRef Wright J et al (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell (PAMI) 31(2):210–227CrossRef
48.
Zurück zum Zitat Wright J et al (2010) Sparse representation for computer vision and pattern recognition. Proc IEEE 98(6):1031–1044CrossRef Wright J et al (2010) Sparse representation for computer vision and pattern recognition. Proc IEEE 98(6):1031–1044CrossRef
49.
Zurück zum Zitat Xie YF, Su SZ, Li SZ (2010) A pedestrian classification method based on transfer learning. In: 2010 International conference on image analysis and signal processing, pp 420–425 Xie YF, Su SZ, Li SZ (2010) A pedestrian classification method based on transfer learning. In: 2010 International conference on image analysis and signal processing, pp 420–425
50.
Zurück zum Zitat Xu R, Jiao J, Zhang B, Ye Q (2012) Pedestrian detection in images via cascaded \(L_1\)-norm minimization learning method. Pattern Recogn 45(7):2573–2583CrossRef Xu R, Jiao J, Zhang B, Ye Q (2012) Pedestrian detection in images via cascaded \(L_1\)-norm minimization learning method. Pattern Recogn 45(7):2573–2583CrossRef
51.
Zurück zum Zitat Yang J, Wright J, Huang TS, Ma Y (2010) Image super-resolution via sparse representation. IEEE Trans Image Process 19(11):2861–2873MathSciNetCrossRefMATH Yang J, Wright J, Huang TS, Ma Y (2010) Image super-resolution via sparse representation. IEEE Trans Image Process 19(11):2861–2873MathSciNetCrossRefMATH
52.
Zurück zum Zitat Yang M, Zhang L, Feng X, Zhang D (2011) Fisher discrimination dictionary learning for sparse representation. In: International conference on computer vision (ICCV), pp 543–550 Yang M, Zhang L, Feng X, Zhang D (2011) Fisher discrimination dictionary learning for sparse representation. In: International conference on computer vision (ICCV), pp 543–550
53.
Zurück zum Zitat Yao T, Wang Z, Xie Z, Gao J, Feng DD (2017) Learning universal multiview dictionary for human action recognition. Pattern Recogn 64:236–244CrossRef Yao T, Wang Z, Xie Z, Gao J, Feng DD (2017) Learning universal multiview dictionary for human action recognition. Pattern Recogn 64:236–244CrossRef
54.
Zurück zum Zitat Zhang L, Zhou WD, Chang PC, Liu J, Yan Z, Wang T, Li FZ (2012) Kernel sparse representation-based classifier. IEEE Trans Signal Process 60(4):1684–1695MathSciNetCrossRef Zhang L, Zhou WD, Chang PC, Liu J, Yan Z, Wang T, Li FZ (2012) Kernel sparse representation-based classifier. IEEE Trans Signal Process 60(4):1684–1695MathSciNetCrossRef
55.
Zurück zum Zitat Zheng J, Jiang Z, Chellappa R (2016) Cross-view action recognition via transferable dictionary learning. IEEE Trans Image Process 25(6):2542–2556MathSciNetCrossRef Zheng J, Jiang Z, Chellappa R (2016) Cross-view action recognition via transferable dictionary learning. IEEE Trans Image Process 25(6):2542–2556MathSciNetCrossRef
56.
Zurück zum Zitat Zheng M, Bu J, Chen C, Wang C, Zhang L, Qiu G, Cai D (2011) Graph regularized sparse coding for image representation. IEEE Trans Image Process 20(5):1327–1336MathSciNetCrossRefMATH Zheng M, Bu J, Chen C, Wang C, Zhang L, Qiu G, Cai D (2011) Graph regularized sparse coding for image representation. IEEE Trans Image Process 20(5):1327–1336MathSciNetCrossRefMATH
57.
Zurück zum Zitat Zheng M, Bu J, Chen C (2014) Hessian sparse coding. Neurocomputing 123:247–254CrossRef Zheng M, Bu J, Chen C (2014) Hessian sparse coding. Neurocomputing 123:247–254CrossRef
58.
Zurück zum Zitat Zhu Q, Yeh M, Cheng K, Avidan S (2006) Fast human detection using a cascade of histograms of oriented gradients. In: Computer vision and pattern recognition (CVPR), pp 1491–1498 Zhu Q, Yeh M, Cheng K, Avidan S (2006) Fast human detection using a cascade of histograms of oriented gradients. In: Computer vision and pattern recognition (CVPR), pp 1491–1498
59.
Zurück zum Zitat Zhu XX, Bamler R (2013) A sparse image fusion algorithm with application to pan-sharpening. IEEE Trans Geosci Remote Sens 51(5):2827–2836CrossRef Zhu XX, Bamler R (2013) A sparse image fusion algorithm with application to pan-sharpening. IEEE Trans Geosci Remote Sens 51(5):2827–2836CrossRef
Metadaten
Titel
Analysis of single- and dual-dictionary strategies in pedestrian classification
verfasst von
V. Javier Traver
Carlos Serra-Toro
Publikationsdatum
06.04.2018
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 3/2018
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-018-0704-5

Weitere Artikel der Ausgabe 3/2018

Pattern Analysis and Applications 3/2018 Zur Ausgabe