Skip to main content
Erschienen in: Neural Computing and Applications 8/2019

30.12.2017 | Original Article

Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach

verfasst von: Abdul Qayyum, Aamir Saeed Malik, Naufal M. Saad, Mahboob Iqbal, Mohd Faris Abdullah, Waqas Rasheed, Tuan A. B. Rashid Abdullah, Mohd Yaqoob Bin Jafaar

Erschienen in: Neural Computing and Applications | Ausgabe 8/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This work offers an approach to aerial image classification for use in remote sensing object recognition, image processing and computer vision. Sparse coding (SC) is used to classify unmanned-aerial-vehicle (UAV) and satellite images because SC representation can generalize a large dataset and improve the detection of distinctive features by reducing calculation time for feature matching and classification. Features from images are extracted based on the following descriptors: (a) Scale Invariant Feature Transform; (b) Histogram of Oriented Gradients; and (c) Local Binary Patterns. SC representation and local image features are combined to represent global features for classification. Features are deployed in a sparse model to store descriptor features using extant dictionaries such as (a) the Discrete Cosine Transform and (b) the Discrete Wavelet Transform. An additional two dictionaries are proposed as developed for the present work: (c) the Discrete Ridgelet Transform (DRT) and (d) the Discrete Tchebichef Transform. The DRT dictionary is constructed by using the Ricker wavelet function to generate finite Ridgelet transforms as basis elements for a hybrid dictionary. Different pooling methods have also been employed to convert sparse-coded features into a feature matrix. Various machine learning algorithms are then applied to the feature matrix to classify objects contained in UAV and satellite imagery data. Experimental results show that the SC model secured better accuracy rates for extracted discriminative features contained in remote sensing images. The authors concluded that the proposed SC technique and proposed dictionaries provided feasible solutions for image classification and object recognition.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Hu F, Xia G-S, Hu J, Zhang L (2015) Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sens 7(11):14680–14707CrossRef Hu F, Xia G-S, Hu J, Zhang L (2015) Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sens 7(11):14680–14707CrossRef
2.
Zurück zum Zitat Zhou G (2009) Near real-time orthorectification and mosaic of small UAV video flow for time-critical event response. IEEE Trans Geosci Remote Sens 47(3):739–747CrossRef Zhou G (2009) Near real-time orthorectification and mosaic of small UAV video flow for time-critical event response. IEEE Trans Geosci Remote Sens 47(3):739–747CrossRef
3.
Zurück zum Zitat Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRef Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRef
4.
Zurück zum Zitat Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7(1):11–32CrossRef Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7(1):11–32CrossRef
5.
Zurück zum Zitat Puissant A, Hirsch J, Weber C (2005) The utility of texture analysis to improve per-pixel classification for high to very high spatial resolution imagery. Int J Remote Sens 26(4):733–745CrossRef Puissant A, Hirsch J, Weber C (2005) The utility of texture analysis to improve per-pixel classification for high to very high spatial resolution imagery. Int J Remote Sens 26(4):733–745CrossRef
6.
Zurück zum Zitat Yang Y, Newsam S (2013) Geographic Image retrieval using local invariant features. IEEE Trans Geosci Remote Sens 51(2):818–832CrossRef Yang Y, Newsam S (2013) Geographic Image retrieval using local invariant features. IEEE Trans Geosci Remote Sens 51(2):818–832CrossRef
7.
Zurück zum Zitat Yang Y, Newsam S (2008) Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery. In: 2008 15th IEEE international conference on image processing, 2008, pp 1852–1855 Yang Y, Newsam S (2008) Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery. In: 2008 15th IEEE international conference on image processing, 2008, pp 1852–1855
8.
Zurück zum Zitat Zhou XS, Huang TS (2003) Relevance feedback in image retrieval: a comprehensive review. Multimed Syst 8(6):536–544CrossRef Zhou XS, Huang TS (2003) Relevance feedback in image retrieval: a comprehensive review. Multimed Syst 8(6):536–544CrossRef
9.
Zurück zum Zitat van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596CrossRef van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596CrossRef
10.
Zurück zum Zitat Wu L, Hoi SCH, Yu N (2010) Semantics-preserving bag-of-words models and applications. IEEE Trans Image Process 19(7):1908–1920MathSciNetCrossRefMATH Wu L, Hoi SCH, Yu N (2010) Semantics-preserving bag-of-words models and applications. IEEE Trans Image Process 19(7):1908–1920MathSciNetCrossRefMATH
11.
Zurück zum Zitat Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition—volume 2 (CVPR’06), vol 2, pp 2169–2178 Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition—volume 2 (CVPR’06), vol 2, pp 2169–2178
12.
Zurück zum Zitat Olshausen BA, Field DJ (1997) Strategy employed by V1 ? Vis Res 37(23):3311–3325CrossRef Olshausen BA, Field DJ (1997) Strategy employed by V1 ? Vis Res 37(23):3311–3325CrossRef
13.
14.
Zurück zum Zitat Yang J, Wright J, Huang T, Ma Y (2008) Image super-resolution as sparse representation of raw image patches. In: 2008 IEEE conference on computer vision and pattern recognition, 2008, pp 1–8 Yang J, Wright J, Huang T, Ma Y (2008) Image super-resolution as sparse representation of raw image patches. In: 2008 IEEE conference on computer vision and pattern recognition, 2008, pp 1–8
15.
Zurück zum Zitat Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: IEEE Conference on computer vision and pattern recognition, pp 1–8 Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: IEEE Conference on computer vision and pattern recognition, pp 1–8
16.
Zurück zum Zitat Zeng S, Gou J, Yang X (2017) Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification. Neural Comput Appl Zeng S, Gou J, Yang X (2017) Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification. Neural Comput Appl
17.
Zurück zum Zitat Ranzato M, Huang FJ, Boureau Y-L, LeCun Y (2007) Unsupervised Learning of Invariant feature hierarchies with applications to object recognition. In: IEEE conference on computer vision and pattern recognition, pp 1–8 Ranzato M, Huang FJ, Boureau Y-L, LeCun Y (2007) Unsupervised Learning of Invariant feature hierarchies with applications to object recognition. In: IEEE conference on computer vision and pattern recognition, pp 1–8
18.
Zurück zum Zitat Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on computer vision and pattern recognition, pp 1794–1801 Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on computer vision and pattern recognition, pp 1794–1801
19.
Zurück zum Zitat Gao S, Tsang IW-H, Chia L-T, Zhao P (2010) Local features are not lonely & #x2013; Laplacian sparse coding for image classification. In: IEEE computer society conference on computer vision and pattern recognition, pp 3555–3561 Gao S, Tsang IW-H, Chia L-T, Zhao P (2010) Local features are not lonely & #x2013; Laplacian sparse coding for image classification. In: IEEE computer society conference on computer vision and pattern recognition, pp 3555–3561
20.
Zurück zum Zitat Agarwal V, Bhanot S (2017) Radial basis function neural network-based face recognition using firefly algorithm. Neural Comput Appl Agarwal V, Bhanot S (2017) Radial basis function neural network-based face recognition using firefly algorithm. Neural Comput Appl
21.
Zurück zum Zitat Lee JW, Lee JB, Park M, Song SH (2005) An extensive comparison of recent classification tools applied to microarray data. Comput Stat Data Anal 48(4):869–885MathSciNetCrossRefMATH Lee JW, Lee JB, Park M, Song SH (2005) An extensive comparison of recent classification tools applied to microarray data. Comput Stat Data Anal 48(4):869–885MathSciNetCrossRefMATH
22.
Zurück zum Zitat Statnikov A, Aliferis CF, Tsamardinos I, Hardin D, Levy S (2005) A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 21(5):631–643CrossRef Statnikov A, Aliferis CF, Tsamardinos I, Hardin D, Levy S (2005) A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 21(5):631–643CrossRef
23.
Zurück zum Zitat Nex F, Remondino F (2014) UAV for 3D mapping applications: a review. Appl Geomatics 6(1):1–15CrossRef Nex F, Remondino F (2014) UAV for 3D mapping applications: a review. Appl Geomatics 6(1):1–15CrossRef
24.
Zurück zum Zitat Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision, 1999, vol 2, pp 1150–1157 Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision, 1999, vol 2, pp 1150–1157
25.
Zurück zum Zitat Mondragon IF, Campoy P, Correa JF, Mejias L (2007) Visual model feature tracking for UAV Control. In IEEE international symposium on intelligent signal processing 2007, pp 1–6 Mondragon IF, Campoy P, Correa JF, Mejias L (2007) Visual model feature tracking for UAV Control. In IEEE international symposium on intelligent signal processing 2007, pp 1–6
26.
Zurück zum Zitat Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630CrossRef Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630CrossRef
27.
Zurück zum Zitat Ke y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004., vol 2, pp 506–513 Ke y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004., vol 2, pp 506–513
28.
Zurück zum Zitat Cai D, He X, Han J (2007) Spectral regression: a unified approach for sparse subspace learning. In: Seventh IEEE international conference on data mining (ICDM 2007), 2007, pp 73–82 Cai D, He X, Han J (2007) Spectral regression: a unified approach for sparse subspace learning. In: Seventh IEEE international conference on data mining (ICDM 2007), 2007, pp 73–82
29.
Zurück zum Zitat Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning. In: Proceedings of the 24th international conference on machine learning—ICML’07, 2007, pp 759–766 Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning. In: Proceedings of the 24th international conference on machine learning—ICML’07, 2007, pp 759–766
30.
Zurück zum Zitat Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: IEEE conference on computer vision and pattern recognition, 2008, pp 1–8 Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: IEEE conference on computer vision and pattern recognition, 2008, pp 1–8
31.
Zurück zum Zitat Wright J, Yang AY, Ganesh A, Sastry SS, Ma Yi (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef Wright J, Yang AY, Ganesh A, Sastry SS, Ma Yi (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef
32.
Zurück zum Zitat Siebert S, Teizer J (2014) Mobile 3D mapping for surveying earthwork projects using an Unmanned Aerial Vehicle (UAV) system. Autom Constr 41:1–14CrossRef Siebert S, Teizer J (2014) Mobile 3D mapping for surveying earthwork projects using an Unmanned Aerial Vehicle (UAV) system. Autom Constr 41:1–14CrossRef
33.
Zurück zum Zitat Elad M, Figueiredo MAT, Ma Y (2010) On the role of sparse and redundant representations in image processing. Proc IEEE 98(6):972–982CrossRef Elad M, Figueiredo MAT, Ma Y (2010) On the role of sparse and redundant representations in image processing. Proc IEEE 98(6):972–982CrossRef
34.
Zurück zum Zitat Mallat SG (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415CrossRefMATH Mallat SG (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415CrossRefMATH
35.
Zurück zum Zitat Mallat SG (1994) Adaptive time-frequency decompositions. Opt Eng 33(7):2183CrossRef Mallat SG (1994) Adaptive time-frequency decompositions. Opt Eng 33(7):2183CrossRef
36.
Zurück zum Zitat Wei Q, Bioucas-Dias J, Dobigeon N, Tourneret J-Y (2015) Hyperspectral and multispectral image fusion based on a sparse representation. IEEE Trans Geosci Remote Sens 53(7):3658–3668CrossRef Wei Q, Bioucas-Dias J, Dobigeon N, Tourneret J-Y (2015) Hyperspectral and multispectral image fusion based on a sparse representation. IEEE Trans Geosci Remote Sens 53(7):3658–3668CrossRef
37.
Zurück zum Zitat Qayyum A, Malik AS, Nuafal M, Mazher M, Ahmad RF, Abdullah MF (2015) Evaluation of optimization algorithms for sparse and redundant dictionaries. In: IEEE student symposium in biomedical engineering & sciences (ISSBES), 2015, pp 128–133 Qayyum A, Malik AS, Nuafal M, Mazher M, Ahmad RF, Abdullah MF (2015) Evaluation of optimization algorithms for sparse and redundant dictionaries. In: IEEE student symposium in biomedical engineering & sciences (ISSBES), 2015, pp 128–133
38.
39.
Zurück zum Zitat Rahmalan H, Abu NA, Wong SL (2010) Using tchebichef moment for fast and efficient image compression. Pattern Recognit Image Anal 20(4):505–512CrossRef Rahmalan H, Abu NA, Wong SL (2010) Using tchebichef moment for fast and efficient image compression. Pattern Recognit Image Anal 20(4):505–512CrossRef
40.
Zurück zum Zitat Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: IEEE computer society conference on computer vision and pattern recognition 2010, pp 2559–2566 Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: IEEE computer society conference on computer vision and pattern recognition 2010, pp 2559–2566
41.
Zurück zum Zitat Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2, pp 994–1000 Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2, pp 994–1000
42.
Zurück zum Zitat Agarwal A, Triggs B (2006) Hyperfeatures—multilevel local coding for visual recognition, 2006, pp 30–43 Agarwal A, Triggs B (2006) Hyperfeatures—multilevel local coding for visual recognition, 2006, pp 30–43
43.
Zurück zum Zitat Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790CrossRef Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790CrossRef
44.
Zurück zum Zitat Benediktsson JA, Pesaresi M, Arnason K (2003) Classification and feature extraction for remote sensing images from urban areas based on morphological transformations. IEEE Trans Geosci Remote Sens 41(9):1940–1949CrossRef Benediktsson JA, Pesaresi M, Arnason K (2003) Classification and feature extraction for remote sensing images from urban areas based on morphological transformations. IEEE Trans Geosci Remote Sens 41(9):1940–1949CrossRef
45.
Zurück zum Zitat Manikandan J, Venkataramani B (2009) Design of a modified one-against-all SVM classifier. In: 2009 IEEE international conference on systems, man and cybernetics, 2009, pp 1869–1874 Manikandan J, Venkataramani B (2009) Design of a modified one-against-all SVM classifier. In: 2009 IEEE international conference on systems, man and cybernetics, 2009, pp 1869–1874
46.
Zurück zum Zitat Ou G, Murphey YL (2007) Multi-class pattern classification using neural networks. Pattern Recognit 40(1):4–18CrossRefMATH Ou G, Murphey YL (2007) Multi-class pattern classification using neural networks. Pattern Recognit 40(1):4–18CrossRefMATH
47.
Zurück zum Zitat Kowalski PA, Kulczycki P (2015) Interval probabilistic neural network. Neural Comput Appl 28:817–834CrossRef Kowalski PA, Kulczycki P (2015) Interval probabilistic neural network. Neural Comput Appl 28:817–834CrossRef
48.
Zurück zum Zitat Orlowska-Kowalska T, Kaminski M (2014) Influence of the optimization methods on neural state estimation quality of the drive system with elasticity. Neural Comput Appl 24(6):1327–1340CrossRef Orlowska-Kowalska T, Kaminski M (2014) Influence of the optimization methods on neural state estimation quality of the drive system with elasticity. Neural Comput Appl 24(6):1327–1340CrossRef
49.
Zurück zum Zitat Huang G, Huang G-B, Song S, You K (2015) Trends in extreme learning machines: a review. Neural Networks 61:32–48CrossRefMATH Huang G, Huang G-B, Song S, You K (2015) Trends in extreme learning machines: a review. Neural Networks 61:32–48CrossRefMATH
50.
Zurück zum Zitat Huang G, Song S, Gupta JN, Wu C (2014) Semi-supervised and unsupervised extreme learning machines. IEEE Trans Cybern 44(12):2405–2417CrossRef Huang G, Song S, Gupta JN, Wu C (2014) Semi-supervised and unsupervised extreme learning machines. IEEE Trans Cybern 44(12):2405–2417CrossRef
51.
Zurück zum Zitat Zhang L, Jack LB, Nandi AK (2005) Extending genetic programming for multi-class classification by combining K-nearest neighbor. In: Proceedings. (ICASSP’05). IEEE international conference on acoustics, speech, and signal processing, 2005, vol 5, pp 349–352 Zhang L, Jack LB, Nandi AK (2005) Extending genetic programming for multi-class classification by combining K-nearest neighbor. In: Proceedings. (ICASSP’05). IEEE international conference on acoustics, speech, and signal processing, 2005, vol 5, pp 349–352
Metadaten
Titel
Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach
verfasst von
Abdul Qayyum
Aamir Saeed Malik
Naufal M. Saad
Mahboob Iqbal
Mohd Faris Abdullah
Waqas Rasheed
Tuan A. B. Rashid Abdullah
Mohd Yaqoob Bin Jafaar
Publikationsdatum
30.12.2017
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 8/2019
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-017-3300-5

Weitere Artikel der Ausgabe 8/2019

Neural Computing and Applications 8/2019 Zur Ausgabe

Original Article

Generalized BELBIC