Published in: Soft Computing 2/2018

11-10-2016 | Methodologies and Application

Metric forests based on Gaussian mixture model for visual image classification

Authors: Yong Xu, Qian Zhang, Lin Wang

Abstract

Visual image classification plays an important role in computer vision and pattern recognition. This paper proposes a new random forests method called metric forests. The method takes the distribution of the datasets (the original dataset and the bootstrapped ones) into full consideration by exploiting the distribution similarity between the original dataset and each bootstrapped dataset. For each bootstrapped dataset, a metric decision tree is built based on a Gaussian mixture model. A metric decision tree learned from a bootstrapped dataset with a low or high similarity index is given a small weight when voting, and a larger weight otherwise. The rationale is that a bootstrapped dataset with low similarity may not represent the original dataset well, while one with high similarity has a high chance of overfitting. To evaluate the proposed metric forests method, extensive experiments were conducted on visual image classification tasks, including texture image classification, flower image classification and food image classification. The experimental results validated the superiority of the proposed metric forests on the ALOT, Flower-102 and Food-101 datasets.
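
The voting idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual weighting function or the definition of the similarity index, so everything below is an assumption made for illustration: similarity values are taken to lie in [0, 1], and the hypothetical `tree_weight` uses a bell-shaped curve (with made-up `center` and `width` parameters) so that trees with low or high similarity contribute little to the vote.

```python
import math
from collections import defaultdict


def tree_weight(similarity, center=0.5, width=0.2):
    """Hypothetical bell-shaped weight (not the paper's formula).

    Returns a value near 1 for similarity close to `center` and a
    small value for very low or very high similarity.
    """
    return math.exp(-((similarity - center) ** 2) / (2 * width ** 2))


def weighted_vote(predictions, similarities):
    """Combine one class prediction per tree using similarity-based weights."""
    scores = defaultdict(float)
    for label, similarity in zip(predictions, similarities):
        scores[label] += tree_weight(similarity)
    # The class with the largest accumulated weight wins the vote.
    return max(scores, key=scores.get)


# Illustrative data only: five trees, their predicted labels, and the
# (assumed) similarity index of each tree's bootstrapped dataset.
predictions = ["texture", "flower", "flower", "texture", "texture"]
similarities = [0.05, 0.50, 0.55, 0.98, 0.10]

print(weighted_vote(predictions, similarities))
```

In this toy input, three of five trees predict "texture", but they come from bootstrapped datasets with extreme similarity values, so the two moderately similar trees outweigh them. An unweighted majority vote would return the other class.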


Metadata
Title
Metric forests based on Gaussian mixture model for visual image classification
Authors
Yong Xu
Qian Zhang
Lin Wang
Publication date
11-10-2016
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 2/2018
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-016-2350-4
