Skip to main content

2021 | OriginalPaper | Buchkapitel

4. Random Forests with Optimized Leaves for Hough-Voting

verfasst von : Hui Liang, Junsong Yuan

Erschienen in: Intelligent Scene Modeling and Human-Computer Interaction

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Random forest-based Hough-voting techniques are important in numerous computer vision problems such as pose estimation and gesture recognition. Particularly, the voting weights of leaf nodes in random forests have a big impact on performance. We propose to improve Hough-voting with random forests by learning optimized weights of leaf nodes during training. We have investigated two ways for the leaf weight optimization problem by either applying L2 constraints or L0 constraints to those weights. We show that with additional L0 constraints, we are able to simultaneously obtain optimized leaf weights and prune unreliable leaf nodes in the forests, but with additional costs of more computational costs involved during training. We have applied the proposed algorithms to a number of different problems in computer vision, including hand pose estimation, head pose estimation, and hand gesture recognition. The experimental results show that with L2-regularization, regression and classification accuracy are improved considerably. Further, with L0-regularization, many unreliable leaf nodes are suppressed and the tree structure is compressed considerably, while the performance is still comparable to L2-regularization.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Agarwal A, Triggs B (2006) Recovering 3d human pose from monocular images. IEEE Trans Pattern Anal Mach Intell Agarwal A, Triggs B (2006) Recovering 3d human pose from monocular images. IEEE Trans Pattern Anal Mach Intell
Zurück zum Zitat Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit 13(2):111–122CrossRef Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit 13(2):111–122CrossRef
Zurück zum Zitat Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619CrossRef Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619CrossRef
Zurück zum Zitat Duda RO, Hart PE (1972) Use of the Hough transformation to detect lines and curves in pictures. Commun ACM 15(1):11–15CrossRef Duda RO, Hart PE (1972) Use of the Hough transformation to detect lines and curves in pictures. Commun ACM 15(1):11–15CrossRef
Zurück zum Zitat Fanelli G, Dantone M, Gall J, Fossati A, Gool LV (2013) Random forests for real time 3d face analysis. Int J Comput Vis 101(3):437–458CrossRef Fanelli G, Dantone M, Gall J, Fossati A, Gool LV (2013) Random forests for real time 3d face analysis. Int J Comput Vis 101(3):437–458CrossRef
Zurück zum Zitat Fisher NI (2000) Statistical analysis of circular data. Cambridge University Press Fisher NI (2000) Statistical analysis of circular data. Cambridge University Press
Zurück zum Zitat Gall J, Lempitsky V (2009) Class-specific Hough forests for object detection. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1022–1029 Gall J, Lempitsky V (2009) Class-specific Hough forests for object detection. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1022–1029
Zurück zum Zitat Girshick R, Shotton J, Kohli P, Criminisi A, Fitzgibbon A (2011) Efficient regression of general-activity human poses from depth images. In: International conference on computer vision Girshick R, Shotton J, Kohli P, Criminisi A, Fitzgibbon A (2011) Efficient regression of general-activity human poses from depth images. In: International conference on computer vision
Zurück zum Zitat Gould N, Toint PL (2004) Preprocessing for quadratic programming. Math Program Ser B 100:95–132MathSciNetMATH Gould N, Toint PL (2004) Preprocessing for quadratic programming. Math Program Ser B 100:95–132MathSciNetMATH
Zurück zum Zitat Hara K, Chellappa R (2014) Growing regression forests by classification: applications to object pose estimation. In: European conference on computer vision. Springer, pp 552–567 Hara K, Chellappa R (2014) Growing regression forests by classification: applications to object pose estimation. In: European conference on computer vision. Springer, pp 552–567
Zurück zum Zitat Herdtweck C, Curio C (2013) Monocular car viewpoint estimation with circular regression forests. In: IEEE intelligent vehicles symposium Herdtweck C, Curio C (2013) Monocular car viewpoint estimation with circular regression forests. In: IEEE intelligent vehicles symposium
Zurück zum Zitat Keskin C, Kıraç F, Kara YE, Akarun L (2012) Hand pose estimation and hand shape classification using multi-layered randomized decision forests. In: European Conference on Computer Vision, Springer, pp 852–863 Keskin C, Kıraç F, Kara YE, Akarun L (2012) Hand pose estimation and hand shape classification using multi-layered randomized decision forests. In: European Conference on Computer Vision, Springer, pp 852–863
Zurück zum Zitat Leibe B, Leonardis A, Schiele B (2004) Combined object categorization and segmentation with an implicit shape model. In: ECCV workshop on statistical learning in computer vision, pp 17–32 Leibe B, Leonardis A, Schiele B (2004) Combined object categorization and segmentation with an implicit shape model. In: ECCV workshop on statistical learning in computer vision, pp 17–32
Zurück zum Zitat Liang H, Yuan J, Thalmann D (2012) 3d fingertip and palm tracking in depth image sequences. In: Proceedings of the 20th ACM international conference on Multimedia. ACM, pp 785–788 Liang H, Yuan J, Thalmann D (2012) 3d fingertip and palm tracking in depth image sequences. In: Proceedings of the 20th ACM international conference on Multimedia. ACM, pp 785–788
Zurück zum Zitat Liang H, Yuan J, Thalmann D (2014) Parsing the hand in depth images. IEEE Trans Multimed Liang H, Yuan J, Thalmann D (2014) Parsing the hand in depth images. IEEE Trans Multimed
Zurück zum Zitat Liang H, Hou J, Yuan J, Thalmann D (2016) Random forest with suppressed leaves for Hough voting. In: Asian conference on computer vision Liang H, Hou J, Yuan J, Thalmann D (2016) Random forest with suppressed leaves for Hough voting. In: Asian conference on computer vision
Zurück zum Zitat Liang H, Yuan J, Lee J, Ge L, Thalmann D (2019) Hough forest with optimized leaves for global hand pose estimation with arbitrary postures. IEEE Trans Cybern Liang H, Yuan J, Lee J, Ge L, Thalmann D (2019) Hough forest with optimized leaves for global hand pose estimation with arbitrary postures. IEEE Trans Cybern
Zurück zum Zitat Lin Z, Chen M, Ma Y (2010) The augmented Lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:10095055 Lin Z, Chen M, Ma Y (2010) The augmented Lagrange multiplier method for exact recovery of corrupted low-rank matrices. arXiv preprint arXiv:​10095055
Zurück zum Zitat Maji S, Malik J (2009) Object detection using a max-margin Hough transform. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1038–1045 Maji S, Malik J (2009) Object detection using a max-margin Hough transform. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1038–1045
Zurück zum Zitat Papazov C, Marks TK, Jones M (2015) Real-time 3d head pose and facial landmark estimation from depth images using triangular surface patch features. In: IEEE conference on computer vision and pattern recognition Papazov C, Marks TK, Jones M (2015) Real-time 3d head pose and facial landmark estimation from depth images using triangular surface patch features. In: IEEE conference on computer vision and pattern recognition
Zurück zum Zitat Ren S, Cao X, Wei Y, Sun J (2015) Global refinement of random forest. In: The IEEE conference on computer vision and pattern recognition Ren S, Cao X, Wei Y, Sun J (2015) Global refinement of random forest. In: The IEEE conference on computer vision and pattern recognition
Zurück zum Zitat Ren Z, Yuan J, Meng J, Zhang Z (2013) Robust part-based hand gesture recognition using kinect sensor. IEEE Trans Multimed 15(5):1110–1120CrossRef Ren Z, Yuan J, Meng J, Zhang Z (2013) Robust part-based hand gesture recognition using kinect sensor. IEEE Trans Multimed 15(5):1110–1120CrossRef
Zurück zum Zitat Rota Bulo S, Kontschieder P (2014) Neural decision forests for semantic image labelling. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 81–88 Rota Bulo S, Kontschieder P (2014) Neural decision forests for semantic image labelling. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 81–88
Zurück zum Zitat Schulter S, Leistner C, Wohlhart P, Roth PM, Bischof H (2013) Alternating regression forests for object detection and pose estimation. In: 2013 IEEE international conference on computer vision (ICCV). IEEE, pp 417–424 Schulter S, Leistner C, Wohlhart P, Roth PM, Bischof H (2013) Alternating regression forests for object detection and pose estimation. In: 2013 IEEE international conference on computer vision (ICCV). IEEE, pp 417–424
Zurück zum Zitat Shotton J, Fitzgibbon A, Cook M, Sharp T, Finocchio M, Moore R, Kipman A, Blake A (2011) Realtime human pose recognition in parts from single depth images. In: IEEE conference on computer vision and pattern recognition Shotton J, Fitzgibbon A, Cook M, Sharp T, Finocchio M, Moore R, Kipman A, Blake A (2011) Realtime human pose recognition in parts from single depth images. In: IEEE conference on computer vision and pattern recognition
Zurück zum Zitat Shotton J, Girshick R, Fitzgibbon A, Sharp T, Cook M, Finocchio M, Moore R, Kohli P, Criminisi A, Kipman A et al (2013) Efficient human pose estimation from single depth images. IEEE Trans Pattern Anal Mach Intell 35(12):2821–2840CrossRef Shotton J, Girshick R, Fitzgibbon A, Sharp T, Cook M, Finocchio M, Moore R, Kohli P, Criminisi A, Kipman A et al (2013) Efficient human pose estimation from single depth images. IEEE Trans Pattern Anal Mach Intell 35(12):2821–2840CrossRef
Zurück zum Zitat Sirmaçek B, Ünsalan C (2010) Road network extraction using edge detection and spatial voting. In: International conference on pattern recognition. IEEE, pp 3113–3116 Sirmaçek B, Ünsalan C (2010) Road network extraction using edge detection and spatial voting. In: International conference on pattern recognition. IEEE, pp 3113–3116
Zurück zum Zitat Wohlhart P, Schulter S, Köstinger M, Roth PM, Bischof H (2012) Discriminative Hough forests for object detection. In: British machine vision conference, pp 1–11 Wohlhart P, Schulter S, Köstinger M, Roth PM, Bischof H (2012) Discriminative Hough forests for object detection. In: British machine vision conference, pp 1–11
Zurück zum Zitat Xu C, Cheng L (2013) Efficient hand pose estimation from a single depth image. In: International conference on computer vision Xu C, Cheng L (2013) Efficient hand pose estimation from a single depth image. In: International conference on computer vision
Zurück zum Zitat Yang H, Patras I (2013) Sieving regression forest votes for facial feature detection in the wild. In: IEEE international conference on computer vision. IEEE, pp 1936–1943 Yang H, Patras I (2013) Sieving regression forest votes for facial feature detection in the wild. In: IEEE international conference on computer vision. IEEE, pp 1936–1943
Zurück zum Zitat Zhang C, Tian Y (2015) Histogram of 3d facets: a depth descriptor for human action and hand gesture recognition. Comput Vis Image Underst 139:29–39CrossRef Zhang C, Tian Y (2015) Histogram of 3d facets: a depth descriptor for human action and hand gesture recognition. Comput Vis Image Underst 139:29–39CrossRef
Zurück zum Zitat Zhang C, Yang X, Tian Y (2013) Histogram of 3d facets: a characteristic descriptor for hand gesture recognition. In: IEEE international conference and workshops on automatic face and gesture recognition. IEEE, pp 1–8 Zhang C, Yang X, Tian Y (2013) Histogram of 3d facets: a characteristic descriptor for hand gesture recognition. In: IEEE international conference and workshops on automatic face and gesture recognition. IEEE, pp 1–8
Metadaten
Titel
Random Forests with Optimized Leaves for Hough-Voting
verfasst von
Hui Liang
Junsong Yuan
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-71002-6_4

Neuer Inhalt