Skip to main content
Erschienen in: Pattern Analysis and Applications 2/2012

01.05.2012 | Theoretical Advances

EnMS: early non-maxima suppression

Speeding up pattern localization and other tasks

verfasst von: Adam Herout, Michal Hradiš, Pavel Zemčík

Erschienen in: Pattern Analysis and Applications | Ausgabe 2/2012

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Detection of objects in images using statistical classifiers is a well studied and documented technique. Different applications of such detectors often require selection of the image position with the highest response of the detector—they perform non-maxima suppression. This article introduces the concept of early non-maxima suppression, which aims to reduce necessary computations by making the non-maxima suppression decision early based on incomplete information provided by a partially evaluated classifier. We show that the error of one such speculative decision with respect to a decision made based on response of the complete classifier can be estimated by collecting statistics on unlabeled data. The article then considers a sequential strategy of multiple early non-maxima suppression tests which follows the structure of soft-cascade detectors commonly used for object detection. We also show that an optimal (fastest for requested error rate) suppression strategy can be created by a novel variant of Wald’s sequential probability ratio test (SPRT) which we call the conditioned SPRT (CSPRT). Experimental results show that the early non-maxima suppression significantly reduces amount of computation in the case of object localization while the error rates are limited to low predefined values. The proposed approach notably outperforms the state-of-the-art detectors based on WaldBoost. The potential applications of the early non-maxima suppression approach are not limited to object localization and could be applied wherever the goal is to find the strongest response of a classifier among a set of classified samples.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, New York Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, New York
2.
Zurück zum Zitat Boom BJ, van Rootseler RTA, Veldhuis RNJ (2007) Investigating the boosting framework for face recognition. In: Veldhuis RNJ, Cronie HS (eds) Proceedings of the 28th symposium on information theory in the Benelux, Eindhoven, May 2007. Werkgemeenschap voor Informatie- en Communicatietechniek, pp 189–196 Boom BJ, van Rootseler RTA, Veldhuis RNJ (2007) Investigating the boosting framework for face recognition. In: Veldhuis RNJ, Cronie HS (eds) Proceedings of the 28th symposium on information theory in the Benelux, Eindhoven, May 2007. Werkgemeenschap voor Informatie- en Communicatietechniek, pp 189–196
3.
Zurück zum Zitat Bourdev L, Brandt J (2005) Robust object detection via soft cascade. In: CVPR Bourdev L, Brandt J (2005) Robust object detection via soft cascade. In: CVPR
4.
Zurück zum Zitat Brubaker CS, Mullin MD, Rehg JM (2006) Towards optimal training of cascaded detectors. In: ECCV06, pp 325–337 Brubaker CS, Mullin MD, Rehg JM (2006) Towards optimal training of cascaded detectors. In: ECCV06, pp 325–337
5.
Zurück zum Zitat Cao Y, Xu J, Liu T-Y, Li H, Huang Y, Hon H-W (2006) Adapting ranking SVM to document retrieval. In: SIGIR ’06: proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval. ACM, New York (pp 186–193) Cao Y, Xu J, Liu T-Y, Li H, Huang Y, Hon H-W (2006) Adapting ranking SVM to document retrieval. In: SIGIR ’06: proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval. ACM, New York (pp 186–193)
6.
Zurück zum Zitat Cha Z, Viola P (2007) Multiple-instance pruning for learning efficient cascade detectors. In: NIPS Cha Z, Viola P (2007) Multiple-instance pruning for learning efficient cascade detectors. In: NIPS
7.
Zurück zum Zitat Grabner H, Grabner M, Bischof H (2006) Real-time tracking via on-line boosting, vol 1, p 47 Grabner H, Grabner M, Bischof H (2006) Real-time tracking via on-line boosting, vol 1, p 47
8.
Zurück zum Zitat Grabner H, Šochman J, Bischof H, Matas JG (2008) Training sequential on-line boosting classifier for visual tracking. In: ICPR08, international conference on pattern recognition, pp 1–4 Grabner H, Šochman J, Bischof H, Matas JG (2008) Training sequential on-line boosting classifier for visual tracking. In: ICPR08, international conference on pattern recognition, pp 1–4
9.
Zurück zum Zitat Herout A, Hradiš M, Jošth R, Juranek R, Zemčik P (2008) “Local rank differences” image faeture implemented on GPU. In: Advanced concepts for intelligent vision systems. Lecture notes in computer science, pp 170–181 Herout A, Hradiš M, Jošth R, Juranek R, Zemčik P (2008) “Local rank differences” image faeture implemented on GPU. In: Advanced concepts for intelligent vision systems. Lecture notes in computer science, pp 170–181
10.
Zurück zum Zitat Hradiš M, Herout A, Zemčik P (2008) Local rank patterns—novel features for rapid object detection. In: Proceedings of international conference on computer vision and graphics 2008. Lecture notes in computer science, pp 1–12 Hradiš M, Herout A, Zemčik P (2008) Local rank patterns—novel features for rapid object detection. In: Proceedings of international conference on computer vision and graphics 2008. Lecture notes in computer science, pp 1–12
11.
Zurück zum Zitat Huang C, Ai HZ, Li Y, Lao SH (2007) High-performance rotation invariant multiview face detection. PAMI 29(4):671–686CrossRef Huang C, Ai HZ, Li Y, Lao SH (2007) High-performance rotation invariant multiview face detection. PAMI 29(4):671–686CrossRef
12.
Zurück zum Zitat Jesorsky O, Kirchberg KJ, Frischholz R (2001) Robust face detection using the Hausdorff distance. In: AVBPA ’01: proceedings of the third international conference on audio- and video-based biometric person authentication. Springer, London, pp 90–95 Jesorsky O, Kirchberg KJ, Frischholz R (2001) Robust face detection using the Hausdorff distance. In: AVBPA ’01: proceedings of the third international conference on audio- and video-based biometric person authentication. Springer, London, pp 90–95
13.
Zurück zum Zitat Messer K, Matas J, Kittler J, Luettin J, Maiter G (1999) XM2VTSDB: the extended M2VTS database. In: Second international conference on audio and video-based biometric person authentication, pp 72–77 Messer K, Matas J, Kittler J, Luettin J, Maiter G (1999) XM2VTSDB: the extended M2VTS database. In: Second international conference on audio and video-based biometric person authentication, pp 72–77
14.
Zurück zum Zitat Mutch J, Lowe DG (2006) Multiclass object recognition with sparse, localized features. In: 2006 IEEE Computer Society conference on computer vision and pattern recognition, vol 1, pp 11–18 Mutch J, Lowe DG (2006) Multiclass object recognition with sparse, localized features. In: 2006 IEEE Computer Society conference on computer vision and pattern recognition, vol 1, pp 11–18
15.
Zurück zum Zitat Schapire RE, Singer Y (1999) Improved boosting algorithms using confidence-rated predictions. Mach Learn 37:297–336 Schapire RE, Singer Y (1999) Improved boosting algorithms using confidence-rated predictions. Mach Learn 37:297–336
16.
Zurück zum Zitat Schneiderman H (2004) Feature-centric evaluation for efficient cascaded object detection. In: IEEE Computer Society conference on computer vision and pattern recognition, vol 2, pp 29–36 Schneiderman H (2004) Feature-centric evaluation for efficient cascaded object detection. In: IEEE Computer Society conference on computer vision and pattern recognition, vol 2, pp 29–36
17.
Zurück zum Zitat Šochman J, Matas J (2005) WaldBoost—learning for time constrained sequential detection. In: CVPR ’05: proceedings of the 2005 IEEE Computer Society conference on computer vision and pattern recognition (CVPR’05), vol 2. IEEE Computer Society, Washington, DC. pp 150–156 Šochman J, Matas J (2005) WaldBoost—learning for time constrained sequential detection. In: CVPR ’05: proceedings of the 2005 IEEE Computer Society conference on computer vision and pattern recognition (CVPR’05), vol 2. IEEE Computer Society, Washington, DC. pp 150–156
18.
Zurück zum Zitat Šochman J, Matas J (2009) Learning fast emulators of binary decision processes. Int J Comput Vision 83(2):149–163 Šochman J, Matas J (2009) Learning fast emulators of binary decision processes. Int J Comput Vision 83(2):149–163
19.
Zurück zum Zitat Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: IEEE Computer Society conference on computer vision and pattern recognition, vol 1, p 511 Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: IEEE Computer Society conference on computer vision and pattern recognition, vol 1, p 511
Metadaten
Titel
EnMS: early non-maxima suppression
Speeding up pattern localization and other tasks
verfasst von
Adam Herout
Michal Hradiš
Pavel Zemčík
Publikationsdatum
01.05.2012
Verlag
Springer-Verlag
Erschienen in
Pattern Analysis and Applications / Ausgabe 2/2012
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-011-0213-2

Weitere Artikel der Ausgabe 2/2012

Pattern Analysis and Applications 2/2012 Zur Ausgabe

Premium Partner