Published in: Pattern Analysis and Applications 3/2005

01.12.2005 | Theoretical Advances

An effective 3D target recognition model imitating robust methods of the human visual system

Authors: Sungho Kim, Gijeong Jang, In So Kweon


Abstract

This paper presents a model of 3D object recognition motivated by the robust properties of the human visual system (HVS). The HVS shows the best efficiency and robustness in object identification tasks. Its robust properties include visual attention, contrast mechanisms, feature binding, multi-resolution processing, size tuning, and part-based representation. In addition, it combines bottom-up and top-down information cooperatively. Building on these properties, the paper proposes a plausible computational model that integrates them under a Monte Carlo optimization technique. In this scheme, object recognition is treated as a parameter optimization problem: the bottom-up process initializes the parameters in a discriminative way, and the top-down process optimizes them in a generative way. Experimental results show that the proposed recognition model is feasible for 3D object identification and pose estimation in visible and infrared band images.
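The split between discriminative initialization and generative optimization can be illustrated with a toy sketch. The code below is not the authors' model: it assumes a hypothetical 1D "image" containing a single Gaussian-shaped object, uses the strongest response as a bottom-up guess for the object's position, and then refines position and width with a simple Metropolis-style random search against a rendered model. All function names, the Gaussian object model, and the step sizes are assumptions made for illustration only.

```python
import math
import random

def render(center, width, n=64):
    """Generative model (assumed): render a 1D Gaussian 'object'."""
    return [math.exp(-0.5 * ((x - center) / width) ** 2) for x in range(n)]

def energy(params, observed):
    """Sum of squared differences between rendered model and observation."""
    center, width = params
    model = render(center, width, len(observed))
    return sum((m - o) ** 2 for m, o in zip(model, observed))

def bottom_up_init(observed):
    """Discriminative initialization: strongest response gives the center."""
    center = max(range(len(observed)), key=lambda i: observed[i])
    return (float(center), 5.0)  # width 5.0 is an arbitrary starting guess

def top_down_optimize(observed, init, steps=2000, temperature=0.01, seed=0):
    """Metropolis-style stochastic search refining the initial hypothesis."""
    rng = random.Random(seed)
    current, current_e = init, energy(init, observed)
    best, best_e = current, current_e
    for _ in range(steps):
        # Perturb the current hypothesis; keep the width strictly positive.
        proposal = (current[0] + rng.gauss(0, 0.5),
                    max(0.5, current[1] + rng.gauss(0, 0.3)))
        e = energy(proposal, observed)
        # Always accept improvements; occasionally accept worse proposals.
        if e < current_e or rng.random() < math.exp((current_e - e) / temperature):
            current, current_e = proposal, e
            if e < best_e:
                best, best_e = proposal, e
    return best, best_e

def demo():
    """Synthesize a noisy observation and recover its parameters."""
    rng = random.Random(1)
    truth = (30.3, 3.0)  # hypothetical ground-truth center and width
    observed = [v + rng.gauss(0, 0.02) for v in render(*truth)]
    init = bottom_up_init(observed)
    init_e = energy(init, observed)
    best, best_e = top_down_optimize(observed, init)
    return truth, init, init_e, best, best_e

if __name__ == "__main__":
    truth, init, init_e, best, best_e = demo()
    print("truth:", truth)
    print("bottom-up init:", init, "energy:", round(init_e, 4))
    print("top-down result:", best, "energy:", round(best_e, 4))
```

The bottom-up step is cheap and only needs to land near the right answer; the top-down step is more expensive because it renders the model for each proposal, but it starts from a good hypothesis and so needs to explore only a small region of the parameter space.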


Metadata
Title
An effective 3D target recognition model imitating robust methods of the human visual system
Authors
Sungho Kim
Gijeong Jang
In So Kweon
Publication date
01.12.2005
Publisher
Springer-Verlag
Published in
Pattern Analysis and Applications / Issue 3/2005
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-005-0001-y
