Skip to main content
Top
Published in: Machine Vision and Applications 6/2013

01-08-2013 | Original Paper

Analysis of object description methods in a video object tracking environment

Authors: Pedro Carvalho, Telmo Oliveira, Lucian Ciobanu, Filipe Gaspar, Luís F. Teixeira, Rafael Bastos, Jaime S. Cardoso, Miguel S. Dias, Luís Côrte-Real

Published in: Machine Vision and Applications | Issue 6/2013

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A key issue in video object tracking is the representation of the objects and how effectively it discriminates between different objects. Several techniques have been proposed, but without a generally accepted method. While analysis and comparisons of these individual methods have been presented in the literature, their evaluation as part of a global solution has been overlooked. The appearance model for the objects is a component of a video object tracking framework, depending on previous processing stages and affecting those that succeed it. As a result, these interdependencies should be taken into account when analysing the performance of the object description techniques. We propose an integrated analysis of object descriptors and appearance models through their comparison in a common object tracking solution. The goal is to contribute to a better understanding of object description methods and their impact on the tracking process. Our contributions are threefold: propose a novel descriptor evaluation and characterisation paradigm; perform the first integrated analysis of state-of-the-art description methods in a scenario of people tracking; put forward some ideas for appearance models to use in this context. This work provides foundations for future tests and the proposed assessment approach contributes to the informed selection of techniques more adequately for a given tracking application context.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
The reader may note some similarities with the architecture proposed by Moeslund [26].
 
Literature
1.
go back to reference Alahi, A., Vandergheynst, P., Bierlaire, M., Kunt, M.: Cascade of descriptors to detect and track objects across any network of cameras. Comput. Vis. Image Underst. 114, 624–640 (2010)CrossRef Alahi, A., Vandergheynst, P., Bierlaire, M., Kunt, M.: Cascade of descriptors to detect and track objects across any network of cameras. Comput. Vis. Image Underst. 114, 624–640 (2010)CrossRef
2.
go back to reference Bashir, F., Porikli, F.: Performance evaluation of object detection and tracking systems. In: PETS, Proceedings of IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (2006) Bashir, F., Porikli, F.: Performance evaluation of object detection and tracking systems. In: PETS, Proceedings of IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (2006)
3.
go back to reference Bastos, R., Dias, M.S.: FIRST–Fast Invariant to Rotation and Scale Transform: Invariant Image Features for Augmented Reality and Computer Vision. VDM Verlag, Saarbrucken (2009) Bastos, R., Dias, M.S.: FIRST–Fast Invariant to Rotation and Scale Transform: Invariant Image Features for Augmented Reality and Computer Vision. VDM Verlag, Saarbrucken (2009)
4.
go back to reference Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110, 346–359 (2008)CrossRef Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110, 346–359 (2008)CrossRef
5.
go back to reference Black, J., Ellis, T., Rosin, P.: A novel method for video tracking performance evaluation. In: Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS), pp. 125–132 (2003) Black, J., Ellis, T., Rosin, P.: A novel method for video tracking performance evaluation. In: Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS), pp. 125–132 (2003)
6.
go back to reference Bradski, G.R.: Computer vision face tracking for use in a perceptual user interface. Intel Technol. J. (Q2) (1998) Bradski, G.R.: Computer vision face tracking for use in a perceptual user interface. Intel Technol. J. (Q2) (1998)
7.
go back to reference Brown, M., Lowe, D.: Invariant features from interest point groups. In: British Machine Vision Conference, pp. 656–665 (2002) Brown, M., Lowe, D.: Invariant features from interest point groups. In: British Machine Vision Conference, pp. 656–665 (2002)
8.
go back to reference Cardoso, J.S., Carvalho, P., Teixeira, L.F., Corte-Real, L.: Partition-distance methods for assessing spatial segmentations of images and videos. Comput. Vis. Image Underst. 113(7), 811–823 (2009)CrossRef Cardoso, J.S., Carvalho, P., Teixeira, L.F., Corte-Real, L.: Partition-distance methods for assessing spatial segmentations of images and videos. Comput. Vis. Image Underst. 113(7), 811–823 (2009)CrossRef
12.
go back to reference Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), vol. 1, CVPR ’05, pp. 886–893. IEEE Computer Society, Washington (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), vol. 1, CVPR ’05, pp. 886–893. IEEE Computer Society, Washington (2005)
13.
go back to reference Denman, S., Fookes, C., Sridharan, S., Lakemond, R.: Dynamic performance measures for object tracking systems. In: Proceedings of the 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS ’09, pp. 541–546. IEEE Computer Society, Washington (2009) Denman, S., Fookes, C., Sridharan, S., Lakemond, R.: Dynamic performance measures for object tracking systems. In: Proceedings of the 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS ’09, pp. 541–546. IEEE Computer Society, Washington (2009)
14.
go back to reference Ellis, T.: Performance metrics and methods for tracking in surveillance. In: 3rd IEEE International Workshop on Performance Evaluation of Tracking and Surveillance PETS’2002. Copenhagen, Denmark (2002) Ellis, T.: Performance metrics and methods for tracking in surveillance. In: 3rd IEEE International Workshop on Performance Evaluation of Tracking and Surveillance PETS’2002. Copenhagen, Denmark (2002)
15.
go back to reference Erdem, Ç.E., Sankur, B., Tekalp, A.M.: Performance measures for video object segmentation and tracking. IEEE Trans. Image Process. 13(7), 937–951 (2004)CrossRef Erdem, Ç.E., Sankur, B., Tekalp, A.M.: Performance measures for video object segmentation and tracking. IEEE Trans. Image Process. 13(7), 937–951 (2004)CrossRef
16.
go back to reference Han, Z., Ye, Q., Jiao, J.: Combined feature evaluation for adaptive visual object tracking. Comput. Vis. Image Underst. 115, 69–80 (2011)CrossRef Han, Z., Ye, Q., Jiao, J.: Combined feature evaluation for adaptive visual object tracking. Comput. Vis. Image Underst. 115, 69–80 (2011)CrossRef
17.
go back to reference Jiang, Y.G., Yang, J., Ngo, C.W., Hauptmann, A.G.: Representations of Keypoint-Based semantic concept detection: a comprehensive study. IEEE Trans. Multimed. 12(1), 42–53 (2009)CrossRef Jiang, Y.G., Yang, J., Ngo, C.W., Hauptmann, A.G.: Representations of Keypoint-Based semantic concept detection: a comprehensive study. IEEE Trans. Multimed. 12(1), 42–53 (2009)CrossRef
18.
go back to reference Jiang, Z., Huynh, D.Q., Moran, W., Challa, S., Spadaccini, N.: Multiple pedestrian tracking using colour and motion models. Digit. Image Comput. Tech. Appl. 328–334 (2010) Jiang, Z., Huynh, D.Q., Moran, W., Challa, S., Spadaccini, N.: Multiple pedestrian tracking using colour and motion models. Digit. Image Comput. Tech. Appl. 328–334 (2010)
19.
go back to reference Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR’04, pp. 506–513. IEEE Computer Society, Washington (2004) Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR’04, pp. 506–513. IEEE Computer Society, Washington (2004)
20.
go back to reference Lazarevic-McManus, N., Renno, J.R., Makris, D., Jones, G.A.: An object-based comparative methodology for motion detection based on the F-Measure. Comput. Vis. Image Underst. 111(1), 74–85 (2008)CrossRef Lazarevic-McManus, N., Renno, J.R., Makris, D., Jones, G.A.: An object-based comparative methodology for motion detection based on the F-Measure. Comput. Vis. Image Underst. 111(1), 74–85 (2008)CrossRef
21.
go back to reference List, T., Fisher, R.B.: CVML–an XML-based computer vision markup language. In: Proceedings of the Pattern Recognition, 17th International Conference on (ICPR’04) vol. 1, ICPR ’04, pp. 789–792. IEEE Computer Society, Washington (2004) List, T., Fisher, R.B.: CVML–an XML-based computer vision markup language. In: Proceedings of the Pattern Recognition, 17th International Conference on (ICPR’04) vol. 1, ICPR ’04, pp. 789–792. IEEE Computer Society, Washington (2004)
22.
go back to reference Liu, H., Yu, Z., Zha, H., Zou, Y., Zhang, L.: Robust human tracking based on multi-cue integration and mean-shift. Pattern Recognit. Lett. 30 (2009) Liu, H., Yu, Z., Zha, H., Zou, Y., Zhang, L.: Robust human tracking based on multi-cue integration and mean-shift. Pattern Recognit. Lett. 30 (2009)
23.
go back to reference Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
24.
go back to reference Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. Int. J. Comput. Vis. 60, 63–86 (2004)CrossRef Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. Int. J. Comput. Vis. 60, 63–86 (2004)CrossRef
25.
go back to reference Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 2(10), 1615–1630 (2005)CrossRef Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 2(10), 1615–1630 (2005)CrossRef
26.
go back to reference Moeslund, T.B., Granum, E.: A survey of computer vision-based human motion capture. Comput. Vis. Image Underst. 81, 231–268 (2001)MATHCrossRef Moeslund, T.B., Granum, E.: A survey of computer vision-based human motion capture. Comput. Vis. Image Underst. 81, 231–268 (2001)MATHCrossRef
27.
go back to reference Nghiem, A.T., Bremond, F., Thonnat, M., Valentin, V.: Etiseo, performance evaluation for video surveillance systems. In: Proceedings of the 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 476–481. IEEE Computer Society, Washington (2007) Nghiem, A.T., Bremond, F., Thonnat, M., Valentin, V.: Etiseo, performance evaluation for video surveillance systems. In: Proceedings of the 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 476–481. IEEE Computer Society, Washington (2007)
28.
go back to reference Opelt, A., Pinz, A., Zisserman, A.: Incremental learning of object detectors using a visual shape alphabet. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. 1, 3–10 (2006) Opelt, A., Pinz, A., Zisserman, A.: Incremental learning of object detectors using a visual shape alphabet. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. 1, 3–10 (2006)
30.
go back to reference Schlogl, T., Beleznai, C., Winter, M., Bischof, H.: Performance evaluation metrics for motion detection and tracking. In: ICPR ’04: Proceedings of the Pattern Recognition, 17th International Conference on (ICPR’04), vol. 4, pp. 519–522. IEEE Computer Society, Washington (2004) Schlogl, T., Beleznai, C., Winter, M., Bischof, H.: Performance evaluation metrics for motion detection and tracking. In: ICPR ’04: Proceedings of the Pattern Recognition, 17th International Conference on (ICPR’04), vol. 4, pp. 519–522. IEEE Computer Society, Washington (2004)
31.
go back to reference Schmid, C., Mohr, R., Bauckhage, C.: Evaluation of interest point detectors. Int. J. Comput. Vis. 37, 151–172 (2000)MATHCrossRef Schmid, C., Mohr, R., Bauckhage, C.: Evaluation of interest point detectors. Int. J. Comput. Vis. 37, 151–172 (2000)MATHCrossRef
32.
go back to reference Shahed, S.M.N., Ho, J., Yang, M.H.: Online visual tracking with histograms and articulating blocks. Comput. Vis. Image Underst. 114(8), 901–914 (2010)CrossRef Shahed, S.M.N., Ho, J., Yang, M.H.: Online visual tracking with histograms and articulating blocks. Comput. Vis. Image Underst. 114(8), 901–914 (2010)CrossRef
33.
go back to reference Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering objects and their localization in images. In: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV’05), vol. 1, pp. 370–377. IEEE Computer Society, Washington (2005) Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering objects and their localization in images. In: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV’05), vol. 1, pp. 370–377. IEEE Computer Society, Washington (2005)
34.
go back to reference Szeliski, R.: Computer Vision: Algorithms and Applications. Springer, New York (2010) Szeliski, R.: Computer Vision: Algorithms and Applications. Springer, New York (2010)
35.
go back to reference Tang, F., Tao, H.: Object tracking with dynamic feature graph. In: ICCCN ’05: Proceedings of the 14th International Conference on Computer Communications and, Networks, pp. 25–32 (2005) Tang, F., Tao, H.: Object tracking with dynamic feature graph. In: ICCCN ’05: Proceedings of the 14th International Conference on Computer Communications and, Networks, pp. 25–32 (2005)
36.
go back to reference Teixeira, L., Carvalho, P., Cardoso, J., Corte-Real, L.: Automatic description of object appearances in a wide-area surveillance scenario. In: 19th IEEE International Conference on Image Processing (ICIP), pp. 1609–1612 (2012) Teixeira, L., Carvalho, P., Cardoso, J., Corte-Real, L.: Automatic description of object appearances in a wide-area surveillance scenario. In: 19th IEEE International Conference on Image Processing (ICIP), pp. 1609–1612 (2012)
37.
go back to reference Teixeira, L.F., Cardoso, J.S., Corte-Real, L.: Object segmentation using background modelling and cascaded change detection. J. Multimed. (JMM) 2, 55–65 (2007) Teixeira, L.F., Cardoso, J.S., Corte-Real, L.: Object segmentation using background modelling and cascaded change detection. J. Multimed. (JMM) 2, 55–65 (2007)
38.
go back to reference Tell, D., Carlsson, S.: Combining appearance and topology for wide baseline matching. In: Proceedings of the 7th European Conference on Computer Vision-Part I, ECCV ’02, pp. 68–81. Springer, London (2002) Tell, D., Carlsson, S.: Combining appearance and topology for wide baseline matching. In: Proceedings of the 7th European Conference on Computer Vision-Part I, ECCV ’02, pp. 68–81. Springer, London (2002)
39.
go back to reference Tissainayagam, P., Suter, D.: Assessing the performance of corner detectors for point feature tracking applications. Image Vis. Comput. 22, 663–679 (2004)CrossRef Tissainayagam, P., Suter, D.: Assessing the performance of corner detectors for point feature tracking applications. Image Vis. Comput. 22, 663–679 (2004)CrossRef
40.
go back to reference Venetianer, P.L., Deng, H.: Performance evaluation of an intelligent video surveillance system—a case study. Comput. Vis. Image Underst. 114, 1292–1302 (2010)CrossRef Venetianer, P.L., Deng, H.: Performance evaluation of an intelligent video surveillance system—a case study. Comput. Vis. Image Underst. 114, 1292–1302 (2010)CrossRef
41.
go back to reference Vizireanu, D.N.: Generalizations of binary morphological shape decomposition. J. Electron. Imaging 16, 013,002 (2007) Vizireanu, D.N.: Generalizations of binary morphological shape decomposition. J. Electron. Imaging 16, 013,002 (2007)
42.
go back to reference Vizireanu, N., Halunga, S., Marghescu, G.: Morphological skeleton decomposition interframe interpolation method. J. Electron. Imaging 19, 023,018 (2010) Vizireanu, N., Halunga, S., Marghescu, G.: Morphological skeleton decomposition interframe interpolation method. J. Electron. Imaging 19, 023,018 (2010)
43.
go back to reference Wu, L., Hu, Y., Li, M., Yu, N., Hua, X.S.: Scale-invariant visual language modeling for object categorization. IEEE Trans. Multimed. 11, 286–294 (2009)CrossRef Wu, L., Hu, Y., Li, M., Yu, N., Hua, X.S.: Scale-invariant visual language modeling for object categorization. IEEE Trans. Multimed. 11, 286–294 (2009)CrossRef
44.
go back to reference Zhao, T.: Model-based segmentation and tracking of multiple humans in complex situations. Ph.D. thesis, Faculty of the Graduate School of the University of Southern California (2004) Zhao, T.: Model-based segmentation and tracking of multiple humans in complex situations. Ph.D. thesis, Faculty of the Graduate School of the University of Southern California (2004)
45.
go back to reference Zhao, T., Nevatia, R.: Tracking multiple humans in complex situations. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1208–1211 (2004) Zhao, T., Nevatia, R.: Tracking multiple humans in complex situations. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1208–1211 (2004)
46.
go back to reference Zhou, H., Yuan, Y., Shi, C.: Object tracking using SIFT features and mean shift. Comput. Vis. Image Underst. 113, 345–352 (2009) Zhou, H., Yuan, Y., Shi, C.: Object tracking using SIFT features and mean shift. Comput. Vis. Image Underst. 113, 345–352 (2009)
Metadata
Title
Analysis of object description methods in a video object tracking environment
Authors
Pedro Carvalho
Telmo Oliveira
Lucian Ciobanu
Filipe Gaspar
Luís F. Teixeira
Rafael Bastos
Jaime S. Cardoso
Miguel S. Dias
Luís Côrte-Real
Publication date
01-08-2013
Publisher
Springer Berlin Heidelberg
Published in
Machine Vision and Applications / Issue 6/2013
Print ISSN: 0932-8092
Electronic ISSN: 1432-1769
DOI
https://doi.org/10.1007/s00138-013-0523-z

Other articles of this Issue 6/2013

Machine Vision and Applications 6/2013 Go to the issue

Premium Partner