Skip to main content
Top
Published in: Machine Vision and Applications 7/2016

01-10-2016 | Original Paper

Continuous localization and mapping of a pan–tilt–zoom camera for wide area tracking

Authors: Giuseppe Lisanti, Iacopo Masi, Federico Pernici, Alberto Del Bimbo

Published in: Machine Vision and Applications | Issue 7/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Pan–tilt–zoom (PTZ) cameras are well suited for object identification and recognition in far-field scenes. However, the effective use of PTZ cameras is complicated by the fact that a continuous online camera calibration is needed and the absolute pan, tilt and zoom values provided by the camera actuators cannot be used because they are not synchronized with the video stream. So, accurate calibration must be directly extracted from the visual content of the frames. Moreover, the large and abrupt scale changes, the scene background changes due to the camera operation and the need of camera motion compensation make target tracking with these cameras extremely challenging. In this paper, we present a solution that provides continuous online calibration of PTZ cameras which is robust to rapid camera motion, changes of the environment due to varying illumination or moving objects. The approach also scales beyond thousands of scene landmarks extracted with the SURF keypoint detector. The method directly derives the relationship between the position of a target in the ground plane and the corresponding scale and position in the image and allows real-time tracking of multiple targets with high and stable degree of accuracy even at far distances and any zoom level.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
In the case of a PTZ sensor, the homography between each keyframe and the reference keyframe is the infinite homography \(\mathtt {H}_\infty \) that puts in relation vanishing lines and vanishing points between the images.
 
Literature
1.
go back to reference Agapito, L., Hayman, E., Reid, I.D.: Self-calibration of rotating and zooming cameras. Int. J. Comput. Vision 45(2), 107–127 (2001)CrossRefMATH Agapito, L., Hayman, E., Reid, I.D.: Self-calibration of rotating and zooming cameras. Int. J. Comput. Vision 45(2), 107–127 (2001)CrossRefMATH
2.
go back to reference Arth, C., Klopschitz, M., Reitmayr, G., Schmalstieg, D.: Real-time self-localization from panoramic images on mobile devices. In: Proceedings of IEEE International Symposium on Mixed and Augmented Reality (2011) Arth, C., Klopschitz, M., Reitmayr, G., Schmalstieg, D.: Real-time self-localization from panoramic images on mobile devices. In: Proceedings of IEEE International Symposium on Mixed and Augmented Reality (2011)
3.
go back to reference Barceló, L., Binefa, X., Kender, J.R.: Robust methods and representations for soccer player tracking and collision resolution. In: Proceedings of the International Conference on Image and Video Retrieval (2005) Barceló, L., Binefa, X., Kender, J.R.: Robust methods and representations for soccer player tracking and collision resolution. In: Proceedings of the International Conference on Image and Video Retrieval (2005)
4.
go back to reference Bay, H., Tuytelaars, T., Gool, L.V.: SURF: speeded up robust features. In: Proceedings of the European Conference on Computer Vision (2006) Bay, H., Tuytelaars, T., Gool, L.V.: SURF: speeded up robust features. In: Proceedings of the European Conference on Computer Vision (2006)
5.
go back to reference Bernardin, K., Stiefelhagen, R.: Evaluating multiple object tracking performance: the CLEAR MOT metrics. J. Image Video Process. 2008, 1–10 (2008) Bernardin, K., Stiefelhagen, R.: Evaluating multiple object tracking performance: the CLEAR MOT metrics. J. Image Video Process. 2008, 1–10 (2008)
6.
go back to reference Breitenstein, M., Reichlin, F., Leibe, B., Koller-Meier, E., Van Gool, L.: Online multi-person tracking-by-detection from a single, uncalibrated camera. IEEE Trans. Pattern Anal. Mach. Intell. 33(9), 1820–1833 (2011)CrossRef Breitenstein, M., Reichlin, F., Leibe, B., Koller-Meier, E., Van Gool, L.: Online multi-person tracking-by-detection from a single, uncalibrated camera. IEEE Trans. Pattern Anal. Mach. Intell. 33(9), 1820–1833 (2011)CrossRef
7.
go back to reference Brendel, W., Amer, M., Todorovic, S.: Multiobject tracking as maximum weight independent set. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2011) Brendel, W., Amer, M., Todorovic, S.: Multiobject tracking as maximum weight independent set. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2011)
8.
go back to reference Civera, J., Davison, A.J., Magallon, J.A., Montiel, J.M.M.: Drift-free real-time sequential mosaicing. Int. J. Comput. Vision 81(2), 128–137 (2009)CrossRef Civera, J., Davison, A.J., Magallon, J.A., Montiel, J.M.M.: Drift-free real-time sequential mosaicing. Int. J. Comput. Vision 81(2), 128–137 (2009)CrossRef
9.
go back to reference Collins, R.T., Tsin, Y.: Calibration of an outdoor active camera system. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (1999) Collins, R.T., Tsin, Y.: Calibration of an outdoor active camera system. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (1999)
10.
go back to reference Criminisi, A., Reid, I., Zisserman, A.: A plane measuring device. Image Vis. Comput. 17(8), 625–634 (1999)CrossRef Criminisi, A., Reid, I., Zisserman, A.: A plane measuring device. Image Vis. Comput. 17(8), 625–634 (1999)CrossRef
11.
go back to reference Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2005)
12.
go back to reference Davis, J., Chen, X.: Calibrating pan-tilt cameras in wide-area surveillance networks. In: Proceedings of IEEE International Conference on Computer Vision (2003) Davis, J., Chen, X.: Calibrating pan-tilt cameras in wide-area surveillance networks. In: Proceedings of IEEE International Conference on Computer Vision (2003)
13.
go back to reference Del Bimbo, A., Lisanti, G., Masi, I., Pernici, F.: Device-tagged feature-based localization and mapping of wide areas with a ptz camera. In: Proceedings of CVPR Workshops, Socially Intelligent Surveillance and Monitoring (2010) Del Bimbo, A., Lisanti, G., Masi, I., Pernici, F.: Device-tagged feature-based localization and mapping of wide areas with a ptz camera. In: Proceedings of CVPR Workshops, Socially Intelligent Surveillance and Monitoring (2010)
14.
go back to reference Del Bimbo, A., Lisanti, G., Masi, I., Pernici, F.: Continuous recovery for real time pan tilt zoom localization and mapping. In: Advanced Video and Signal-Based Surveillance (AVSS) (2011) Del Bimbo, A., Lisanti, G., Masi, I., Pernici, F.: Continuous recovery for real time pan tilt zoom localization and mapping. In: Advanced Video and Signal-Based Surveillance (AVSS) (2011)
15.
go back to reference Del Bimbo, A., Lisanti, G., Pernici, F.: Scale invariant 3D multi-person tracking using a base set of bundle adjusted visual landmarks. In: Proceedings of ICCV Workshops, International Workshop on Visual Surveillance (2009) Del Bimbo, A., Lisanti, G., Pernici, F.: Scale invariant 3D multi-person tracking using a base set of bundle adjusted visual landmarks. In: Proceedings of ICCV Workshops, International Workshop on Visual Surveillance (2009)
16.
go back to reference Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)MathSciNetCrossRef Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)MathSciNetCrossRef
17.
18.
go back to reference Hartley, R.: Self-calibration from multiple views with a rotating camera. In: Proceedings of the European Conference on Computer Vision (1994) Hartley, R.: Self-calibration from multiple views with a rotating camera. In: Proceedings of the European Conference on Computer Vision (1994)
19.
go back to reference Hayman, E., Thorhallsson, T., Murray, D.W.: Zoom-invariant tracking using points and lines in affine views—an application of the affine multifocal tensors. In: Proceedings of the International Conference on Computer Vision (1999) Hayman, E., Thorhallsson, T., Murray, D.W.: Zoom-invariant tracking using points and lines in affine views—an application of the affine multifocal tensors. In: Proceedings of the International Conference on Computer Vision (1999)
20.
go back to reference Kang, S., Paik, J.K., Koschan, A., Abidi, B.R., Abidi, M.A.: Real-time video tracking using PTZ cameras. In: Proceedings of International Conference on Quality Control by Artificial Vision (2003) Kang, S., Paik, J.K., Koschan, A., Abidi, B.R., Abidi, M.A.: Real-time video tracking using PTZ cameras. In: Proceedings of International Conference on Quality Control by Artificial Vision (2003)
21.
go back to reference Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the IEEE and ACM International Symposium on Mixed and Augmented Reality (2007) Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the IEEE and ACM International Symposium on Mixed and Augmented Reality (2007)
22.
go back to reference Leibe, B., Leonardis, A., Schiele, B.: Robust object detection with interleaved categorization and segmentation. Int. J. Comput. Vision 77(1), 259–289 (2007) Leibe, B., Leonardis, A., Schiele, B.: Robust object detection with interleaved categorization and segmentation. Int. J. Comput. Vision 77(1), 259–289 (2007)
23.
go back to reference Liebowitz, D., Zisserman, A.: Metric rectification for perspective images of planes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (1998) Liebowitz, D., Zisserman, A.: Metric rectification for perspective images of planes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (1998)
24.
go back to reference Lim, H., Sinha, S., Cohen, M., Uyttendaele, M., Kim, H.J.: Real-time monocular image-based 6-dof localization. Int. J. Robot. Res.: IJRR 34(4–5), 476–4925 (2015)CrossRef Lim, H., Sinha, S., Cohen, M., Uyttendaele, M., Kim, H.J.: Real-time monocular image-based 6-dof localization. Int. J. Robot. Res.: IJRR 34(4–5), 476–4925 (2015)CrossRef
25.
go back to reference Lovegrove, S., Davison, A.J.: Real-time spherical mosaicing using whole image alignment. In: Proceedings of European Conference on Computer Vision (2010) Lovegrove, S., Davison, A.J.: Real-time spherical mosaicing using whole image alignment. In: Proceedings of European Conference on Computer Vision (2010)
26.
go back to reference Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
27.
go back to reference Okuma, K., Taleghani, A., de Freitas, N., Little, J.J., Lowe, D.G.: A boosted particle filter: multitarget detection and tracking. In: Proceedings of the European Conference on Computer Vision (2004) Okuma, K., Taleghani, A., de Freitas, N., Little, J.J., Lowe, D.G.: A boosted particle filter: multitarget detection and tracking. In: Proceedings of the European Conference on Computer Vision (2004)
28.
go back to reference Pernici, F., Del Bimbo, A.: Object tracking by oversampling local features. IEEE Trans. Pattern Anal. Mach. Intell. 36(12), 2538–2551 (2014)CrossRef Pernici, F., Del Bimbo, A.: Object tracking by oversampling local features. IEEE Trans. Pattern Anal. Mach. Intell. 36(12), 2538–2551 (2014)CrossRef
29.
go back to reference Seo, Y., Choi, S., Kim, H., Hong, K.S.: Where are the ball and players? Soccer game analysis with color based tracking and image mosaick. In: Proceedings of the International Conference on Image Analysis and Processing (1997) Seo, Y., Choi, S., Kim, H., Hong, K.S.: Where are the ball and players? Soccer game analysis with color based tracking and image mosaick. In: Proceedings of the International Conference on Image Analysis and Processing (1997)
30.
go back to reference Sinha, S., Pollefeys, M.: Towards calibrating a pan-tilt-zoom cameras network. In: Proceedings of ECCV Workshops, Omnidirectional Vision and Camera Networks (2004) Sinha, S., Pollefeys, M.: Towards calibrating a pan-tilt-zoom cameras network. In: Proceedings of ECCV Workshops, Omnidirectional Vision and Camera Networks (2004)
31.
go back to reference Sinha, S., Pollefeys, M.: Pan-tilt-zoom camera calibration and high-resolution mosaic generation. Comput. Vis. Image Underst. 103(3), 170–183 (2006)CrossRef Sinha, S., Pollefeys, M.: Pan-tilt-zoom camera calibration and high-resolution mosaic generation. Comput. Vis. Image Underst. 103(3), 170–183 (2006)CrossRef
32.
go back to reference Song, D., Goldberg, K.: A minimum variance calibration algorithm for pan-tilt robotic cameras in natural environments. In: IEEE International Conference on Robotics and Automation (2006) Song, D., Goldberg, K.: A minimum variance calibration algorithm for pan-tilt robotic cameras in natural environments. In: IEEE International Conference on Robotics and Automation (2006)
33.
go back to reference Tao, H., Sawhney, H.S., Kumar, R.: Object tracking with bayesian estimation of dynamic layer representations. IEEE Trans. Pattern Anal. Mach. Intell. 24(1), 75–89 (2002)CrossRef Tao, H., Sawhney, H.S., Kumar, R.: Object tracking with bayesian estimation of dynamic layer representations. IEEE Trans. Pattern Anal. Mach. Intell. 24(1), 75–89 (2002)CrossRef
34.
go back to reference Tordoff, B., Murray, D.: Reactive control of zoom while fixating using perspective and affine cameras. IEEE Trans. Pattern Anal. Mach. Intell. 26(1), 98–112 (2004)CrossRef Tordoff, B., Murray, D.: Reactive control of zoom while fixating using perspective and affine cameras. IEEE Trans. Pattern Anal. Mach. Intell. 26(1), 98–112 (2004)CrossRef
35.
go back to reference Varcheie, P., Bilodeau, G.A.: Active people tracking by a PTZ camera in ip surveillance system. In: IEEE International Workshop on Robotic and Sensors Environments (2009) Varcheie, P., Bilodeau, G.A.: Active people tracking by a PTZ camera in ip surveillance system. In: IEEE International Workshop on Robotic and Sensors Environments (2009)
36.
go back to reference Williams, B., Klein, G., Reid, I.: Real-time SLAM relocalisation. In: Proceedings of the IEEE International Conference on Computer Vision (2007) Williams, B., Klein, G., Reid, I.: Real-time SLAM relocalisation. In: Proceedings of the IEEE International Conference on Computer Vision (2007)
37.
go back to reference Wu, Z., Radke, R.: Keeping a pan-tilt-zoom camera calibrated. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1994–2007 (2013) Wu, Z., Radke, R.: Keeping a pan-tilt-zoom camera calibrated. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1994–2007 (2013)
38.
go back to reference Yang, B., Huang, C., Nevatia, R.: Learning affinities and dependencies for multi-target tracking using a CRF model. In: Proceedings of IEEE Conference on Computer Vision and Patter Recognition (2011) Yang, B., Huang, C., Nevatia, R.: Learning affinities and dependencies for multi-target tracking using a CRF model. In: Proceedings of IEEE Conference on Computer Vision and Patter Recognition (2011)
39.
go back to reference Cai, Y.,de Freitas, N., Little, J.: Robust visual tracking for multiple targets. In: Proceedings of the European Conference on Computer Vision (2006) Cai, Y.,de Freitas, N., Little, J.: Robust visual tracking for multiple targets. In: Proceedings of the European Conference on Computer Vision (2006)
Metadata
Title
Continuous localization and mapping of a pan–tilt–zoom camera for wide area tracking
Authors
Giuseppe Lisanti
Iacopo Masi
Federico Pernici
Alberto Del Bimbo
Publication date
01-10-2016
Publisher
Springer Berlin Heidelberg
Published in
Machine Vision and Applications / Issue 7/2016
Print ISSN: 0932-8092
Electronic ISSN: 1432-1769
DOI
https://doi.org/10.1007/s00138-016-0799-x

Other articles of this Issue 7/2016

Machine Vision and Applications 7/2016 Go to the issue

Premium Partner