Skip to main content
Top
Published in: Neural Computing and Applications 11/2018

08-03-2017 | Original Article

Tracking topology structure adaptively with deep neural networks

Authors: Xueying Shi, Guangyong Chen, Pheng Ann Heng, Zhang Yi

Published in: Neural Computing and Applications | Issue 11/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Object tracking still remains challenging in computer vision because of the severe object variation, e.g., deformation, occlusion, and rotation. To handle the object variation and achieve robust object tracking performance, we propose a novel relationship-based tracking algorithm using neural networks in this paper. Compared with existing approaches in the literature, our method assumes the targeted object to be consisted of several parts and considers the evolution of the topology structure among these parts. After training a candidate neural network for predicting the probable areas each part may locate at in the successive frame, we then design a novel collaboration neural network to determine the precise area each part will locate at with account for the topology structure among these individual parts, which is learned from their historical physical locations during online tracking process. Experimental results show that the proposed method outperforms state-of-the-art trackers on a benchmark dataset, yielding the significant improvements in accuracy on high-distorted sequences.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer Society conference on computer vision and pattern recognition (CVPR), San Diego, USA, pp 886–893 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer Society conference on computer vision and pattern recognition (CVPR), San Diego, USA, pp 886–893
2.
go back to reference Tang F, Brennan S, Zhao Q, Tao H (2007) Co-tracking using semi-supervised support vector machines. In: IEEE international conference on computer vision (ICCV), Rio de Janeiro, Brazil, pp 1–8 Tang F, Brennan S, Zhao Q, Tao H (2007) Co-tracking using semi-supervised support vector machines. In: IEEE international conference on computer vision (ICCV), Rio de Janeiro, Brazil, pp 1–8
3.
go back to reference Mei X, Ling H (2011) Robust visual tracking and vehicle classification via sparse representation. IEEE Trans Pattern Anal Mach Intell (PAMI) 33:2259–2272 CrossRef Mei X, Ling H (2011) Robust visual tracking and vehicle classification via sparse representation. IEEE Trans Pattern Anal Mach Intell (PAMI) 33:2259–2272 CrossRef
4.
go back to reference Ross DA, Lim J, Lin R-S, Yang M-H (2008) Incremental learning for robust visual tracking. Int J Comput Vis (IJCV) 77:125–141CrossRef Ross DA, Lim J, Lin R-S, Yang M-H (2008) Incremental learning for robust visual tracking. Int J Comput Vis (IJCV) 77:125–141CrossRef
5.
go back to reference Zhang T, Ghanem B, Liu S, Ahuja N (2012) Robust visual tracking via multi-task sparse learning. In: IEEE conference on computer vision and pattern recognition (CVPR), Rhode Island, USA, pp 2042–2049 Zhang T, Ghanem B, Liu S, Ahuja N (2012) Robust visual tracking via multi-task sparse learning. In: IEEE conference on computer vision and pattern recognition (CVPR), Rhode Island, USA, pp 2042–2049
6.
go back to reference Zhong W, Lu H, Yang M-H (2014) Robust object tracking via sparse collaborative appearance model. IEEE Trans Image Process (TIP) 23:2356–2368MathSciNetCrossRef Zhong W, Lu H, Yang M-H (2014) Robust object tracking via sparse collaborative appearance model. IEEE Trans Image Process (TIP) 23:2356–2368MathSciNetCrossRef
7.
go back to reference Mei X, Ling H, Wu Y, Blasch E, Bai L (2011) Minimum error bounded efficient \(\ell \) 1 tracker with occlusion detection. In: IEEE computer society conference on computer vision and pattern recognition (CVPR). Colorado Springs, USA, pp 1257–1264 Mei X, Ling H, Wu Y, Blasch E, Bai L (2011) Minimum error bounded efficient \(\ell \) 1 tracker with occlusion detection. In: IEEE computer society conference on computer vision and pattern recognition (CVPR). Colorado Springs, USA, pp 1257–1264
8.
go back to reference Wang N, Li S, Gupta A, Yeung D-Y (2015) Transferring rich feature hierarchies for robust visual tracking. arXiv preprint arXiv:1501.04587 Wang N, Li S, Gupta A, Yeung D-Y (2015) Transferring rich feature hierarchies for robust visual tracking. arXiv preprint arXiv:​1501.​04587
9.
go back to reference Wang L, Ouyang W, Wang X, Lu H (2015) Visual tracking with fully convolutional networks. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3119–3127 Wang L, Ouyang W, Wang X, Lu H (2015) Visual tracking with fully convolutional networks. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3119–3127
10.
go back to reference Ma C, Huang J-B, Yang X, Yang M-H (2015) Hierarchical convolutional features for visual tracking. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3074–3082 Ma C, Huang J-B, Yang X, Yang M-H (2015) Hierarchical convolutional features for visual tracking. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3074–3082
11.
go back to reference Adam A, Rivlin E, Shimshoni I (2006) Robust fragments-based tracking using the integral histogram. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), New York, USA, pp 798–805 Adam A, Rivlin E, Shimshoni I (2006) Robust fragments-based tracking using the integral histogram. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), New York, USA, pp 798–805
12.
go back to reference Kwon J, Lee KM (2009) Tracking of a non-rigid object via patch-based dynamic appearance modeling and adaptive basin hopping Monte Carlo sampling. In: IEEE conference on computer vision and pattern recognition (CVPR), Miami, USA, pp 1208–1215 Kwon J, Lee KM (2009) Tracking of a non-rigid object via patch-based dynamic appearance modeling and adaptive basin hopping Monte Carlo sampling. In: IEEE conference on computer vision and pattern recognition (CVPR), Miami, USA, pp 1208–1215
13.
go back to reference Yang S, Luo P, Loy C-C, Tang X (2015) From facial parts responses to face detection: a deep learning approach. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3676–3684 Yang S, Luo P, Loy C-C, Tang X (2015) From facial parts responses to face detection: a deep learning approach. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3676–3684
14.
go back to reference Wang N, Shi J, Yeung D-Y, Jia J (2015) Understanding and diagnosing visual tracking systems. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3101–3109 Wang N, Shi J, Yeung D-Y, Jia J (2015) Understanding and diagnosing visual tracking systems. In: IEEE international conference on computer vision (ICCV), Santiago, Chile, pp 3101–3109
15.
go back to reference Wu Y, Lim J, Yang M-H (2015) Object tracking benchmark. IEEE Trans Pattern Anal Mach Intell (PAMI) 37:1834–1848CrossRef Wu Y, Lim J, Yang M-H (2015) Object tracking benchmark. IEEE Trans Pattern Anal Mach Intell (PAMI) 37:1834–1848CrossRef
16.
go back to reference Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR), Columbus, USA, pp 580–587 Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR), Columbus, USA, pp 580–587
17.
go back to reference Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2014) Overfeat: integrated recognition, localization and detection using convolutional networks. In: International conference on learning representations (ICLR), Banff, Canada Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2014) Overfeat: integrated recognition, localization and detection using convolutional networks. In: International conference on learning representations (ICLR), Banff, Canada
18.
go back to reference Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: IEEE conference on computer vision and pattern recognition (CVPR), Boston, USA, pp 1–9 Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: IEEE conference on computer vision and pattern recognition (CVPR), Boston, USA, pp 1–9
19.
go back to reference Ouyang W, Luo P, Zeng X, Qiu S, Tian Y, Li H, Yang S, Wang Z, Xiong Y, Qian C, Zhu Z, Wang R, Loy C-C, Wang X, Tang X (2014) Deepid-net: multi-stage and deformable deep convolutional neural networks for object detection. arXiv preprint arXiv:1409.3505 Ouyang W, Luo P, Zeng X, Qiu S, Tian Y, Li H, Yang S, Wang Z, Xiong Y, Qian C, Zhu Z, Wang R, Loy C-C, Wang X, Tang X (2014) Deepid-net: multi-stage and deformable deep convolutional neural networks for object detection. arXiv preprint arXiv:​1409.​3505
20.
go back to reference Lin M, Chen Q, Yan S (2014) Network in network. In: International conference on learning representations (ICLR), Banff, Canada Lin M, Chen Q, Yan S (2014) Network in network. In: International conference on learning representations (ICLR), Banff, Canada
21.
go back to reference Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations (ICLR), Banff, Canada Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations (ICLR), Banff, Canada
22.
go back to reference He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European conference on computer vision (ECCV), Zurich, Switzerland, pp 346–361 He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European conference on computer vision (ECCV), Zurich, Switzerland, pp 346–361
23.
go back to reference Wang N, Yeung D-Y (2013) Learning a deep compact image representation for visual tracking. In: Advances in Neural Information Processing Systems 26 (NIPS 2013), Lake Tahoe, USA, pp 809–817 Wang N, Yeung D-Y (2013) Learning a deep compact image representation for visual tracking. In: Advances in Neural Information Processing Systems 26 (NIPS 2013), Lake Tahoe, USA, pp 809–817
24.
go back to reference Torralba A, Fergus R, Freeman WT (2008) 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell (PAMI) 30:1958–1970CrossRef Torralba A, Fergus R, Freeman WT (2008) 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell (PAMI) 30:1958–1970CrossRef
25.
go back to reference Jia X, Lu H, Yang M-H (2012) Visual tracking via adaptive structural local sparse appearance model. In: IEEE conference on computer vision and pattern recognition (CVPR), Rhode Island, USA, pp 1822–1829 Jia X, Lu H, Yang M-H (2012) Visual tracking via adaptive structural local sparse appearance model. In: IEEE conference on computer vision and pattern recognition (CVPR), Rhode Island, USA, pp 1822–1829
26.
go back to reference Fischer A, Igel C (2012) An introduction to restricted Boltzmann machines. In: Alvarez L, Mejail ME, Gomez LE, Jacobo JE (eds) 17th Iberoamerican Congress, CIARP 2012, Buenos Aires, Argentina, pp 14–36CrossRef Fischer A, Igel C (2012) An introduction to restricted Boltzmann machines. In: Alvarez L, Mejail ME, Gomez LE, Jacobo JE (eds) 17th Iberoamerican Congress, CIARP 2012, Buenos Aires, Argentina, pp 14–36CrossRef
27.
go back to reference Arulampalam MS, Maskell S, Gordon N, Clapp T (2002) A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Process 50:174–188CrossRef Arulampalam MS, Maskell S, Gordon N, Clapp T (2002) A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Process 50:174–188CrossRef
28.
go back to reference Hare S, Saffari A, Torr PHS (2011) Struck: Structured output tracking with kernels. In: IEEE international conference on computer vision (ICCV), Rhode Island, USA, pp 263–270 Hare S, Saffari A, Torr PHS (2011) Struck: Structured output tracking with kernels. In: IEEE international conference on computer vision (ICCV), Rhode Island, USA, pp 263–270
29.
go back to reference Henriques JF, Caseiro R, Martins P, Batista J (2012) Exploiting the circulant structure of tracking-by-detection with kernels. In: European conference on computer vision (ECCV), Firenze, Italy, pp 702–715 Henriques JF, Caseiro R, Martins P, Batista J (2012) Exploiting the circulant structure of tracking-by-detection with kernels. In: European conference on computer vision (ECCV), Firenze, Italy, pp 702–715
30.
go back to reference Zhang K, Zhang L, Yang M-H (2012) Real-time compressive tracking. In: European conference on computer vision (ECCV), Firenze, Italy, pp 864–877 Zhang K, Zhang L, Yang M-H (2012) Real-time compressive tracking. In: European conference on computer vision (ECCV), Firenze, Italy, pp 864–877
31.
32.
go back to reference Dinh TB, Vo N, Medioni G (2011) Context tracker: exploring supporters and distracters in unconstrained environments. In: IEEE conference on computer vision and pattern recognition (CVPR), Colorado, USA, pp 1177–1184 Dinh TB, Vo N, Medioni G (2011) Context tracker: exploring supporters and distracters in unconstrained environments. In: IEEE conference on computer vision and pattern recognition (CVPR), Colorado, USA, pp 1177–1184
33.
go back to reference Sevilla-Lara L, Learned-Miller E (2012) Distribution fields for tracking. In: IEEE conference on computer vision and pattern recognition (CVPR), Rhode Island, USA, pp 1910–1917 Sevilla-Lara L, Learned-Miller E (2012) Distribution fields for tracking. In: IEEE conference on computer vision and pattern recognition (CVPR), Rhode Island, USA, pp 1910–1917
Metadata
Title
Tracking topology structure adaptively with deep neural networks
Authors
Xueying Shi
Guangyong Chen
Pheng Ann Heng
Zhang Yi
Publication date
08-03-2017
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 11/2018
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-017-2906-y

Other articles of this Issue 11/2018

Neural Computing and Applications 11/2018 Go to the issue

Premium Partner