Skip to main content
Top

2018 | OriginalPaper | Chapter

11. Simultaneous Visual Recognition and Tracking Based on Joint Decision and Estimation

Authors : Zhongliang Jing, Han Pan, Yuankai Li, Peng Dong

Published in: Non-Cooperative Target Tracking, Fusion and Control

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Visual target tracking and recognition have been increasingly important in video surveillance. Conventional works deal with tracking and recognition as separate steps, whereas tracking and recognition are closely interrelated and can help each other potentially and significantly. To tackle this problem, based on the joint decision and estimation (JDE) model which guarantees the general decision (recognition) and estimation (tracking) arriving at the global optimization, a simultaneous visual recognition and tracking method is provided. Besides, the structured sparse representation (SSR) model shows great efficiency and robustness in exploiting both holistic and local information of the target appearance. We show that constructing the appearance model with SSR can improve the performance of the proposed algorithm. Then, the contribution of each test candidate is considered into the learning procedure by a kernel function. Furthermore, the new joint weights of the kernel function provide flexibility with appearance changes and thus robustness to the dynamic scene. The experimental results demonstrate that the proposed method performs well in terms of accuracy and robustness.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Babenko B, Yang MH, Belongie S (2011) Robust object tracking with online multiple instance learning. IEEE Trans Pattern Anal Mach Intell 33(8):1619–1632CrossRef Babenko B, Yang MH, Belongie S (2011) Robust object tracking with online multiple instance learning. IEEE Trans Pattern Anal Mach Intell 33(8):1619–1632CrossRef
2.
go back to reference Bai T, Li Y (2014) Robust visual tracking using flexible structured sparse representation. IEEE Trans Ind Inf 10(1):538–547CrossRef Bai T, Li Y (2014) Robust visual tracking using flexible structured sparse representation. IEEE Trans Ind Inf 10(1):538–547CrossRef
3.
go back to reference Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef
4.
go back to reference Bhaskar H (2012) Integrated human target detection, identification and tracking for surveillance applications. In: 6th IEEE international conference intelligent systems (IS). IEEE, New York, pp 467–475 Bhaskar H (2012) Integrated human target detection, identification and tracking for surveillance applications. In: 6th IEEE international conference intelligent systems (IS). IEEE, New York, pp 467–475
6.
go back to reference Conaire CÓ, O’Connor NE, Smeaton A (2008) Thermo-visual feature fusion for object tracking using multiple spatiogram trackers. Mach Vis Appl 19(5–6):483–494CrossRef Conaire CÓ, O’Connor NE, Smeaton A (2008) Thermo-visual feature fusion for object tracking using multiple spatiogram trackers. Mach Vis Appl 19(5–6):483–494CrossRef
7.
go back to reference Eldar YC, Kuppinger P, Bolcskei H (2010) Block-sparse signals: uncertainty relations and efficient recovery. IEEE Trans Signal Process 58(6):3042–3054MathSciNetCrossRef Eldar YC, Kuppinger P, Bolcskei H (2010) Block-sparse signals: uncertainty relations and efficient recovery. IEEE Trans Signal Process 58(6):3042–3054MathSciNetCrossRef
8.
go back to reference Fan J, Shen X, Wu Y (2013) What are we tracking: a unified approach of tracking and recognition. IEEE Trans Image Process 22(2):549–560MathSciNetCrossRef Fan J, Shen X, Wu Y (2013) What are we tracking: a unified approach of tracking and recognition. IEEE Trans Image Process 22(2):549–560MathSciNetCrossRef
9.
go back to reference Gong J, Fan G, Yu L, Havlicek JP, Chen D (2012) Joint view-identity manifold for target tracking and recognition. In: 19th IEEE international conference on image processing (ICIP). IEEE, New York, pp 1357–1360 Gong J, Fan G, Yu L, Havlicek JP, Chen D (2012) Joint view-identity manifold for target tracking and recognition. In: 19th IEEE international conference on image processing (ICIP). IEEE, New York, pp 1357–1360
10.
go back to reference Hare S, Golodetz S, Saffari A, Vineet V, Cheng MM, Hicks SL, Torr PH (2016) Struck: structured output tracking with kernels. IEEE Trans Pattern Anal Mach Intell 38(10):2096–2109CrossRef Hare S, Golodetz S, Saffari A, Vineet V, Cheng MM, Hicks SL, Torr PH (2016) Struck: structured output tracking with kernels. IEEE Trans Pattern Anal Mach Intell 38(10):2096–2109CrossRef
11.
go back to reference Jiang N, Liu W, Wu Y (2011) Learning adaptive metric for robust visual tracking. IEEE Trans Image Process 20(8):2288–2300MathSciNetCrossRef Jiang N, Liu W, Wu Y (2011) Learning adaptive metric for robust visual tracking. IEEE Trans Image Process 20(8):2288–2300MathSciNetCrossRef
12.
go back to reference Kim M, Kumar S, Pavlovic V, Rowley H (2008) Face tracking and recognition with visual constraints in real-world videos. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 1–8 Kim M, Kumar S, Pavlovic V, Rowley H (2008) Face tracking and recognition with visual constraints in real-world videos. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 1–8
13.
go back to reference Kwon J, Lee KM (2010) Visual tracking decomposition. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 1269–1276 Kwon J, Lee KM (2010) Visual tracking decomposition. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 1269–1276
14.
go back to reference Lee KC, Kriegman D (2005) Online learning of probabilistic appearance manifolds for video-based recognition and tracking. In: IEEE Computer Society conference on computer vision and pattern recognition (CVPR), vol 1. IEEE, New York, pp 852–859 Lee KC, Kriegman D (2005) Online learning of probabilistic appearance manifolds for video-based recognition and tracking. In: IEEE Computer Society conference on computer vision and pattern recognition (CVPR), vol 1. IEEE, New York, pp 852–859
15.
go back to reference Lee KC, Ho J, Yang MH, Kriegman D (2005) Visual tracking and recognition using probabilistic appearance manifolds. Comput Vis Image Underst 99(3), 303–331CrossRef Lee KC, Ho J, Yang MH, Kriegman D (2005) Visual tracking and recognition using probabilistic appearance manifolds. Comput Vis Image Underst 99(3), 303–331CrossRef
16.
go back to reference Li XR (2007) Optimal Bayes joint decision and estimation. In: 10th International conference on information fusion. IEEE, New York, pp 1–8 Li XR (2007) Optimal Bayes joint decision and estimation. In: 10th International conference on information fusion. IEEE, New York, pp 1–8
17.
go back to reference Li H, Shen C, Shi Q (2011) Real-time visual tracking using compressive sensing. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 1305–1312 Li H, Shen C, Shi Q (2011) Real-time visual tracking using compressive sensing. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 1305–1312
18.
go back to reference Liu Y, Li XR (2011) Recursive joint decision and estimation based on generalized Bayes risk. In: Proceedings of the 14th international conference on information fusion. IEEE, New York, pp 1–8 Liu Y, Li XR (2011) Recursive joint decision and estimation based on generalized Bayes risk. In: Proceedings of the 14th international conference on information fusion. IEEE, New York, pp 1–8
19.
go back to reference Liwicki S, Zafeiriou S, Tzimiropoulos G, Pantic M (2012) Efficient online subspace learning with an indefinite kernel for visual tracking and recognition. IEEE Trans Neural Netw Learn Syst 23(10):1624–1636CrossRef Liwicki S, Zafeiriou S, Tzimiropoulos G, Pantic M (2012) Efficient online subspace learning with an indefinite kernel for visual tracking and recognition. IEEE Trans Neural Netw Learn Syst 23(10):1624–1636CrossRef
20.
go back to reference Lu WL, Ting JA, Little JJ, Murphy KP (2013) Learning to track and identify players from broadcast sports videos. IEEE Trans Pattern Anal Mach Intell 35(7):1704–1716CrossRef Lu WL, Ting JA, Little JJ, Murphy KP (2013) Learning to track and identify players from broadcast sports videos. IEEE Trans Pattern Anal Mach Intell 35(7):1704–1716CrossRef
21.
go back to reference Luo M, Sun F, Liu H (2013) Hierarchical structured sparse representation for t–s fuzzy systems identification. IEEE Trans Fuzzy Syst 21(6):1032–1043CrossRef Luo M, Sun F, Liu H (2013) Hierarchical structured sparse representation for t–s fuzzy systems identification. IEEE Trans Fuzzy Syst 21(6):1032–1043CrossRef
22.
go back to reference Mei X, Ling H (2011) Robust visual tracking and vehicle classification via sparse representation. IEEE Trans Pattern Anal Mach Intell 33(11):2259–2272CrossRef Mei X, Ling H (2011) Robust visual tracking and vehicle classification via sparse representation. IEEE Trans Pattern Anal Mach Intell 33(11):2259–2272CrossRef
23.
go back to reference Mei X, Zhou SK, Wu H (2006) Integrated detection, tracking and recognition for ir video-based vehicle classification. In: IEEE international conference on acoustics, speech and signal processing, vol 5. IEEE, New York, pp 745–748 Mei X, Zhou SK, Wu H (2006) Integrated detection, tracking and recognition for ir video-based vehicle classification. In: IEEE international conference on acoustics, speech and signal processing, vol 5. IEEE, New York, pp 745–748
24.
go back to reference Pentland A, Moghaddam B, Starner T et al (1994) View-based and modular eigenspaces for face recognition. In: IEEE Computer Society conference on computer vision and pattern recognition (CVPR), vol 94, pp 84–91 Pentland A, Moghaddam B, Starner T et al (1994) View-based and modular eigenspaces for face recognition. In: IEEE Computer Society conference on computer vision and pattern recognition (CVPR), vol 94, pp 84–91
25.
go back to reference Pinson R, Howard R, Heaton A (2008) Orbital express advanced video guidance sensor: ground testing, flight results and comparisons. In: AIAA guidance, navigation and control conference and exhibit, p 7318 Pinson R, Howard R, Heaton A (2008) Orbital express advanced video guidance sensor: ground testing, flight results and comparisons. In: AIAA guidance, navigation and control conference and exhibit, p 7318
26.
27.
go back to reference Ross DA, Lim J, Lin RS, Yang MH (2008) Incremental learning for robust visual tracking. Int J Comput Vis 77(1):125–141CrossRef Ross DA, Lim J, Lin RS, Yang MH (2008) Incremental learning for robust visual tracking. Int J Comput Vis 77(1):125–141CrossRef
28.
go back to reference Scholkopf B, Sung KK, Burges CJ, Girosi F, Niyogi P, Poggio T, Vapnik V (1997) Comparing support vector machines with gaussian kernels to radial basis function classifiers. IEEE Trans Signal Process 45(11):2758–2765CrossRef Scholkopf B, Sung KK, Burges CJ, Girosi F, Niyogi P, Poggio T, Vapnik V (1997) Comparing support vector machines with gaussian kernels to radial basis function classifiers. IEEE Trans Signal Process 45(11):2758–2765CrossRef
29.
go back to reference Sheng G, Yang W, Yu L, Sun H (2012) Cluster structured sparse representation for high resolution satellite image classification. In: IEEE 11th international conference on signal processing (ICSP), vol 1. IEEE, New York, pp 693–696 Sheng G, Yang W, Yu L, Sun H (2012) Cluster structured sparse representation for high resolution satellite image classification. In: IEEE 11th international conference on signal processing (ICSP), vol 1. IEEE, New York, pp 693–696
30.
go back to reference Tang C, Ou Y, Jiang G, Xie Q, Xu Y (2012) Hand tracking and pose recognition via depth and color information. In: IEEE international conference on robotics and biomimetics (ROBIO). IEEE, New York, pp 1104–1109 Tang C, Ou Y, Jiang G, Xie Q, Xu Y (2012) Hand tracking and pose recognition via depth and color information. In: IEEE international conference on robotics and biomimetics (ROBIO). IEEE, New York, pp 1104–1109
31.
go back to reference Tzimiropoulos G, Zafeiriou S, Pantic M (2011) Sparse representations of image gradient orientations for visual recognition and tracking. In: IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW). IEEE, New York, pp 26–33 Tzimiropoulos G, Zafeiriou S, Pantic M (2011) Sparse representations of image gradient orientations for visual recognition and tracking. In: IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW). IEEE, New York, pp 26–33
32.
go back to reference Wang C, Wang Y, Zhang Z, Wang Y (2013) Face tracking and recognition via incremental local sparse representation. In: Seventh international conference on image and graphics (ICIG). IEEE, New York, pp 493–498 Wang C, Wang Y, Zhang Z, Wang Y (2013) Face tracking and recognition via incremental local sparse representation. In: Seventh international conference on image and graphics (ICIG). IEEE, New York, pp 493–498
33.
go back to reference Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef
34.
go back to reference Yamamoto T, Kataoka H, Hayashi M, Aoki Y, Oshima K, Tanabiki M (2013) Multiple players tracking and identification using group detection and player number recognition in sports video. In: 39th Annual conference of the IEEE industrial electronics society (ECON). IEEE, New York, pp 2442–2446 Yamamoto T, Kataoka H, Hayashi M, Aoki Y, Oshima K, Tanabiki M (2013) Multiple players tracking and identification using group detection and player number recognition in sports video. In: 39th Annual conference of the IEEE industrial electronics society (ECON). IEEE, New York, pp 2442–2446
35.
go back to reference Yun X, Zhongliang J (2016) Kernel joint visual tracking and recognition based on structured sparse representation. Neurocomputing 193:181–192CrossRef Yun X, Zhongliang J (2016) Kernel joint visual tracking and recognition based on structured sparse representation. Neurocomputing 193:181–192CrossRef
36.
go back to reference Zhang CL (2013) Research on visual tracking methods based on joint decision from multiple regions. Ph.D. thesis, Shanghai Jiao Tong University Zhang CL (2013) Research on visual tracking methods based on joint decision from multiple regions. Ph.D. thesis, Shanghai Jiao Tong University
37.
go back to reference Zhang T, Ghanem B, Liu S, Ahuja N (2012) Robust visual tracking via multi-task sparse learning. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 2042–2049 Zhang T, Ghanem B, Liu S, Ahuja N (2012) Robust visual tracking via multi-task sparse learning. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, New York, pp 2042–2049
38.
go back to reference Zhang K, Zhang L, Yang MH (2013) Real-time object tracking via online discriminative feature selection. IEEE Trans. Image Process 22(12):4664–4677MathSciNetCrossRef Zhang K, Zhang L, Yang MH (2013) Real-time object tracking via online discriminative feature selection. IEEE Trans. Image Process 22(12):4664–4677MathSciNetCrossRef
39.
go back to reference Zhang K, Zhang L, Yang MH (2014) Fast compressive tracking. IEEE Trans Pattern Anal Mach Intell 36(10):2002–2015CrossRef Zhang K, Zhang L, Yang MH (2014) Fast compressive tracking. IEEE Trans Pattern Anal Mach Intell 36(10):2002–2015CrossRef
40.
go back to reference Zhang K, Zhang L, Liu Q, Zhang D, Yang MH (2014) Fast visual tracking via dense spatio-temporal context learning. In: European conference on computer vision (ECCV). Springer, New York, pp 127–141 Zhang K, Zhang L, Liu Q, Zhang D, Yang MH (2014) Fast visual tracking via dense spatio-temporal context learning. In: European conference on computer vision (ECCV). Springer, New York, pp 127–141
41.
go back to reference Zhou SK, Chellappa R, Moghaddam B (2004) Visual tracking and recognition using appearance-adaptive models in particle filters. IEEE Trans Image Process 13(11):1491–1506CrossRef Zhou SK, Chellappa R, Moghaddam B (2004) Visual tracking and recognition using appearance-adaptive models in particle filters. IEEE Trans Image Process 13(11):1491–1506CrossRef
Metadata
Title
Simultaneous Visual Recognition and Tracking Based on Joint Decision and Estimation
Authors
Zhongliang Jing
Han Pan
Yuankai Li
Peng Dong
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-90716-1_11

Premium Partner