Skip to main content
Erschienen in: Neural Processing Letters 3/2018

23.10.2017

Deep Learning and Preference Learning for Object Tracking: A Combined Approach

verfasst von: Shuchao Pang, Juan José del Coz, Zhezhou Yu, Oscar Luaces, Jorge Díez

Erschienen in: Neural Processing Letters | Ausgabe 3/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Object tracking is one of the most important processes for object recognition in the field of computer vision. The aim is to find accurately a target object in every frame of a video sequence. In this paper we propose a combination technique of two algorithms well-known among machine learning practitioners. Firstly, we propose a deep learning approach to automatically extract the features that will be used to represent the original images. Deep learning has been successfully applied in different computer vision applications. Secondly, object tracking can be seen as a ranking problem, since the regions of an image can be ranked according to their level of overlapping with the target object (ground truth in each video frame). During object tracking, the target position and size can change, so the algorithms have to propose several candidate regions in which the target can be found. We propose to use a preference learning approach to build a ranking function which will be used to select the bounding box that ranks higher, i.e., that will likely enclose the target object. The experimental results obtained by our method, called \( DPL ^{2}\) (Deep and Preference Learning), are competitive with respect to other algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Babenko B, Yang MH, Belongie S (2009) Visual tracking with online multiple instance learning. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’09), pp 983–990 Babenko B, Yang MH, Belongie S (2009) Visual tracking with online multiple instance learning. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’09), pp 983–990
2.
Zurück zum Zitat Bahamonde A, Bayón GF, Díez J, Quevedo JR, Luaces O, del Coz JJ, Alonso J, Goyache F (2004) Feature subset selection for learning preferences: a case study. In: Proceedings of ICML’04. ACM Bahamonde A, Bayón GF, Díez J, Quevedo JR, Luaces O, del Coz JJ, Alonso J, Goyache F (2004) Feature subset selection for learning preferences: a case study. In: Proceedings of ICML’04. ACM
3.
Zurück zum Zitat Bai Y, Tang M (2012) Robust tracking via weakly supervised ranking svm. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp 1854–1861 Bai Y, Tang M (2012) Robust tracking via weakly supervised ranking svm. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp 1854–1861
4.
Zurück zum Zitat Bao C, Wu Y, Ling H, Ji H (2012) Real time robust l1 tracker using accelerated proximal gradient approach. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 1830–1837 Bao C, Wu Y, Ling H, Ji H (2012) Real time robust l1 tracker using accelerated proximal gradient approach. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 1830–1837
5.
Zurück zum Zitat Comaniciu D, Ramesh V, Meer P (2003) Kernel-based object tracking. IEEE Trans Pattern Anal Mach Intell 25(5):564–577CrossRef Comaniciu D, Ramesh V, Meer P (2003) Kernel-based object tracking. IEEE Trans Pattern Anal Mach Intell 25(5):564–577CrossRef
6.
Zurück zum Zitat Dai P, Liu K, Xie Y, Li C (2014) Online co-training ranking svm for visual tracking. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp 6568–6572 Dai P, Liu K, Xie Y, Li C (2014) Online co-training ranking svm for visual tracking. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp 6568–6572
7.
Zurück zum Zitat Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH
8.
Zurück zum Zitat Dinh TB, Vo N, Medioni G (2011) Context tracker: exploring supporters and distracters in unconstrained environments. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’11), pp 1177–1184 Dinh TB, Vo N, Medioni G (2011) Context tracker: exploring supporters and distracters in unconstrained environments. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’11), pp 1177–1184
9.
Zurück zum Zitat Hare S, Saffari A, Torr PH (2011) Struck: structured output tracking with kernels. In: IEEE International Conference on Computer Vision (ICCV’11), pp 263–270 Hare S, Saffari A, Torr PH (2011) Struck: structured output tracking with kernels. In: IEEE International Conference on Computer Vision (ICCV’11), pp 263–270
10.
Zurück zum Zitat Henriques JF, Caseiro R, Martins P, Batista J (2012) Exploiting the circulant structure of tracking-by-detection with kernels. In: Computer Vision—ECCV 2012. Springer, Berlin, pp 702–715 Henriques JF, Caseiro R, Martins P, Batista J (2012) Exploiting the circulant structure of tracking-by-detection with kernels. In: Computer Vision—ECCV 2012. Springer, Berlin, pp 702–715
11.
Zurück zum Zitat Herbrich R, Graepel T, Obermayer K (1999) Support vector learning for ordinal regression. In: International Conference on Artificial Neural Networks, pp 97–102 Herbrich R, Graepel T, Obermayer K (1999) Support vector learning for ordinal regression. In: International Conference on Artificial Neural Networks, pp 97–102
12.
Zurück zum Zitat Jepson AD, Fleet DJ, El-Maraghi TF (2003) Robust online appearance models for visual tracking. IEEE Trans Pattern Anal Mach Intell 25(10):1296–1311CrossRef Jepson AD, Fleet DJ, El-Maraghi TF (2003) Robust online appearance models for visual tracking. IEEE Trans Pattern Anal Mach Intell 25(10):1296–1311CrossRef
13.
Zurück zum Zitat Jia X, Lu H, Yang MH (2012) Visual tracking via adaptive structural local sparse appearance model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 1822–1829 Jia X, Lu H, Yang MH (2012) Visual tracking via adaptive structural local sparse appearance model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 1822–1829
14.
Zurück zum Zitat Joachims T (2002) Optimizing search engines using clickthrough data. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’02, pp 133–142 Joachims T (2002) Optimizing search engines using clickthrough data. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’02, pp 133–142
15.
Zurück zum Zitat Kalal Z, Matas J, Mikolajczyk K (2010) Pn learning: bootstrapping binary classifiers by structural constraints. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’10), pp 49–56 Kalal Z, Matas J, Mikolajczyk K (2010) Pn learning: bootstrapping binary classifiers by structural constraints. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’10), pp 49–56
16.
Zurück zum Zitat Kwon J, Lee KM (2010) Visual tracking decomposition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’10), pp 1269–1276 Kwon J, Lee KM (2010) Visual tracking decomposition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’10), pp 1269–1276
17.
Zurück zum Zitat Kwon J, Lee KM (2011) Tracking by sampling trackers. In: IEEE International Conference on Computer Vision (ICCV’11), pp 1195–1202 Kwon J, Lee KM (2011) Tracking by sampling trackers. In: IEEE International Conference on Computer Vision (ICCV’11), pp 1195–1202
18.
Zurück zum Zitat Lawrence S, Giles CL, Tsoi AC, Back AD (1997) Face recognition: a convolutional neural-network approach. IEEE Trans Neural Netw 8(1):98–113CrossRef Lawrence S, Giles CL, Tsoi AC, Back AD (1997) Face recognition: a convolutional neural-network approach. IEEE Trans Neural Netw 8(1):98–113CrossRef
19.
Zurück zum Zitat LeCun Y, Bengio Y (1995) Convolutional networks for images, speech, and time series. Handb Brain Theory Neural Netw 3361(10) LeCun Y, Bengio Y (1995) Convolutional networks for images, speech, and time series. Handb Brain Theory Neural Netw 3361(10)
20.
Zurück zum Zitat LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444CrossRef LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444CrossRef
21.
Zurück zum Zitat Li X, Hu W, Shen C, Zhang Z, Dick A, Hengel AVD (2013) A survey of appearance models in visual object tracking. ACM Trans Intell Syst Technol (TIST) 4(4):58 Li X, Hu W, Shen C, Zhang Z, Dick A, Hengel AVD (2013) A survey of appearance models in visual object tracking. ACM Trans Intell Syst Technol (TIST) 4(4):58
22.
Zurück zum Zitat Lin TY, Cui Y, Belongie S, Hays J, Tech C (2015) Learning deep representations for ground-to-aerial geolocalization. In: Proceedings of the IEEE CVPR’15, pp 5007–5015 Lin TY, Cui Y, Belongie S, Hays J, Tech C (2015) Learning deep representations for ground-to-aerial geolocalization. In: Proceedings of the IEEE CVPR’15, pp 5007–5015
23.
Zurück zum Zitat Liu B, Huang J, Yang L, Kulikowsk C (2011) Robust tracking using local sparse appearance model and k-selection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’11), pp 1313–1320 Liu B, Huang J, Yang L, Kulikowsk C (2011) Robust tracking using local sparse appearance model and k-selection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’11), pp 1313–1320
24.
Zurück zum Zitat Luque-Baena RM, Ortiz-de Lazcano-Lobato JM, López-Rubio E, Domínguez E, Palomo EJ (2013) A competitive neural network for multiple object tracking in video sequence analysis. Neural Process Lett 37(1):47–67CrossRef Luque-Baena RM, Ortiz-de Lazcano-Lobato JM, López-Rubio E, Domínguez E, Palomo EJ (2013) A competitive neural network for multiple object tracking in video sequence analysis. Neural Process Lett 37(1):47–67CrossRef
25.
Zurück zum Zitat Pang S, del Coz JJ, Yu Z, Luaces O, Díez J (2016) Combining deep learning and preference learning for object tracking. In: Hirose A, Ozawa S, Doya K, Ikeda K, Lee M, Liu D (eds) Neural Information Processing: 23rd International Conference, ICONIP 2016, Kyoto, Japan, October 16–21, 2016, Proceedings, Part III, pp 70–77. Springer, Berlin Pang S, del Coz JJ, Yu Z, Luaces O, Díez J (2016) Combining deep learning and preference learning for object tracking. In: Hirose A, Ozawa S, Doya K, Ikeda K, Lee M, Liu D (eds) Neural Information Processing: 23rd International Conference, ICONIP 2016, Kyoto, Japan, October 16–21, 2016, Proceedings, Part III, pp 70–77. Springer, Berlin
26.
Zurück zum Zitat Pang S, Yu Z (2015) Face recognition: a novel deep learning approach. J Opt Technol 82(4):237–245CrossRef Pang S, Yu Z (2015) Face recognition: a novel deep learning approach. J Opt Technol 82(4):237–245CrossRef
27.
Zurück zum Zitat Ross DA, Lim J, Lin RS, Yang MH (2008) Incremental learning for robust visual tracking. Int J Comput Vis 77(1–3):125–141CrossRef Ross DA, Lim J, Lin RS, Yang MH (2008) Incremental learning for robust visual tracking. Int J Comput Vis 77(1–3):125–141CrossRef
28.
Zurück zum Zitat Torralba A, Fergus R, Freeman WT (2008) 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell 30(11):1958–1970CrossRef Torralba A, Fergus R, Freeman WT (2008) 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell 30(11):1958–1970CrossRef
29.
Zurück zum Zitat Vapnik V (1998) Statistical learning theory. Wiley, New York, NYMATH Vapnik V (1998) Statistical learning theory. Wiley, New York, NYMATH
30.
Zurück zum Zitat Wang N, Yeung DY (2013) Learning a deep compact image representation for visual tracking. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems. Neural Information Processing Systems Foundation Inc, Lake Tahoe, pp 809–817 Wang N, Yeung DY (2013) Learning a deep compact image representation for visual tracking. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems. Neural Information Processing Systems Foundation Inc, Lake Tahoe, pp 809–817
31.
Zurück zum Zitat Wu Y, Lim J, Yang M (2015) Object tracking benchmark. IEEE Trans Pattern Anal Mach Intell 37(9):1834–1848CrossRef Wu Y, Lim J, Yang M (2015) Object tracking benchmark. IEEE Trans Pattern Anal Mach Intell 37(9):1834–1848CrossRef
32.
Zurück zum Zitat Wu Y, Lim J, Yang MH (2013) Online object tracking: a benchmark. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’13), pp 2411–2418 Wu Y, Lim J, Yang MH (2013) Online object tracking: a benchmark. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’13), pp 2411–2418
33.
Zurück zum Zitat Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. ACM Comput Surv (CSUR) 38(4):13CrossRef Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. ACM Comput Surv (CSUR) 38(4):13CrossRef
34.
Zurück zum Zitat Zhang K, Zhang L, Yang MH (2012) Real-time compressive tracking. In: Computer Vision—ECCV 2012. Springer, Berlin, pp 864–877 Zhang K, Zhang L, Yang MH (2012) Real-time compressive tracking. In: Computer Vision—ECCV 2012. Springer, Berlin, pp 864–877
35.
Zurück zum Zitat Zhang T, Ghanem B, Liu S, Ahuja N (2012) Robust visual tracking via multi-task sparse learning. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 2042–2049 Zhang T, Ghanem B, Liu S, Ahuja N (2012) Robust visual tracking via multi-task sparse learning. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 2042–2049
36.
Zurück zum Zitat Zhong W, Lu H, Yang MH (2012) Robust object tracking via sparsity-based collaborative model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 1838–1845 Zhong W, Lu H, Yang MH (2012) Robust object tracking via sparsity-based collaborative model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’12), pp 1838–1845
37.
Zurück zum Zitat Zhou S, Chen Q, Wang X (2013) Convolutional deep networks for visual data classification. Neural Process Lett 38(1):17–27CrossRef Zhou S, Chen Q, Wang X (2013) Convolutional deep networks for visual data classification. Neural Process Lett 38(1):17–27CrossRef
Metadaten
Titel
Deep Learning and Preference Learning for Object Tracking: A Combined Approach
verfasst von
Shuchao Pang
Juan José del Coz
Zhezhou Yu
Oscar Luaces
Jorge Díez
Publikationsdatum
23.10.2017
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 3/2018
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-017-9720-5

Weitere Artikel der Ausgabe 3/2018

Neural Processing Letters 3/2018 Zur Ausgabe

Neuer Inhalt