Skip to main content
Top
Published in: Multimedia Systems 2/2015

01-03-2015 | Special Issue Paper

Soft-assigned bag of features for object tracking

Authors: Tongwei Ren, Zhongyan Qiu, Yan Liu, Tong Yu, Jia Bei

Published in: Multimedia Systems | Issue 2/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Hard assignment-based bag of features (BoF) representation inevitably brings in quantization errors, which may lead to inaccuracy, even failure in object tracking. In this paper, we propose a novel soft-assigned BoF tracking approach, in which soft assignment is utilized to improve the robustness and discrimination of BoF representation. After labeling the tracked target, we first randomly sample the circle patches with adaptive size within and outside the labeled target, extract the local features from the patches, and construct the codebooks by k-means clustering. When tracking in a new frame, we generate the BoF representation of each candidate target, and select the most similar candidate target in the previous tracked result based on BoF representation. To improve tracking performance, we also continuously update the codebooks and refine the tracking results. Experiments show that our approach outperforms the state-of-the-art tracking methods under complex tracking conditions.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Zhong, W., Lu, H., Yang M.H.: Robust object tracking via sparsity-based collaborative model. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1838–1845. Providence, USA (2012) Zhong, W., Lu, H., Yang M.H.: Robust object tracking via sparsity-based collaborative model. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1838–1845. Providence, USA (2012)
2.
go back to reference Wang, M., Hong, R., Li, G., Zha, Z.J., Yan, S., Chua, T.S.: Event driven web video summarization by tag localization and key-shot identification. IEEE Trans. Multimedia 14(4), 975–985 (2012)CrossRef Wang, M., Hong, R., Li, G., Zha, Z.J., Yan, S., Chua, T.S.: Event driven web video summarization by tag localization and key-shot identification. IEEE Trans. Multimedia 14(4), 975–985 (2012)CrossRef
3.
go back to reference Zhang, P., Thomas, T., Emmanuel, S.: Privacy enabled video surveillance using a two state Indicatov tracking algorithm. Multimedia Syst. 18(2), 175–199 (2012)CrossRef Zhang, P., Thomas, T., Emmanuel, S.: Privacy enabled video surveillance using a two state Indicatov tracking algorithm. Multimedia Syst. 18(2), 175–199 (2012)CrossRef
4.
go back to reference Wachs, J.P., Kölsch, M., Stern, H., Edan, Y.: Vision-based hand-gesture applications. Commun. ACM 54(2), 60–71 (2011)CrossRef Wachs, J.P., Kölsch, M., Stern, H., Edan, Y.: Vision-based hand-gesture applications. Commun. ACM 54(2), 60–71 (2011)CrossRef
5.
go back to reference Wang, M., Hua, X.S., Hong, R., Tang, J., Qi, G.J., Song, Y.: Unified video annotation via multigraph learning. IEEE Trans. Circ. Syst. Video Technol. 19(5), 733–746 (2009)CrossRef Wang, M., Hua, X.S., Hong, R., Tang, J., Qi, G.J., Song, Y.: Unified video annotation via multigraph learning. IEEE Trans. Circ. Syst. Video Technol. 19(5), 733–746 (2009)CrossRef
6.
go back to reference Park, B.S., Yoo, S.J., Park, J.B., Choi, Y.H.: A simple adaptive control approach for trajectory tracking of electrically driven nonholonomic mobile robots. IEEE Trans. Control Syst. Technol. 18(5), 1199–1206 (2010)CrossRef Park, B.S., Yoo, S.J., Park, J.B., Choi, Y.H.: A simple adaptive control approach for trajectory tracking of electrically driven nonholonomic mobile robots. IEEE Trans. Control Syst. Technol. 18(5), 1199–1206 (2010)CrossRef
7.
go back to reference Li, A., Tang, F., Guo, Y., Tao H.: Discriminative nonorthogonal binary subspace tracking. In: European Conference on Computer Vision, pp. 258–271. Heraklion, Greece (2010) Li, A., Tang, F., Guo, Y., Tao H.: Discriminative nonorthogonal binary subspace tracking. In: European Conference on Computer Vision, pp. 258–271. Heraklion, Greece (2010)
8.
go back to reference Li, X., Hu, W., Shen, C., Zhang, Z., Dick, A., Hengel, A.: A survey of appearance models in visual object tracking. ACM Trans. Intell. Syst. Technol. 4(4), 1–48 (2013) Li, X., Hu, W., Shen, C., Zhang, Z., Dick, A., Hengel, A.: A survey of appearance models in visual object tracking. ACM Trans. Intell. Syst. Technol. 4(4), 1–48 (2013)
9.
go back to reference Ying, L., Xu, C., Guo W.: Extended MHT algorithm for multiple object tracking. In: International Conference on Internet Multimedia Computing and Service, pp. 75–79. Wuhan, China (2012) Ying, L., Xu, C., Guo W.: Extended MHT algorithm for multiple object tracking. In: International Conference on Internet Multimedia Computing and Service, pp. 75–79. Wuhan, China (2012)
10.
go back to reference Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3360–3367. San Francisco, USA (2010) Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3360–3367. San Francisco, USA (2010)
11.
go back to reference Jiang, Y. G., Ngo, C. W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: ACM International Conference on Image and Video Retrieval, pp. 494–501. Amsterdam, The Netherlands (2007) Jiang, Y. G., Ngo, C. W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: ACM International Conference on Image and Video Retrieval, pp. 494–501. Amsterdam, The Netherlands (2007)
12.
go back to reference Jégou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87(3), 316–336 (2010)CrossRef Jégou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87(3), 316–336 (2010)CrossRef
13.
go back to reference Yang, F., Lu, H., Chen, Y.: Bag of features tracking. In: International Conference on Pattern Recognition, pp. 153–156. Istanbul, Turkey (2010) Yang, F., Lu, H., Chen, Y.: Bag of features tracking. In: International Conference on Pattern Recognition, pp. 153–156. Istanbul, Turkey (2010)
14.
go back to reference Yang, F., Lu, H., Zhang, W., Yang, G.: Visual tracking via bag of features. IET Image Process. 6(2), 115–128 (2012)CrossRefMathSciNet Yang, F., Lu, H., Zhang, W., Yang, G.: Visual tracking via bag of features. IET Image Process. 6(2), 115–128 (2012)CrossRefMathSciNet
15.
go back to reference Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. Anchorage, USA (2008) Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. Anchorage, USA (2008)
16.
go back to reference Qiu, Z., Yu, T., Ren, T., Liu, Y., Bei, J.: Soft-assigned bag of features tracking. In: International Conference on Internet Multimedia Computing and Service. Huangshan, China (2013) Qiu, Z., Yu, T., Ren, T., Liu, Y., Bei, J.: Soft-assigned bag of features tracking. In: International Conference on Internet Multimedia Computing and Service. Huangshan, China (2013)
17.
go back to reference Pérez, P., Hue, C., Vermaak, J., Gangnet, M.: Color-based probabilistic tracking. In: European Conference on Computer Vision, pp. 661–675. Copenhagen, Denmark (2002) Pérez, P., Hue, C., Vermaak, J., Gangnet, M.: Color-based probabilistic tracking. In: European Conference on Computer Vision, pp. 661–675. Copenhagen, Denmark (2002)
18.
go back to reference Allili, M.S., Ziou, D.: Object of interest segmentation and tracking by using feature selection and active contours. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. Minneapolis, USA (2007) Allili, M.S., Ziou, D.: Object of interest segmentation and tracking by using feature selection and active contours. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. Minneapolis, USA (2007)
19.
go back to reference Santner, J., Leistner, C., Saffari, A., Pock, T., Bischof, H.: Prost: Parallel robust online simple tracking. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 723–730. San Francisco, USA (2010) Santner, J., Leistner, C., Saffari, A., Pock, T., Bischof, H.: Prost: Parallel robust online simple tracking. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 723–730. San Francisco, USA (2010)
20.
go back to reference Hu, W., Li, X., Zhang, X., Shi, X., Maybank, S., Zhang, Z.: Incremental tensor subspace learning and its applications to foreground segmentation and tracking. Int. J. Comput. Vis. 91(3), 303–327 (2011)CrossRefMATH Hu, W., Li, X., Zhang, X., Shi, X., Maybank, S., Zhang, Z.: Incremental tensor subspace learning and its applications to foreground segmentation and tracking. Int. J. Comput. Vis. 91(3), 303–327 (2011)CrossRefMATH
21.
go back to reference Kim, Z.: Real time object tracking based on dynamic feature grouping with background subtraction. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. Anchorage, USA (2008) Kim, Z.: Real time object tracking based on dynamic feature grouping with background subtraction. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. Anchorage, USA (2008)
22.
go back to reference Zhou, H., Yuan, Y., Shi, C.: Object tracking using SIFT features and mean shift. Comput. Vis. Image Underst. 113(3), 345–352 (2009)CrossRef Zhou, H., Yuan, Y., Shi, C.: Object tracking using SIFT features and mean shift. Comput. Vis. Image Underst. 113(3), 345–352 (2009)CrossRef
23.
go back to reference He, W., Yamashita, T., Lu, H., Lao, S.: Surf tracking. In: IEEE International Conference on Computer Vision, pp. 1586–1592. Kyoto, Japan (2009) He, W., Yamashita, T., Lu, H., Lao, S.: Surf tracking. In: IEEE International Conference on Computer Vision, pp. 1586–1592. Kyoto, Japan (2009)
24.
go back to reference Wang, S., Lu, H., Yang, F., Yang, M.H.: Superpixel tracking. In: IEEE International Conference on Computer Vision, pp. 1323–1330. Barcelona, Spain (2011) Wang, S., Lu, H., Yang, F., Yang, M.H.: Superpixel tracking. In: IEEE International Conference on Computer Vision, pp. 1323–1330. Barcelona, Spain (2011)
25.
go back to reference Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV Workshop on Statistical Learning in Computer Vision, pp. 1–22. Prague, Czech Republic (2004) Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV Workshop on Statistical Learning in Computer Vision, pp. 1–22. Prague, Czech Republic (2004)
26.
go back to reference Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: European Conference on Computer Vision, pp. 490–503. Graz, Austria (2006) Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: European Conference on Computer Vision, pp. 490–503. Graz, Austria (2006)
27.
go back to reference Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
28.
go back to reference Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2008)CrossRef Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2008)CrossRef
29.
go back to reference Manjunath, B.S., Ohm, J.R., Vasudevan, V.V., Yamada, A.: Color and texture descriptors. IEEE Trans. Circ. Syst. Video Technol. 11(6), 703–715 (2001)CrossRef Manjunath, B.S., Ohm, J.R., Vasudevan, V.V., Yamada, A.: Color and texture descriptors. IEEE Trans. Circ. Syst. Video Technol. 11(6), 703–715 (2001)CrossRef
30.
go back to reference Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Vision and Pattern Recognition, pp. 524–531. San Diego, USA (2005) Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Vision and Pattern Recognition, pp. 524–531. San Diego, USA (2005)
31.
go back to reference Wang, M., Hua, X.S., Tang, J., Hong, R.: Beyond distance measurement: Constructing neighborhood similarity for video annotation. IEEE Trans. Multimedia 11(3), 465–476 (2009)CrossRef Wang, M., Hua, X.S., Tang, J., Hong, R.: Beyond distance measurement: Constructing neighborhood similarity for video annotation. IEEE Trans. Multimedia 11(3), 465–476 (2009)CrossRef
32.
go back to reference Jiang, Y.G., Yang, J., Ngo, C.W., Hauptmann, A.G.: Representations of keypoint-based semantic concept detection: A comprehensive study. IEEE Trans. Multimedia 12(1), 42–53 (2010)CrossRef Jiang, Y.G., Yang, J., Ngo, C.W., Hauptmann, A.G.: Representations of keypoint-based semantic concept detection: A comprehensive study. IEEE Trans. Multimedia 12(1), 42–53 (2010)CrossRef
33.
go back to reference Gemert, J.C., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.M.: Kernel codebooks for scene categorization. In: European Conference on Computer Vision, pp. 696–709. Marseille, France (2008) Gemert, J.C., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.M.: Kernel codebooks for scene categorization. In: European Conference on Computer Vision, pp. 696–709. Marseille, France (2008)
34.
go back to reference Zhu, S., Wang, G., Ngo, C. W., Jiang, Y.G.: On the sampling of web images for learning visual concept classifiers. In: ACM International Conference on Image and Video Retrieval, pp. 50–57. Xi’an, China (2010) Zhu, S., Wang, G., Ngo, C. W., Jiang, Y.G.: On the sampling of web images for learning visual concept classifiers. In: ACM International Conference on Image and Video Retrieval, pp. 50–57. Xi’an, China (2010)
35.
go back to reference Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: An efficient k-means clustering algorithm: Analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 881–892 (2002)CrossRef Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: An efficient k-means clustering algorithm: Analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 881–892 (2002)CrossRef
36.
go back to reference Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)CrossRef Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)CrossRef
38.
go back to reference Adam, A., Rivlin, E., Shimshoni, I.: Robust fragments-based tracking using the integral histogram. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 798–805. New York, USA (2006) Adam, A., Rivlin, E., Shimshoni, I.: Robust fragments-based tracking using the integral histogram. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 798–805. New York, USA (2006)
39.
go back to reference Ross, D., Lim, J., Lin, R., Yang, M.: Incremental learning for robust visual tracking. Int. J. Comput. Vis. 77(1–3), 125–141 (2008)CrossRef Ross, D., Lim, J., Lin, R., Yang, M.: Incremental learning for robust visual tracking. Int. J. Comput. Vis. 77(1–3), 125–141 (2008)CrossRef
40.
go back to reference Babenko, B., Yang, M.H., Belongie, S.: Visual tracking with online multiple instance learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 983–990. Miami, USA (2009) Babenko, B., Yang, M.H., Belongie, S.: Visual tracking with online multiple instance learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 983–990. Miami, USA (2009)
41.
go back to reference Kwon, J., Lee, K.M.: Visual tracking decomposition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1269–1276. San Francisco, USA (2010) Kwon, J., Lee, K.M.: Visual tracking decomposition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1269–1276. San Francisco, USA (2010)
42.
go back to reference Kalal, Z., Mikolajczyk, K., Matas, J.: Tracking learning detection. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1409–1422 (2012)CrossRef Kalal, Z., Mikolajczyk, K., Matas, J.: Tracking learning detection. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1409–1422 (2012)CrossRef
43.
go back to reference Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef
Metadata
Title
Soft-assigned bag of features for object tracking
Authors
Tongwei Ren
Zhongyan Qiu
Yan Liu
Tong Yu
Jia Bei
Publication date
01-03-2015
Publisher
Springer Berlin Heidelberg
Published in
Multimedia Systems / Issue 2/2015
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-014-0384-y

Other articles of this Issue 2/2015

Multimedia Systems 2/2015 Go to the issue