skip to main content
research-article

Learning Adaptive Spatial-Temporal Context-Aware Correlation Filters for UAV Tracking

Authors Info & Claims
Published:04 March 2022Publication History
Skip Abstract Section

Abstract

Tracking in the unmanned aerial vehicle (UAV) scenarios is one of the main components of target-tracking tasks. Different from the target-tracking task in the general scenarios, the target-tracking task in the UAV scenarios is very challenging because of factors such as small scale and aerial view. Although the discriminative correlation filter (DCF)-based tracker has achieved good results in tracking tasks in general scenarios, the boundary effect caused by the dense sampling method will reduce the tracking accuracy, especially in UAV-tracking scenarios. In this work, we propose learning an adaptive spatial-temporal context-aware (ASTCA) model in the DCF-based tracking framework to improve the tracking accuracy and reduce the influence of boundary effect, thereby enabling our tracker to more appropriately handle UAV-tracking tasks. Specifically, our ASTCA model can learn a spatial-temporal context weight, which can precisely distinguish the target and background in the UAV-tracking scenarios. Besides, considering the small target scale and the aerial view in UAV-tracking scenarios, our ASTCA model incorporates spatial context information within the DCF-based tracker, which could effectively alleviate background interference. Extensive experiments demonstrate that our ASTCA method performs favorably against state-of-the-art tracking methods on some standard UAV datasets.

REFERENCES

  1. [1] An Na and Yan Wei Qi. 2021. Multitarget tracking using siamese neural networks. ACM Transactions on Multimedia Computing, Communications, and Applications 17 (2021), Article 75, 16 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Bertinetto Luca, Valmadre Jack, Golodetz Stuart, Miksik Ondrej, and Torr Philip H. S.. 2016. Staple: Complementary learners for real-time tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 14011409.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Bertinetto Luca, Valmadre Jack, Henriques Joao F., Vedaldi Andrea, and Torr Philip H. S.. 2016. Fully-convolutional siamese networks for object tracking. In Proceedings of the European Conference on Computer Vision. Springer, Amsterdam, The Netherlands, 850865.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Bolme David S., Beveridge J. Ross, Draper Bruce A., and Lui Yui Man. 2010. Visual object tracking using adaptive correlation filters. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 25442550.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Chang Xiaojun, Yu Yaoliang, Yang Yi, and Xing Eric P.. 2017. Semantic pooling for complex event analysis in untrimmed videos. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 8 (2017), 16171632.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Chen Kaixuan, Yao Lina, Zhang Dalin, Wang Xianzhi, Chang Xiaojun, and Nie Feiping. 2020. A semisupervised recurrent convolutional attention model for human activity recognition. IEEE Transactions on Neural Networks and Learning Systems 31, 5 (2020), 17471756.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Chen Ke, Zhou Zhong, and Wu Wei. 2015. Progressive motion vector clustering for motion estimation and auxiliary tracking. ACM Transactions on Multimedia Computing, Communications, and Applications 11(2015), Article 33, 23 pages.Google ScholarGoogle Scholar
  8. [8] Chen Zedu, Zhong Bineng, Li Guorong, Zhang Shengping, and Ji Rongrong. 2020. Siamese box adaptive network for visual tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 66686677.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Cheng Zhiyong, Chang Xiaojun, Zhu Lei, Kanjirathinkal Rose Catherine, and Kankanhalli Mohan S.. 2019. MMALFM: Explainable recommendation by leveraging reviews and images. ACM Transactions on Information Systems 37, 2 (2019), 16:1–16:28.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Dai Kenan, Wang Dong, Lu Huchuan, Sun Chong, and Li Jianhua. 2019. Visual tracking via adaptive spatially-regularized correlation filters. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 46704679.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Danelljan Martin, Bhat Goutam, Khan Fahad Shahbaz, and Felsberg Michael. 2017. ECO: Efficient convolution operators for tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 66386646.Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Danelljan Martin, Häger Gustav, Khan Fahad Shahbaz, and Felsberg Michael. 2017. Discriminative scale space tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 8 (2017), 15611575.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Danelljan Martin, Hager Gustav, Khan Fahad Shahbaz, and Felsberg Michael. 2015. Convolutional features for correlation filter based visual tracking. In Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop. IEEE, 5866.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Danelljan Martin, Hager Gustav, Khan Fahad Shahbaz, and Felsberg Michael. 2015. Learning spatially regularized correlation filters for visual tracking. In Proceedings of the 2015 IEEE International Conference on Computer Vision. IEEE, 43104318.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Danelljan Martin, Hager Gustav, Khan Fahad Shahbaz, and Felsberg Michael. 2016. Adaptive decontamination of the training set: A unified formulation for discriminative visual tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 14301438.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Danelljan Martin, Robinson Andreas, Khan Fahad Shahbaz, and Felsberg Michael. 2016. Beyond correlation filters: Learning continuous convolution operators for visual tracking. In Proceedings of the European Conference on Computer Vision. Springer, Amsterdam, 472488.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Danelljan Martin, Khan Fahad Shahbaz, Felsberg Michael, and Weijer Joost Van de. 2014. Adaptive color attributes for real-time visual tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 10901097.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Du Dawei, Qi Yuankai, Yu Hongyang, Yang Yifan, Duan Kaiwen, Li Guorong, Zhang Weigang, Huang Qingming, and Tian Qi. 2018. The unmanned aerial vehicle benchmark: Object detection and tracking. In Proceedings of the European Conference on Computer Vision. Springer, 370386.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Fan Heng and Ling Haibin. 2017. Parallel tracking and verifying: A framework for real-time and high accuracy visual tracking. In Proceedings of the International Conference on Computer Vision. IEEE, 54865494.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Fan Jiaqing, Song Huihui, Zhang Kaihua, Yang Kang, and Liu Qingshan. 2020. Feature alignment and aggregation siamese networks for fast visual tracking. IEEE Transactions on Circuits and Systems for Video Technology 31, 4 (2020), 12961307.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Feng Li, Cheng Tian, Zuo Wangmeng, Lei Zhang, and Yang Ming Hsuan. 2018. Learning spatial-temporal regularized correlation filters for visual tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 49044913.Google ScholarGoogle Scholar
  22. [22] Fu Changhong, Huang Ziyuan, Li Yiming, Duan Ran, and Lu Peng. 2019. Boundary effect-aware visual tracking for UAV with online enhanced background learning and multi-frame consensus verification. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 44154422.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Fu Sichao, Liu Weifeng, Guan Weili, Zhou Yicong, Tao Dapeng, and Xu Changsheng. 2021. Dynamic graph learning convolutional networks for semi-supervised classification. ACM Transactions on Multimedia Computing, Communications, and Applications 17, 1 (2021), 113.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Guo Changyong, Zhang Zhaoxin, Li Jinjiang, Jiang Xuesong, Zhang Jun, and Zhang Lei. 2020. Robust visual tracking using kernel sparse coding on multiple covariance descriptors. ACM Transactions on Multimedia Computing, Communications, and Applications 16, 1s, Article 20 (2020), 22 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Guo Jianting, Zheng Peijia, and Huang Jiwu. 2017. An efficient motion detection and tracking scheme for encrypted surveillance videos. ACM Transactions on Multimedia Computing, Communications, and Applications 13 (2017), Article 61 , 23 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Han Junwei, Yang Le, Zhang Dingwen, Chang Xiaojun, and Liang Xiaodan. 2018. Reinforcement cutting-agent learning for video object segmentation. In Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition. 90809089.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Han Mingfei, Wang Yali, Chang Xiaojun, and Qiao Yu. 2020. Mining inter-video proposal relations for video object detection. In Proceedings of the European Conference on Computer Vision 2020Lecture Notes in Computer Science, Vol. 12366. 431446.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Henriques João F., Caseiro Rui, Martins Pedro, and Batista Jorge. 2012. Exploiting the circulant structure of tracking-by-detection with kernels. In Proceedings of the European Conference on Computer Vision. Springer, 702715.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Henriques João F., Caseiro Rui, Martins Pedro, and Batista Jorge. 2014. High-speed tracking with kernelized correlation filters. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 3 (2014), 583596.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Hong Zhibin, Zhe Chen, Wang Chaohui, Xue Mei, Prokhorov Danil, and Tao Dacheng. 2015. MUlti-store tracker (MUSTer): A cognitive psychology inspired approach to object tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 749758.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Huang Ziyuan, Fu Changhong, Li Yiming, Lin Fuling, and Lu Peng. 2019. Learning aberrance repressed correlation filters for real-time UAV tracking. In Proceedings of the International Conference on Computer Vision. IEEE, 28912900.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Galoogahi Hamed Kiani, Fagg Ashton, and Lucey Simon. 2017. Learning background-aware correlation filters for visual tracking. In Proceedings of the International Conference on Computer Vision. IEEE, 11351143.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Galoogahi Hamed Kiani, Sim Terence, and Lucey Simon. 2015. Correlation filters with limited boundaries. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 46304638.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Li Siyi and Yeung Dit-Yan. 2017. Visual object tracking for unmanned aerial vehicles: A benchmark and new motion models. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, San Francisco, CA,41404146.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Li Xin, Liu Qiao, Fan Nana, Zhou Zikun, He Zhenyu, and Jing Xiao-yuan. 2020. Dual-regression model for visual tracking. Neural Networks 132 (2020), 364374.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Li Yiming, Fu Changhong, Ding Fangqiang, Huang Ziyuan, and Lu Geng. 2020. AutoTrack: Towards high-performance visual tracking for UAV with automatic spatio-temporal regularization. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 1192311932.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Li Yang and Zhu Jianke. 2014. A scale adaptive kernel correlation filter tracker with feature integration. In Proceedings of the European Conference on Computer Vision. Springer, 254265.Google ScholarGoogle Scholar
  38. [38] Li Yang, Zhu Jianke, Hoi Steven C. H., Song Wenjie, Wang Zhefeng, and Liu Hantang. 2019. Robust estimation of similarity transformation for visual object tracking. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 86668673.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Li Zhihui, Yao Lina, Chang Xiaojun, Zhan Kun, Sun Jiande, and Zhang Huaxiang. 2019. Zero-shot event detection via event-adaptive concept relevance mining. Pattern Recognition 88 (2019), 595603.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Liu Kaikai and Li Xiaolin. 2015. Enabling context-aware indoor augmented reality via smartphone sensing and vision tracking. ACM Transactions on Multimedia Computing, Communications, and Applications 12 (2015), Article 15 , 23 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Liu Qiao, Li Xin, He Zhenyu, Fan Nana, Yuan Di, and Wang Hongpeng. 2020. Learning deep multi-level similarity for thermal infrared object tracking. IEEE Transactions on Multimedia 23 (2020), 21142126.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Luo Minnan, Yan Caixia, Zheng Qinghua, Chang Xiaojun, Chen Ling, and Nie Feiping. 2019. Discrete multi-graph clustering. IEEE Transactions on Image Processing 28, 9 (2019), 47014712.Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Ma Zhigang, Chang Xiaojun, Xu Zhongwen, Sebe Nicu, and Hauptmann Alexander G.. 2018. Joint attributes and event analysis for multimedia event detection. IEEE Transactions on Neural Networks and Learning Systems 29, 7 (2018), 29212930.Google ScholarGoogle Scholar
  44. [44] Mueller Matthias, Smith Neil, and Ghanem Bernard. 2016. A benchmark and simulator for UAV tracking. In Proceedings of the European Conference on Computer Vision. Springer, 445461.Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Mueller Matthias, Smith Neil, and Ghanem Bernard. 2017. Context-aware correlation filter tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 13961404.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Qi Yuankai, Zhang Shengping, Qin Lei, Yao Hongxun, Huang Qingming, Lim Jongwoo, and Yang Ming-Hsuan. 2016. Hedged deep tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 43034311.Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Ren Pengzhen, Xiao Yun, Chang Xiaojun, Huang Poyao, Li Zhihui, Chen Xiaojiang, and Wang Xin. 2021. A comprehensive survey of neural architecture search: Challenges and solutions. ACM Computing Surveys 54, 4 (2021), 76:1–76:34.Google ScholarGoogle Scholar
  48. [48] Shu Xiu, Yang Yunyun, and Wu Boying. 2021. Adaptive segmentation model for liver CT images based on neural network and level set method. Neurocomputing 453 (2021), 438452.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Shu Xiu, Yang Yunyun, and Wu Boying. 2021. A neighbor level set framework minimized with the split Bregman method for medical image segmentation. Signal Processing 189 (2021), 108293.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Song Yibing, Ma Chao, Gong Lijun, Zhang Jiawei, Lau Rynson WH, and Yang Ming-Hsuan. 2017. CREST: Convolutional residual learning for visual tracking. In Proceedings of the 2017 IEEE International Conference on Computer Vision. IEEE, 25742583.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Tao Ran, Gavves Efstratios, and Smeulders Arnold W. M.. 2016. Siamese instance search for tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 14201429.Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Tian Chunwei, Fei Lunke, Zheng Wenxian, Xu Yong, Zuo Wangmeng, and Lin Chia-Wen. 2020. Deep learning on image denoising: An overview. Neural Networks 131 (2020), 251275.Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Valmadre Jack, Bertinetto Luca, Henriques João, Vedaldi Andrea, and Torr Philip H. S.. 2017. End-to-end representation learning for correlation filter based tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 28052813.Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Wang Lijun, Ouyang Wanli, Wang Xiaogang, and Lu Huchuan. 2015. Visual tracking with fully convolutional networks. In Proceedings of the International Conference on Computer Vision. IEEE, 31193127.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Wang Lijun, Ouyang Wanli, Wang Xiaogang, and Lu Huchuan. 2016. STCT: Sequentially training convolutional networks for visual tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 13731381.Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Wang Ning, Song Yibing, Ma Chao, Zhou Wengang, Liu Wei, and Li Houqiang. 2019. Unsupervised deep tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 13081317.Google ScholarGoogle ScholarCross RefCross Ref
  57. [57] Wang Ning, Zhou Wengang, Tian Qi, Hong Richang, Wang Meng, and Li Houqiang. 2018. Multi-cue correlation filters for robust visual tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 48444853.Google ScholarGoogle ScholarCross RefCross Ref
  58. [58] Wang Yong, Ding Lu, and Laganiere Robert. 2019. Real-time UAV tracking based on PSR stability. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshop. IEEE, 19.Google ScholarGoogle ScholarCross RefCross Ref
  59. [59] Wu Yi, Lim Jongwoo, and Yang Ming-Hsuan. 2013. Online object tracking: A benchmark. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 24112418.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. [60] Xu Tianyang, Feng Zhen-Hua, Wu Xiao-Jun, and Kittler Josef. 2019. Joint group feature selection and discriminative filter learning for robust visual object tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 79507960.Google ScholarGoogle ScholarCross RefCross Ref
  61. [61] Yan Caixia, Chang Xiaojun, Luo Minnan, Zheng Qinghua, Zhang Xiaoqin, Li Zhihui, and Nie Feiping. 2020. Self-weighted robust LDA for multiclass classification with edge classes. ACM Transactions on Intelligent Systems and Technology 12 (2020), Article 4, 19 pages.Google ScholarGoogle Scholar
  62. [62] Yan Chenggang, Gong Biao, Wei Yuxuan, and Gao Yue. 2020. Deep multi-view enhancement hashing for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 4 (2020), 14451451.Google ScholarGoogle ScholarCross RefCross Ref
  63. [63] Yan Chenggang, Li Zhisheng, Zhang Yongbing, Liu Yutao, Ji Xiangyang, and Zhang Yongdong. 2020. Depth image denoising using nuclear norm and learning graph model. ACM Transactions on Multimedia Computing, Communications, and Applications 16, 4 (2020), 117.Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. [64] Yan Chenggang, Shao Biyao, Zhao Hao, Ning Ruixin, Zhang Yongdong, and Xu Feng. 2020. 3D room layout estimation from a single RGB image. IEEE Transactions on Multimedia 22, 11 (2020), 30143024.Google ScholarGoogle ScholarCross RefCross Ref
  65. [65] Yan Caixia, Zheng Qinghua, Chang Xiaojun, Luo Minnan, Yeh Chung-Hsing, and Hauptmann Alexander G.. 2020. Semantics-preserving graph propagation for zero-shot object detection. IEEE Transactions on Image Processing 29 (2020), 81638176.Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. [66] Yang Yunyun, Shu Xiu, Wang Ruofan, Feng Chong, and Jia Wenjing. 2020. Parallelizable and robust image segmentation model based on the shape prior information. Applied Mathematical Modelling 83 (2020), 357370.Google ScholarGoogle ScholarCross RefCross Ref
  67. [67] Yu En, Sun Jiande, Li Jing, Chang Xiaojun, Han Xian-Hua, and Hauptmann Alexander G.. 2019. Adaptive semi-supervised feature selection for cross-modal retrieval. IEEE Transactions on Multimedia 21, 5 (2019), 12761288.Google ScholarGoogle ScholarDigital LibraryDigital Library
  68. [68] Yuan Di, Chang Xiaojun, Huang Po-Yao, Liu Qiao, and He Zhenyu. 2021. Self-supervised deep correlation tracking. IEEE Transactions on Image Processing 30 (2021), 976985.Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. [69] Yuan Di, Fan Nana, and He Zhenyu. 2020. Learning target-focusing convolutional regression model for visual object tracking. Knowledge-Based Systems 194 (2020), 105526.Google ScholarGoogle ScholarCross RefCross Ref
  70. [70] Yuan Di, Kang Wei, and He Zhenyu. 2020. Robust visual tracking with correlation filters and metric learning. Knowledge-Based Systems 195 (2020), 105697.Google ScholarGoogle ScholarCross RefCross Ref
  71. [71] Yuan Di, Li Xin, He Zhenyu, Liu Qiao, and Lu Shuwei. 2020. Visual object tracking with adaptive structural convolutional network. Knowledge-Based Systems 194 (2020), 105554.Google ScholarGoogle ScholarCross RefCross Ref
  72. [72] Zhang Dalin, Yao Lina, Chen Kaixuan, Wang Sen, Chang Xiaojun, and Liu Yunhao. 2020. Making sense of spatio-temporal preserving representations for EEG-based human intention recognition. IEEE Transactions on Cybernetics 50, 7 (2020), 30333044.Google ScholarGoogle ScholarCross RefCross Ref
  73. [73] Zhang Feifei, Xu Mingliang, Mao Qirong, and Xu Changsheng. 2020. Joint attribute manipulation and modality alignment learning for composing text and image to image retrieval. In Proceedings of the 28th ACM International Conference on Multimedia. ACM, 33673376.Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. [74] Zhang Kaihua, Fan Jiaqing, Liu Qingshan, Yang Jian, and Lian Wei. 2018. Parallel attentive correlation tracking. IEEE Transactions on Image Processing 28, 1 (2018), 479491.Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. [75] Zhang Lingling, Chang Xiaojun, Liu Jun, Luo Minnan, Prakash Mahesh, and Hauptmann Alexander G.. 2020. Few-shot activity recognition with cross-modal memory network. Pattern Recognition 108 (2020), 107348.Google ScholarGoogle ScholarCross RefCross Ref
  76. [76] Zhang Lingling, Luo Minnan, Liu Jun, Chang Xiaojun, Yang Yi, and Hauptmann Alexander G.. 2020. Deep top-$k$ ranking for image-sentence matching. IEEE Transactions on Multimedia 22, 3 (2020), 775785.Google ScholarGoogle ScholarCross RefCross Ref
  77. [77] Zhang Tianzhu, Xu Changsheng, and Yang Ming-Hsuan. 2017. Multi-task correlation particle filter for robust object tracking. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 48194827.Google ScholarGoogle ScholarCross RefCross Ref
  78. [78] Zhong Bineng, Bai Bing, Li Jun, Zhang Yulun, and Fu Yun. 2018. Hierarchical tracking by reinforcement learning-based searching and coarse-to-fine verifying. IEEE Transactions on Image Processing 28, 5 (2018), 23312341.Google ScholarGoogle ScholarCross RefCross Ref
  79. [79] Zhou Runwu, Chang Xiaojun, Shi Lei, Shen Yi-Dong, Yang Yi, and Nie Feiping. 2020. Person reidentification via multi-feature fusion with adaptive graph learning. IEEE Transactions on Neural Networks and Learning Systems 31, 5 (2020), 15921601.Google ScholarGoogle ScholarCross RefCross Ref
  80. [80] Zhu Fengda, Zhu Yi, Chang Xiaojun, and Liang Xiaodan. 2020. Vision-language navigation with self-supervised auxiliary reasoning tasks. In Proceedings of the Computer Vision and Pattern Recognition. IEEE, 1001210022.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Learning Adaptive Spatial-Temporal Context-Aware Correlation Filters for UAV Tracking

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 3
      August 2022
      478 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3505208
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 March 2022
      • Accepted: 1 September 2021
      • Revised: 1 August 2021
      • Received: 1 May 2021
      Published in tomm Volume 18, Issue 3

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format