Skip to main content

2022 | OriginalPaper | Buchkapitel

29. MPSiam: A Fast Multiplexing Siamese Tracking Network

verfasst von : Donghao Li, Ce Shen, Jinxing Hu, Diping Yuan

Erschienen in: Advances in Smart Vehicular Technology, Transportation, Communication and Applications

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Siamese trackers have achieved remarkable performance in accuracy. However, the high memory cost and inference speed have restricted the deployment of the state-of-the-art trackers in mobile applications. To address this issue, this paper presents a backbone consisting of multiplexing convolution blocks that newly proposed by us, which combine the spatial multiplexing operation and channel multiplexing operation. The spatial multiplexing operation is inspired by the subpixel convolution in super-resolution tasks. The channel multiplexing operation is inspired by the channel shuffle in ShuffleNet. These two modules can be used to effectively optimize the multiply–accumulate (MACC) operation, by multiplying the number of operations and then adding it to a network. We employ this new module to build a novel lightweight backbone for the SiamRPN++ tracker. We trained this model and evaluated its performances on the VOT2018 and OTB2015 datasets. Our model is compressed to 43 MB, the inference time was 83 FPS, and the experiments were carried out in a single NVIDIA 2080Ti GPU. Our model is superior to MobileNetv2-SiamRPN++, which has a model size of 58 MB and the inference time of 55 FPS, and our method also managed to reduce the MACC from 1.2 to 0.5 B. Compared with SiamRPN++ with Resnet50 backbone, our model achieved a compression rate of 4.8\(\times \) and speedup of 3.3\(\times \), just losing 3% EAO.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lee, K.-H., Hwang, J.-N.: On-road pedestrian tracking across multiple driving recorders. IEEE Trans. Multimedia 17(9), 1429–1438 (2015)CrossRef Lee, K.-H., Hwang, J.-N.: On-road pedestrian tracking across multiple driving recorders. IEEE Trans. Multimedia 17(9), 1429–1438 (2015)CrossRef
2.
Zurück zum Zitat M. Odelga, P. Stegagno, N. Kochanek, H.H. Bülthoff, A selfcontained teleoperated quadrotor: on-board state-estimation and indoor obstacle avoidance. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 7840–7847 (2018) M. Odelga, P. Stegagno, N. Kochanek, H.H. Bülthoff, A selfcontained teleoperated quadrotor: on-board state-estimation and indoor obstacle avoidance. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 7840–7847 (2018)
3.
Zurück zum Zitat Yuan, C., Liu, Z., Zhang, Y.: Aerial images-based forest fire detection for firefighting using optical remote sensing techniques and unmanned aerial vehicles. J. Intell. Robot. Syst. 88(2–4), 635–654 (2017)CrossRef Yuan, C., Liu, Z., Zhang, Y.: Aerial images-based forest fire detection for firefighting using optical remote sensing techniques and unmanned aerial vehicles. J. Intell. Robot. Syst. 88(2–4), 635–654 (2017)CrossRef
4.
Zurück zum Zitat Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., Torr, P.H.: Fully convolutional Siamese networks for object tracking. In: ECCV, pp. 850–865 (2016) Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., Torr, P.H.: Fully convolutional Siamese networks for object tracking. In: ECCV, pp. 850–865 (2016)
5.
Zurück zum Zitat Valmadre, J., Bertinetto, L., Henriques, J. F., Vedaldi, A., Torr, P.H.S.: End-to-end representation learning for correlation filter based tracking. In: CVPR (2017) Valmadre, J., Bertinetto, L., Henriques, J. F., Vedaldi, A., Torr, P.H.S.: End-to-end representation learning for correlation filter based tracking. In: CVPR (2017)
6.
Zurück zum Zitat Huang, C., Lucey, S., Ramanan, D.: Learning policies for adaptive tracking with deep feature cascades. In: ICCV (2017) Huang, C., Lucey, S., Ramanan, D.: Learning policies for adaptive tracking with deep feature cascades. In: ICCV (2017)
7.
Zurück zum Zitat Li, B. Wu, W., Wang, Q., Zhang, F., Xing, J., Yan, J. SiamRPN++: evolution of Siamese visual tracking with very deep networks. In: CVPR, pp. 4282–4291 (2019) Li, B. Wu, W., Wang, Q., Zhang, F., Xing, J., Yan, J. SiamRPN++: evolution of Siamese visual tracking with very deep networks. In: CVPR, pp. 4282–4291 (2019)
8.
Zurück zum Zitat Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017) Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)
9.
Zurück zum Zitat Zhang, X., Zhou, X., Lin, M., Sun, J.: Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018) Zhang, X., Zhou, X., Lin, M., Sun, J.: Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)
10.
Zurück zum Zitat Li, B., Yan, J., Wu, W., Zhu, Z., Hu, X.: High performance visual tracking with Siamese region proposal network. In: CVPR (2018) Li, B., Yan, J., Wu, W., Zhu, Z., Hu, X.: High performance visual tracking with Siamese region proposal network. In: CVPR (2018)
11.
Zurück zum Zitat Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: International Conference on Neural Information Processing Systems, pp. 91–99 (2015) Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: International Conference on Neural Information Processing Systems, pp. 91–99 (2015)
12.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25(2), 2012 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25(2), 2012 (2012)
13.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
14.
Zurück zum Zitat Wang, Q., Zhang, L., Bertinetto, L., Hu, W., Torr, P.H.: Fast online object tracking and segmentation: a unifying approach. In: CVPR (2019) Wang, Q., Zhang, L., Bertinetto, L., Hu, W., Torr, P.H.: Fast online object tracking and segmentation: a unifying approach. In: CVPR (2019)
15.
Zurück zum Zitat Lin, T.-Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017) Lin, T.-Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)
16.
Zurück zum Zitat Huang, G., Liu, S., Van der Maaten, L., Weinberger, K.Q.: Condensenet: an efficient densenet using learned group convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018) Huang, G., Liu, S., Van der Maaten, L., Weinberger, K.Q.: Condensenet: an efficient densenet using learned group convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)
17.
Zurück zum Zitat Shi, W., Caballero, J., Huszár, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., Wang, Z.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016) Shi, W., Caballero, J., Huszár, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., Wang, Z.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
18.
Zurück zum Zitat Ma, N., Zhang, X., Zheng, H.-T., Sun, J.: Shufflenet v2: practical guidelines for efficient CNN architecture design. In: European Conference on Computer Vision (ECCV) (2018) Ma, N., Zhang, X., Zheng, H.-T., Sun, J.: Shufflenet v2: practical guidelines for efficient CNN architecture design. In: European Conference on Computer Vision (ECCV) (2018)
Metadaten
Titel
MPSiam: A Fast Multiplexing Siamese Tracking Network
verfasst von
Donghao Li
Ce Shen
Jinxing Hu
Diping Yuan
Copyright-Jahr
2022
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-16-4039-1_29

    Premium Partner