nach oben

Erschienen in:

2022 | OriginalPaper | Buchkapitel

29. MPSiam: A Fast Multiplexing Siamese Tracking Network

verfasst von : Donghao Li, Ce Shen, Jinxing Hu, Diping Yuan

Erschienen in: Advances in Smart Vehicular Technology, Transportation, Communication and Applications

Verlag: Springer Singapore

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Siamese trackers have achieved remarkable performance in accuracy. However, the high memory cost and inference speed have restricted the deployment of the state-of-the-art trackers in mobile applications. To address this issue, this paper presents a backbone consisting of multiplexing convolution blocks that newly proposed by us, which combine the spatial multiplexing operation and channel multiplexing operation. The spatial multiplexing operation is inspired by the subpixel convolution in super-resolution tasks. The channel multiplexing operation is inspired by the channel shuffle in ShuffleNet. These two modules can be used to effectively optimize the multiply–accumulate (MACC) operation, by multiplying the number of operations and then adding it to a network. We employ this new module to build a novel lightweight backbone for the SiamRPN++ tracker. We trained this model and evaluated its performances on the VOT2018 and OTB2015 datasets. Our model is compressed to 43 MB, the inference time was 83 FPS, and the experiments were carried out in a single NVIDIA 2080Ti GPU. Our model is superior to MobileNetv2-SiamRPN++, which has a model size of 58 MB and the inference time of 55 FPS, and our method also managed to reduce the MACC from 1.2 to 0.5 B. Compared with SiamRPN++ with Resnet50 backbone, our model achieved a compression rate of 4.8\(\times \) and speedup of 3.3\(\times \), just losing 3% EAO.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel An Improved Arithmetic Optimization Algorithm with a Strategy Balancing Exploration and Exploitation

Nächstes Kapitel A Single-Phase-to-Ground Fault Location Method Based on Deep Belief Network

Lee, K.-H., Hwang, J.-N.: On-road pedestrian tracking across multiple driving recorders. IEEE Trans. Multimedia 17(9), 1429–1438 (2015)CrossRef

M. Odelga, P. Stegagno, N. Kochanek, H.H. Bülthoff, A selfcontained teleoperated quadrotor: on-board state-estimation and indoor obstacle avoidance. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 7840–7847 (2018)

Yuan, C., Liu, Z., Zhang, Y.: Aerial images-based forest fire detection for firefighting using optical remote sensing techniques and unmanned aerial vehicles. J. Intell. Robot. Syst. 88(2–4), 635–654 (2017)CrossRef

Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., Torr, P.H.: Fully convolutional Siamese networks for object tracking. In: ECCV, pp. 850–865 (2016)

Valmadre, J., Bertinetto, L., Henriques, J. F., Vedaldi, A., Torr, P.H.S.: End-to-end representation learning for correlation filter based tracking. In: CVPR (2017)

Huang, C., Lucey, S., Ramanan, D.: Learning policies for adaptive tracking with deep feature cascades. In: ICCV (2017)

Li, B. Wu, W., Wang, Q., Zhang, F., Xing, J., Yan, J. SiamRPN++: evolution of Siamese visual tracking with very deep networks. In: CVPR, pp. 4282–4291 (2019)

Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)

Zhang, X., Zhou, X., Lin, M., Sun, J.: Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

10.

Li, B., Yan, J., Wu, W., Zhu, Z., Hu, X.: High performance visual tracking with Siamese region proposal network. In: CVPR (2018)

11.

Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: International Conference on Neural Information Processing Systems, pp. 91–99 (2015)

12.

Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25(2), 2012 (2012)

13.

He, K., Zhang, X., Ren, S., Sun, J.: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)

14.

Wang, Q., Zhang, L., Bertinetto, L., Hu, W., Torr, P.H.: Fast online object tracking and segmentation: a unifying approach. In: CVPR (2019)

15.

Lin, T.-Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)

16.

Huang, G., Liu, S., Van der Maaten, L., Weinberger, K.Q.: Condensenet: an efficient densenet using learned group convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

17.

Shi, W., Caballero, J., Huszár, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., Wang, Z.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)

18.

Ma, N., Zhang, X., Zheng, H.-T., Sun, J.: Shufflenet v2: practical guidelines for efficient CNN architecture design. In: European Conference on Computer Vision (ECCV) (2018)

Titel: MPSiam: A Fast Multiplexing Siamese Tracking Network
verfasst von: Donghao Li
Ce Shen
Jinxing Hu
Diping Yuan
Verlag: Springer Singapore
Buch: Advances in Smart Vehicular Technology, Transportation, Communication and Applications
Print ISBN: 978-981-16-4038-4

Electronic ISBN: 978-981-16-4039-1

Copyright-Jahr: 2022
DOI: https://doi.org/10.1007/978-981-16-4039-1_29

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Premium Partner