Skip to main content
Erschienen in: Knowledge and Information Systems 10/2020

30.07.2020 | Regular Paper

Learning sequence-to-sequence affinity metric for near-online multi-object tracking

verfasst von: Weijiang Feng, Long Lan, Xiang Zhang, Zhigang Luo

Erschienen in: Knowledge and Information Systems | Ausgabe 10/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we propose a sequence-to-sequence affinity metric for the data association of near-online multi-object tracking. The proposed metric learns the affinity between track sequence consisting of the already associated detections and hypothesis sequence consisting of detections in the near future. With the potential hypothesis sequences, we leverage the idea that if a track sequence has a high affinity for a hypothesis sequence, and the hypothesis sequence also shares a close affinity for a current detection, then the affinity between the track sequence and the detection is high. By using the short hypothesis sequence as a “bridge”, the proposed sequence-to-sequence affinity metric enhances the conventional track sequence to detection affinity metric and improves its robustness to object occlusion and missing. Besides, in order to eliminate the negative effects of false alarms, we propose a false alarm model using both appearance and scale features of detection. The robustness of the proposed affinity metric allows us to use a simple greedy data association algorithm. Experimental results on the challenging MOT16 and MOT17 benchmarks demonstrate the effectiveness of our method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al (2016) Tensorflow: a system for large-scale machine learning. In: 12th \(\{\)USENIX\(\}\) symposium on operating systems design and implementation (\(\{\)OSDI\(\}\) 16), pp 265–283 Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al (2016) Tensorflow: a system for large-scale machine learning. In: 12th \(\{\)USENIX\(\}\) symposium on operating systems design and implementation (\(\{\)OSDI\(\}\) 16), pp 265–283
3.
Zurück zum Zitat Bochinski E, Eiselein V, Sikora T (2017) High-speed tracking-by-detection without using image information. In: IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–6 Bochinski E, Eiselein V, Sikora T (2017) High-speed tracking-by-detection without using image information. In: IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–6
4.
Zurück zum Zitat Breitenstein MD, Reichlin F, Leibe B, Koller-Meier E, Van Gool L (2011) Online multiperson tracking-by-detection from a single, uncalibrated camera. IEEE Trans Pattern Anal Mach Intell 33(9):1820–1833CrossRef Breitenstein MD, Reichlin F, Leibe B, Koller-Meier E, Van Gool L (2011) Online multiperson tracking-by-detection from a single, uncalibrated camera. IEEE Trans Pattern Anal Mach Intell 33(9):1820–1833CrossRef
5.
Zurück zum Zitat Chen J, Sheng H, Zhang Y, Xiong Z (2017) Enhancing detection model for multiple hypothesis tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 18–27 Chen J, Sheng H, Zhang Y, Xiong Z (2017) Enhancing detection model for multiple hypothesis tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 18–27
6.
Zurück zum Zitat Choi W (2015) Near-online multi-target tracking with aggregated local flow descriptor. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 3029–3037 Choi W (2015) Near-online multi-target tracking with aggregated local flow descriptor. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 3029–3037
7.
Zurück zum Zitat Chopra S, Hadsell R, LeCun Y (2005) Learning a similarity metric discriminatively, with application to face verification. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 539–546 Chopra S, Hadsell R, LeCun Y (2005) Learning a similarity metric discriminatively, with application to face verification. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 539–546
8.
Zurück zum Zitat Chu Q, Ouyang W, Li H, Wang X, Liu B, Yu N (2017) Online multi-object tracking using CNN-based single object tracker with spatial–temporal attention mechanism. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 4836–4845 Chu Q, Ouyang W, Li H, Wang X, Liu B, Yu N (2017) Online multi-object tracking using CNN-based single object tracker with spatial–temporal attention mechanism. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 4836–4845
9.
Zurück zum Zitat Dehghan A, Modiri Assari S, Shah M (2015) Gmmcp tracker: Globally optimal generalized maximum multi clique problem for multiple object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 4091–4099 Dehghan A, Modiri Assari S, Shah M (2015) Gmmcp tracker: Globally optimal generalized maximum multi clique problem for multiple object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 4091–4099
10.
Zurück zum Zitat Fagot-Bouquet L, Audigier R, Dhome Y, Lerasle F (2016) Improving multi-frame data association with sparse representations for robust near-online multi-object tracking. In: Proceedings of the European conference on computer vision. Springer, pp 774–790 Fagot-Bouquet L, Audigier R, Dhome Y, Lerasle F (2016) Improving multi-frame data association with sparse representations for robust near-online multi-object tracking. In: Proceedings of the European conference on computer vision. Springer, pp 774–790
11.
Zurück zum Zitat Feng W, Hu Z, Wu W, Yan J, Ouyang W (2019) Multi-object tracking with multiple cues and switcher-aware classification. arXiv:1901.06129 Feng W, Hu Z, Wu W, Yan J, Ouyang W (2019) Multi-object tracking with multiple cues and switcher-aware classification. arXiv:​1901.​06129
12.
Zurück zum Zitat Fu Z, Feng P, Angelini F, Chambers J, Naqvi SM (2018) Particle phd filter based multiple human tracking using online group-structured dictionary learning. IEEE Access 6:14764–14778CrossRef Fu Z, Feng P, Angelini F, Chambers J, Naqvi SM (2018) Particle phd filter based multiple human tracking using online group-structured dictionary learning. IEEE Access 6:14764–14778CrossRef
13.
Zurück zum Zitat Henschel R, Leal-Taixe L, Cremers D, Rosenhahn B (2018) Fusion of head and full-body detectors for multi-object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 1428–1437 Henschel R, Leal-Taixe L, Cremers D, Rosenhahn B (2018) Fusion of head and full-body detectors for multi-object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 1428–1437
14.
Zurück zum Zitat Keuper M, Tang S, Andres B, Brox T, Schiele B (2018) Motion segmentation and multiple object tracking by correlation co-clustering. IEEE Trans Pattern Anal Mach Intell 42(1):140–153CrossRef Keuper M, Tang S, Andres B, Brox T, Schiele B (2018) Motion segmentation and multiple object tracking by correlation co-clustering. IEEE Trans Pattern Anal Mach Intell 42(1):140–153CrossRef
15.
Zurück zum Zitat Kim C, Li F, Ciptadi A, Rehg JM (2015) Multiple hypothesis tracking revisited. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 4696–4704 Kim C, Li F, Ciptadi A, Rehg JM (2015) Multiple hypothesis tracking revisited. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 4696–4704
16.
Zurück zum Zitat Kim C, Li F, Rehg JM (2018) Multi-object tracking with neural gating using bilinear LSTM. In: Proceedings of the European conference on computer vision. Springer, pp 200–215 Kim C, Li F, Rehg JM (2018) Multi-object tracking with neural gating using bilinear LSTM. In: Proceedings of the European conference on computer vision. Springer, pp 200–215
17.
18.
Zurück zum Zitat Kutschbach T, Bochinski E, Eiselein V, Sikora T (2017) Sequential sensor fusion combining probability hypothesis density and kernelized correlation filters for multi-object tracking in video data. In: IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–5 Kutschbach T, Bochinski E, Eiselein V, Sikora T (2017) Sequential sensor fusion combining probability hypothesis density and kernelized correlation filters for multi-object tracking in video data. In: IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–5
19.
Zurück zum Zitat Lan L, Tao D, Gong C, Guan N, Luo Z (2016) Online multi-object tracking by quadratic pseudo-boolean optimization. In: International joint conference on artificial intelligence, pp 3396–3402 Lan L, Tao D, Gong C, Guan N, Luo Z (2016) Online multi-object tracking by quadratic pseudo-boolean optimization. In: International joint conference on artificial intelligence, pp 3396–3402
20.
Zurück zum Zitat Lan L, Wang X, Zhang S, Tao D, Gao W, Huang TS (2018) Interacting tracklets for multi-object tracking. IEEE Trans Image Process 27(9):4585–4597MathSciNetCrossRef Lan L, Wang X, Zhang S, Tao D, Gao W, Huang TS (2018) Interacting tracklets for multi-object tracking. IEEE Trans Image Process 27(9):4585–4597MathSciNetCrossRef
21.
Zurück zum Zitat Lan L, Wang X, Hua G, Huang TS, Tao D (2020) Semi-online multi-people tracking by re-identification. Int J Comput Vis 128:1937–1955MathSciNetCrossRef Lan L, Wang X, Hua G, Huang TS, Tao D (2020) Semi-online multi-people tracking by re-identification. Int J Comput Vis 128:1937–1955MathSciNetCrossRef
22.
Zurück zum Zitat Leal-Taixé L, Canton-Ferrer C, Schindler K (2016) Learning by tracking: siamese CNN for robust target association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 33–40 Leal-Taixé L, Canton-Ferrer C, Schindler K (2016) Learning by tracking: siamese CNN for robust target association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 33–40
23.
Zurück zum Zitat Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 2197–2206 Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 2197–2206
24.
Zurück zum Zitat Long C, Haizhou A, Zijie Z, Chong S (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. In: IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6 Long C, Haizhou A, Zijie Z, Chong S (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. In: IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6
25.
Zurück zum Zitat Ma C, Yang C, Yang F, Zhuang Y, Zhang Z, Jia H, Xie X (2018) Trajectory factory: Tracklet cleaving and re-connection by deep siamese bi-gru for multiple object tracking. In: IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6 Ma C, Yang C, Yang F, Zhuang Y, Zhang Z, Jia H, Xie X (2018) Trajectory factory: Tracklet cleaving and re-connection by deep siamese bi-gru for multiple object tracking. In: IEEE international conference on multimedia and expo (ICME). IEEE, pp 1–6
26.
Zurück zum Zitat Milan A, Rezatofighi SH, Dick A, Reid I, Schindler K (2017) Online multi-target tracking using recurrent neural networks. In: Proceedings of the AAAI conference on artificial intelligence, AAAI Milan A, Rezatofighi SH, Dick A, Reid I, Schindler K (2017) Online multi-target tracking using recurrent neural networks. In: Proceedings of the AAAI conference on artificial intelligence, AAAI
27.
Zurück zum Zitat Pirsiavash H, Ramanan D, Fowlkes CC (2011) Globally-optimal greedy algorithms for tracking a variable number of objects. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 1201–1208 Pirsiavash H, Ramanan D, Fowlkes CC (2011) Globally-optimal greedy algorithms for tracking a variable number of objects. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 1201–1208
29.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99 Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
30.
Zurück zum Zitat Sadeghian A, Alahi A, Savarese S (2017) Tracking the untrackable: learning to track multiple cues with long-term dependencies. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 300–311 Sadeghian A, Alahi A, Savarese S (2017) Tracking the untrackable: learning to track multiple cues with long-term dependencies. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 300–311
31.
Zurück zum Zitat Sanchez-Matilla R, Poiesi F, Cavallaro A (2016) Online multi-target tracking with strong and weak detections. In: Proceedings of the European conference on computer vision. Springer, pp 84–99 Sanchez-Matilla R, Poiesi F, Cavallaro A (2016) Online multi-target tracking with strong and weak detections. In: Proceedings of the European conference on computer vision. Springer, pp 84–99
32.
Zurück zum Zitat Son J, Baek M, Cho M, Han B (2017) Multi-object tracking with quadruplet convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 5620–5629 Son J, Baek M, Cho M, Han B (2017) Multi-object tracking with quadruplet convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 5620–5629
33.
Zurück zum Zitat Wang B, Wang L, Shuai B, Zuo Z, Liu T, Luk Chan K, Wang G (2016) Joint learning of convolutional neural networks and temporally constrained metrics for tracklet association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 1–8 Wang B, Wang L, Shuai B, Zuo Z, Liu T, Luk Chan K, Wang G (2016) Joint learning of convolutional neural networks and temporally constrained metrics for tracklet association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 1–8
34.
Zurück zum Zitat Wang J, Song Y, Leung T, Rosenberg C, Wang J, Philbin J, Chen B, Wu Y (2014) Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 1386–1393 Wang J, Song Y, Leung T, Rosenberg C, Wang J, Philbin J, Chen B, Wu Y (2014) Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 1386–1393
35.
Zurück zum Zitat Xiang Y, Alahi A, Savarese S (2015) Learning to track: online multi-object tracking by decision making. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 4705–4713 Xiang Y, Alahi A, Savarese S (2015) Learning to track: online multi-object tracking by decision making. In: Proceedings of the IEEE international conference on computer vision. IEEE, pp 4705–4713
36.
Zurück zum Zitat Yang B, Nevatia R (2014) Multi-target tracking by online learning a CRF model of appearance and motion patterns. Int J Comput Vis 107(2):203–217MathSciNetCrossRef Yang B, Nevatia R (2014) Multi-target tracking by online learning a CRF model of appearance and motion patterns. Int J Comput Vis 107(2):203–217MathSciNetCrossRef
37.
Zurück zum Zitat Yoon YC, Boragule A, Song YM, Yoon K, Jeon M (2018) Online multi-object tracking with historical appearance matching and scene adaptive detection filtering. In: IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–6 Yoon YC, Boragule A, Song YM, Yoon K, Jeon M (2018) Online multi-object tracking with historical appearance matching and scene adaptive detection filtering. In: IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–6
38.
Zurück zum Zitat Zhang L, Li Y, Nevatia R (2008) Global data association for multi-object tracking using network flows. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8 Zhang L, Li Y, Nevatia R (2008) Global data association for multi-object tracking using network flows. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8
39.
Zurück zum Zitat Zhu J, Yang H, Liu N, Kim M, Zhang W, Yang MH (2018) Online multi-object tracking with dual matching attention networks. In: Proceedings of the European conference on computer vision. Springer, pp 366–382 Zhu J, Yang H, Liu N, Kim M, Zhang W, Yang MH (2018) Online multi-object tracking with dual matching attention networks. In: Proceedings of the European conference on computer vision. Springer, pp 366–382
Metadaten
Titel
Learning sequence-to-sequence affinity metric for near-online multi-object tracking
verfasst von
Weijiang Feng
Long Lan
Xiang Zhang
Zhigang Luo
Publikationsdatum
30.07.2020
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 10/2020
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-020-01488-7

Weitere Artikel der Ausgabe 10/2020

Knowledge and Information Systems 10/2020 Zur Ausgabe