Skip to main content
Erschienen in: International Journal of Computer Vision 3/2017

12.10.2016

Learning Optimal Parameters for Multi-target Tracking with Contextual Interactions

verfasst von: Shaofei Wang, Charless C. Fowlkes

Erschienen in: International Journal of Computer Vision | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We describe an end-to-end framework for learning parameters of min-cost flow multi-target tracking problem with quadratic trajectory interactions including suppression of overlapping tracks and contextual cues about co-occurrence of different objects. Our approach utilizes structured prediction with a tracking-specific loss function to learn the complete set of model parameters. In this learning framework, we evaluate two different approaches to finding an optimal set of tracks under a quadratic model objective, one based on an linear program (LP) relaxation and the other based on novel greedy variants of dynamic programming that handle pairwise interactions. We find the greedy algorithms achieve almost equivalent accuracy to the LP relaxation while being up to 10\(\times \) faster than a commercial LP solver. We evaluate trained models on three challenging benchmarks. Surprisingly, we find that with proper parameter learning, our simple data association model without explicit appearance/motion reasoning is able to achieve comparable or better accuracy than many state-of-the-art methods that use far more complex motion features or appearance affinity metric learning.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
4
In a recent update of the benchmark server, the organizers changed their evaluation script to count detections in “don’t care” regions as false positives, which we believe is not consistent with general consensus of what “don’t care” regions mean. Thus we report the results up to 24 May 2016 which were evaluated using old evaluation script.
 
Literatur
Zurück zum Zitat Bae, S. H., & Yoon, K. J. (2014). Robust online multi-object tracking based on tracklet confidence and online discriminative appearance learning. In Proceedings of CVPR. Bae, S. H., & Yoon, K. J. (2014). Robust online multi-object tracking based on tracklet confidence and online discriminative appearance learning. In Proceedings of CVPR.
Zurück zum Zitat Bernardin, K., & Stiefelhagen, R. (2008). Evaluating multiple object tracking performance: The clear MOT metrics. Journal on Image Video Processing. doi:10.1155/2008/246309. Bernardin, K., & Stiefelhagen, R. (2008). Evaluating multiple object tracking performance: The clear MOT metrics. Journal on Image Video Processing. doi:10.​1155/​2008/​246309.
Zurück zum Zitat Brau, E., Guan, J., Simek, K., Del Pero, L., Reimer Dawson, C., & Barnard, K. (2013). Bayesian 3D tracking from monocular video. In Proceedings of ICCV. Brau, E., Guan, J., Simek, K., Del Pero, L., Reimer Dawson, C., & Barnard, K. (2013). Bayesian 3D tracking from monocular video. In Proceedings of ICCV.
Zurück zum Zitat Brendel, W., Amer, M., & Todorovic, S. (2011). Multiobject tracking as maximum weight independent set. In Proceedings of CVPR. Brendel, W., Amer, M., & Todorovic, S. (2011). Multiobject tracking as maximum weight independent set. In Proceedings of CVPR.
Zurück zum Zitat Butt, A. A., & Collins, R. T. (2013). Multi-target tracking by Lagrangian relaxation to min-cost network flow. In Proceedings of CVPR. Butt, A. A., & Collins, R. T. (2013). Multi-target tracking by Lagrangian relaxation to min-cost network flow. In Proceedings of CVPR.
Zurück zum Zitat Chari, V., Lacoste-Julien, S., Laptev, I., & Sivic, J. (2015). On pairwise costs for network flow multi-object tracking. In Proceedings of CVPR. Chari, V., Lacoste-Julien, S., Laptev, I., & Sivic, J. (2015). On pairwise costs for network flow multi-object tracking. In Proceedings of CVPR.
Zurück zum Zitat Choi, W. (2015). Near-online multi-target tracking with aggregated local flow descriptor. In Proceedings of ICCV. Choi, W. (2015). Near-online multi-target tracking with aggregated local flow descriptor. In Proceedings of ICCV.
Zurück zum Zitat Choi, W., & Savarese, S. (2012). A unified framework for multi-target tracking and collective activity recognition. In Proceedings of ECCV. Choi, W., & Savarese, S. (2012). A unified framework for multi-target tracking and collective activity recognition. In Proceedings of ECCV.
Zurück zum Zitat Dehghan, A., Tian, Y., Torr, P. H., & Shah, M. (2015). Target identity-aware network flow for online multiple target tracking. In Proceedings of CVPR. Dehghan, A., Tian, Y., Torr, P. H., & Shah, M. (2015). Target identity-aware network flow for online multiple target tracking. In Proceedings of CVPR.
Zurück zum Zitat Desai, C., Ramanan, D., & Fowlkes, C. (2009). Discriminative models for multi-class object layout. In Proceedings of ICCV. Desai, C., Ramanan, D., & Fowlkes, C. (2009). Discriminative models for multi-class object layout. In Proceedings of ICCV.
Zurück zum Zitat Dollár, P., Appel, R., Belongie, S., & Perona, P. (2014). Fast feature pyramids for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(8), 1532–1545.CrossRef Dollár, P., Appel, R., Belongie, S., & Perona, P. (2014). Fast feature pyramids for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(8), 1532–1545.CrossRef
Zurück zum Zitat Felzenszwalb, P. F., Girshick, R. B., McAllester, D., & Ramanan, D. (2010). Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9), 1627–1645.CrossRef Felzenszwalb, P. F., Girshick, R. B., McAllester, D., & Ramanan, D. (2010). Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9), 1627–1645.CrossRef
Zurück zum Zitat Finley, T., & Joachims, T. (2008). Training structural SVMs when exact inference is intractable. In Proceedings of ICML. Finley, T., & Joachims, T. (2008). Training structural SVMs when exact inference is intractable. In Proceedings of ICML.
Zurück zum Zitat Geiger, A., Lenz, P., & Urtasun, R. (2012). Are we ready for autonomous driving? The KITTI vision benchmark suite. In Proceedings of CVPR. Geiger, A., Lenz, P., & Urtasun, R. (2012). Are we ready for autonomous driving? The KITTI vision benchmark suite. In Proceedings of CVPR.
Zurück zum Zitat Ahuja, R. K., Magnanti, T. L., & Orlin, J. B. (1993). Network flows: Theory, algorithms, and applications. Upper Saddle River, NJ: Prentice-Hall, Inc.MATH Ahuja, R. K., Magnanti, T. L., & Orlin, J. B. (1993). Network flows: Theory, algorithms, and applications. Upper Saddle River, NJ: Prentice-Hall, Inc.MATH
Zurück zum Zitat Geiger, A., Lauer, M., Wojek, C., Stiller, C., & Urtasun, R. (2014). 3D traffic scene understanding from movable platforms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(5), 1012–1025.CrossRef Geiger, A., Lauer, M., Wojek, C., Stiller, C., & Urtasun, R. (2014). 3D traffic scene understanding from movable platforms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(5), 1012–1025.CrossRef
Zurück zum Zitat Geiger, A., Lenz, P., Stiller, C., & Urtasun, R. (2013). Vision meets robotics: The KITTI dataset. International Journal of Robotics Research, 32(11), 1231–1237.CrossRef Geiger, A., Lenz, P., Stiller, C., & Urtasun, R. (2013). Vision meets robotics: The KITTI dataset. International Journal of Robotics Research, 32(11), 1231–1237.CrossRef
Zurück zum Zitat Joachims, T., Finley, T., & Yu, C. N. (2009). Cutting-plane training of structural SVMs. Machine Learning, 77(1), 27–59.CrossRefMATH Joachims, T., Finley, T., & Yu, C. N. (2009). Cutting-plane training of structural SVMs. Machine Learning, 77(1), 27–59.CrossRefMATH
Zurück zum Zitat Joulin, A., Tang, K., & Fei-Fei, L. (2014). Efficient image and video co-localization with Frank–Wolfe algorithm. In Proceedings of ECCV. Joulin, A., Tang, K., & Fei-Fei, L. (2014). Efficient image and video co-localization with Frank–Wolfe algorithm. In Proceedings of ECCV.
Zurück zum Zitat Kim, S., Kwak, S., Feyereisl, J., & Han, B. (2013). Online multi-target tracking by large margin structured learning. In Proceedings of ACCV. Kim, S., Kwak, S., Feyereisl, J., & Han, B. (2013). Online multi-target tracking by large margin structured learning. In Proceedings of ACCV.
Zurück zum Zitat Kim, C., Li, F., Ciptadi, A., & Rehg, J. M. (2015). Multiple hypothesis tracking revisited. In Proceedings of ICCV. Kim, C., Li, F., Ciptadi, A., & Rehg, J. M. (2015). Multiple hypothesis tracking revisited. In Proceedings of ICCV.
Zurück zum Zitat Lacoste-Julien, S., Taskar, B., Klein, D., & Jordan, M. I. (2006). Word alignment via quadratic assignment. In Proceedings of HLT-NAACL. Lacoste-Julien, S., Taskar, B., Klein, D., & Jordan, M. I. (2006). Word alignment via quadratic assignment. In Proceedings of HLT-NAACL.
Zurück zum Zitat Leal-Taixé, L., Fenzi, M., Kuznetsova, A., Rosenhahn, B., & Savarese, S. (2014). Learning an image-based motion context for multiple people tracking. In Proceedings of CVPR. Leal-Taixé, L., Fenzi, M., Kuznetsova, A., Rosenhahn, B., & Savarese, S. (2014). Learning an image-based motion context for multiple people tracking. In Proceedings of CVPR.
Zurück zum Zitat Leal-Taixé, L., Milan, A., Reid, I., Roth, S., & Schindler, K. (2015). MOTChallenge 2015: Towards a benchmark for multi-target tracking. arXiv:1504.01942 [cs]. Leal-Taixé, L., Milan, A., Reid, I., Roth, S., & Schindler, K. (2015). MOTChallenge 2015: Towards a benchmark for multi-target tracking. arXiv:​1504.​01942 [cs].
Zurück zum Zitat Lenz, P., Geiger, A., & Urtasun, R. (2015). Followme: Efficient online min-cost flow tracking with bounded memory and computation. In Proceedings of ICCV. Lenz, P., Geiger, A., & Urtasun, R. (2015). Followme: Efficient online min-cost flow tracking with bounded memory and computation. In Proceedings of ICCV.
Zurück zum Zitat Li, Y., Huang, C., & Nevatia, R. (2009). Learning to associate: Hybridboosted multi-target tracker for crowded scene. In Proceedings of CVPR. Li, Y., Huang, C., & Nevatia, R. (2009). Learning to associate: Hybridboosted multi-target tracker for crowded scene. In Proceedings of CVPR.
Zurück zum Zitat Liu, C. (2009). Beyond pixels: Exploring new representations and applications for motion analysis. PhD Thesis, Massachusetts Institute of Technology. Liu, C. (2009). Beyond pixels: Exploring new representations and applications for motion analysis. PhD Thesis, Massachusetts Institute of Technology.
Zurück zum Zitat Lou, X., & Hamprecht, F. A. (2011). Structured learning for cell tracking. In Proceedings of NIPS. Lou, X., & Hamprecht, F. A. (2011). Structured learning for cell tracking. In Proceedings of NIPS.
Zurück zum Zitat Milan, A., Leal-Taixé, L., Schindler, K., & Reid, I. (2015). Joint tracking and segmentation of multiple targets. In Proceedings of CVPR. Milan, A., Leal-Taixé, L., Schindler, K., & Reid, I. (2015). Joint tracking and segmentation of multiple targets. In Proceedings of CVPR.
Zurück zum Zitat Milan, A., Schindler, K., & Roth, S. (2012). Discrete–continuous optimization for multi-target tracking. In Proceedings of CVPR. Milan, A., Schindler, K., & Roth, S. (2012). Discrete–continuous optimization for multi-target tracking. In Proceedings of CVPR.
Zurück zum Zitat Milan, A., Schindler, K., & Roth, S. (2013). Detection- and trajectory-level exclusion in multiple object tracking. In Proceedings of CVPR. Milan, A., Schindler, K., & Roth, S. (2013). Detection- and trajectory-level exclusion in multiple object tracking. In Proceedings of CVPR.
Zurück zum Zitat Milan, A., Schindler, K., & Roth, S. (2016). Multi-target tracking by discrete–continuous energy minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(10), 2054–2068. Milan, A., Schindler, K., & Roth, S. (2016). Multi-target tracking by discrete–continuous energy minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(10), 2054–2068.
Zurück zum Zitat Milan, A., Roth, S., & Schindler, K. (2014). Continuous energy minimization for multitarget tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(1), 58–72.CrossRef Milan, A., Roth, S., & Schindler, K. (2014). Continuous energy minimization for multitarget tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(1), 58–72.CrossRef
Zurück zum Zitat Pirsiavash, H., Ramanan, D., & Fowlkes, C. C. (2011). Globally-optimal greedy algorithms for tracking a variable number of objects. In Proceedings of CVPR. Pirsiavash, H., Ramanan, D., & Fowlkes, C. C. (2011). Globally-optimal greedy algorithms for tracking a variable number of objects. In Proceedings of CVPR.
Zurück zum Zitat Segal, A. V., & Reid, I. (2013). Latent data association: Bayesian model selection for multi-target tracking. In Proceedings of ICCV. Segal, A. V., & Reid, I. (2013). Latent data association: Bayesian model selection for multi-target tracking. In Proceedings of ICCV.
Zurück zum Zitat Solera, F., Calderara, S., & Cucchiara, R. (2015). Learning to divide and conquer for online multi-target tracking. In Proceedings of ICCV. Solera, F., Calderara, S., & Cucchiara, R. (2015). Learning to divide and conquer for online multi-target tracking. In Proceedings of ICCV.
Zurück zum Zitat Szummer, M., Kohli, P., & Hoiem, D. (2008). Learning CRFs using graph cuts. In ECCV. Szummer, M., Kohli, P., & Hoiem, D. (2008). Learning CRFs using graph cuts. In ECCV.
Zurück zum Zitat Tang, S., Andres, B., Andriluka, M., & Schiele, B. (2015). Subgraph decomposition for multi-target tracking. In Proceedings of CVPR. Tang, S., Andres, B., Andriluka, M., & Schiele, B. (2015). Subgraph decomposition for multi-target tracking. In Proceedings of CVPR.
Zurück zum Zitat Tang, S., Andriluka, M., Milan, A., Schindler, K., Roth, S., & Schiele, B. (2013). Learning people detectors for tracking in crowded scenes. In Proceedings of ICCV. Tang, S., Andriluka, M., Milan, A., Schindler, K., Roth, S., & Schiele, B. (2013). Learning people detectors for tracking in crowded scenes. In Proceedings of ICCV.
Zurück zum Zitat Taskar, B., Guestrin, C., & Koller, D. (2003). Max-margin Markov networks. In Proceedings of NIPS. Taskar, B., Guestrin, C., & Koller, D. (2003). Max-margin Markov networks. In Proceedings of NIPS.
Zurück zum Zitat Wang, S., & Fowlkes, C. C. (2015). Learning optimal parameters for multi-target tracking. In Proceedings of BMVC. Wang, S., & Fowlkes, C. C. (2015). Learning optimal parameters for multi-target tracking. In Proceedings of BMVC.
Zurück zum Zitat Wang, B., Wang, G., Luk Chan, K., & Wang, L. (2014). Tracklet association with online target-specific metric learning. In Proceedings of CVPR. Wang, B., Wang, G., Luk Chan, K., & Wang, L. (2014). Tracklet association with online target-specific metric learning. In Proceedings of CVPR.
Zurück zum Zitat Wang, X., Yang, M., Zhu, S., & Lin, Y. (2013). Regionlets for generic object detection. In Proceedings of ICCV. Wang, X., Yang, M., Zhu, S., & Lin, Y. (2013). Regionlets for generic object detection. In Proceedings of ICCV.
Zurück zum Zitat Wu, Z., Thangali, A., Sclaroff, S., & Betke, M. (2012). Coupling detection and data association for multiple object tracking. In Proceedings of CVPR. Wu, Z., Thangali, A., Sclaroff, S., & Betke, M. (2012). Coupling detection and data association for multiple object tracking. In Proceedings of CVPR.
Zurück zum Zitat Xiang, Y., Alahi, A., & Savarese, S. (2015). Learning to track: Online multi-object tracking by decision making. In Proceedings of ICCV. Xiang, Y., Alahi, A., & Savarese, S. (2015). Learning to track: Online multi-object tracking by decision making. In Proceedings of ICCV.
Zurück zum Zitat Yang, B., & Nevatia, R. (2012). An online learned CRF model for multi-target tracking. In Proceedings of CVPR. Yang, B., & Nevatia, R. (2012). An online learned CRF model for multi-target tracking. In Proceedings of CVPR.
Zurück zum Zitat Yoon, J. H., Yang, M., Lim, J., & Yoon, K. (2015). Bayesian multi-object tracking using motion context from multiple objects. In Proceedings of WACV. Yoon, J. H., Yang, M., Lim, J., & Yoon, K. (2015). Bayesian multi-object tracking using motion context from multiple objects. In Proceedings of WACV.
Zurück zum Zitat Zaied, A. N. H., & Shawky, L. A. E. (2014). A survey of quadratic assignment problems. International Journal of Computer Applications, 101(6), 28–36. Zaied, A. N. H., & Shawky, L. A. E. (2014). A survey of quadratic assignment problems. International Journal of Computer Applications, 101(6), 28–36.
Zurück zum Zitat Zhang, L., Li, Y., & Nevatia, R. (2008). Global data association for multi-object tracking using network flows. In Proceedings of CVPR. Zhang, L., Li, Y., & Nevatia, R. (2008). Global data association for multi-object tracking using network flows. In Proceedings of CVPR.
Metadaten
Titel
Learning Optimal Parameters for Multi-target Tracking with Contextual Interactions
verfasst von
Shaofei Wang
Charless C. Fowlkes
Publikationsdatum
12.10.2016
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 3/2017
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-016-0960-z

Weitere Artikel der Ausgabe 3/2017

International Journal of Computer Vision 3/2017 Zur Ausgabe