Skip to main content

2016 | OriginalPaper | Buchkapitel

It’s Moving! A Probabilistic Model for Causal Motion Segmentation in Moving Camera Videos

verfasst von : Pia Bideau, Erik Learned-Miller

Erschienen in: Computer Vision – ECCV 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The human ability to detect and segment moving objects works in the presence of multiple objects, complex background geometry, motion of the observer, and even camouflage. In addition to all of this, the ability to detect motion is nearly instantaneous. While there has been much recent progress in motion segmentation, it still appears we are far from human capabilities. In this work, we derive from first principles a likelihood function for assessing the probability of an optical flow vector given the 2D motion direction of an object. This likelihood uses a novel combination of the angle and magnitude of the optical flow to maximize the information about how objects are moving differently. Using this new likelihood and several innovations in initialization, we develop a motion segmentation algorithm that beats current state-of-the-art methods by a large margin. We compare to five state-of-the-art methods on two established benchmarks, and a third new data set of camouflaged animals, which we introduce to push motion segmentation to the next level.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
We use the http://​www.​vlfeat.​org/​api/​slic.​html code with regionSize = 20 and regularizer = 0.5.
 
Literatur
1.
Zurück zum Zitat Torr, P.H.: Geometric motion segmentation and model selection. Philos. Trans. Royal Soc. Lond. Math. Phys. Eng. Sci. 356(1740), 1321–1340 (1998)CrossRefMATHMathSciNet Torr, P.H.: Geometric motion segmentation and model selection. Philos. Trans. Royal Soc. Lond. Math. Phys. Eng. Sci. 356(1740), 1321–1340 (1998)CrossRefMATHMathSciNet
2.
Zurück zum Zitat Brox, T., Malik, J.: Object segmentation by long term analysis of point trajectories. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 282–295. Springer, Heidelberg (2010)CrossRef Brox, T., Malik, J.: Object segmentation by long term analysis of point trajectories. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 282–295. Springer, Heidelberg (2010)CrossRef
3.
Zurück zum Zitat Tron, R., Vidal, R.: A benchmark for the comparison of 3-D motion segmentation algorithms. In: CVPR (2007) Tron, R., Vidal, R.: A benchmark for the comparison of 3-D motion segmentation algorithms. In: CVPR (2007)
4.
Zurück zum Zitat Narayana, M., Hanson, A., Learned-Miller, E.: Coherent motion segmentation in moving camera videos using optical flow orientations. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 1577–1584. IEEE (2013) Narayana, M., Hanson, A., Learned-Miller, E.: Coherent motion segmentation in moving camera videos using optical flow orientations. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 1577–1584. IEEE (2013)
5.
Zurück zum Zitat Grundmann, M., Kwatra, V., Han, M., Essa, I.: Efficient hierarchical graph-based video segmentation. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2141–2148. IEEE (2010) Grundmann, M., Kwatra, V., Han, M., Essa, I.: Efficient hierarchical graph-based video segmentation. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2141–2148. IEEE (2010)
6.
Zurück zum Zitat Lezama, J., Alahari, K., Sivic, J., Laptev, I.: Track to the future: spatio-temporal video segmentation with long-range motion cues. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2011) Lezama, J., Alahari, K., Sivic, J., Laptev, I.: Track to the future: spatio-temporal video segmentation with long-range motion cues. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2011)
7.
Zurück zum Zitat Kumar, M.P., Torr, P.H., Zisserman, A.: Learning layered motion segmentations of video. Int. J. Comput. Vis. 76(3), 301–319 (2008)CrossRef Kumar, M.P., Torr, P.H., Zisserman, A.: Learning layered motion segmentations of video. Int. J. Comput. Vis. 76(3), 301–319 (2008)CrossRef
8.
Zurück zum Zitat Irani, M., Rousso, B., Peleg, S.: Computing occluding and transparent motions. Int. J. Comput. Vis. 12, 5–16 (1994)CrossRef Irani, M., Rousso, B., Peleg, S.: Computing occluding and transparent motions. Int. J. Comput. Vis. 12, 5–16 (1994)CrossRef
9.
Zurück zum Zitat Ren, Y., Chua, C.S., Ho, Y.K.: Statistical background modeling for non-stationary camera. Pattern Recogn. Lett. 24, 183–196 (2003)CrossRefMATH Ren, Y., Chua, C.S., Ho, Y.K.: Statistical background modeling for non-stationary camera. Pattern Recogn. Lett. 24, 183–196 (2003)CrossRefMATH
10.
Zurück zum Zitat Sheikh, Y., Javed, O., Kanade, T.: Background subtraction for freely moving cameras. In: IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27–October 4 2009, pp. 1219–1225. IEEE (2009) Sheikh, Y., Javed, O., Kanade, T.: Background subtraction for freely moving cameras. In: IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27–October 4 2009, pp. 1219–1225. IEEE (2009)
11.
Zurück zum Zitat Elqursh, A., Elgammal, A.: Online moving camera background subtraction. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 228–241. Springer, Heidelberg (2012)CrossRef Elqursh, A., Elgammal, A.: Online moving camera background subtraction. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 228–241. Springer, Heidelberg (2012)CrossRef
12.
Zurück zum Zitat Ochs, P., Brox, T.: Higher order motion models and spectral clustering. In: CVPR (2012) Ochs, P., Brox, T.: Higher order motion models and spectral clustering. In: CVPR (2012)
13.
Zurück zum Zitat Kwak, S., Lim, T., Nam, W., Han, B., Han, J.H.: Generalized background subtraction based on hybrid inference by belief propagation and Bayesian filtering. In: ICCV (2011) Kwak, S., Lim, T., Nam, W., Han, B., Han, J.H.: Generalized background subtraction based on hybrid inference by belief propagation and Bayesian filtering. In: ICCV (2011)
14.
Zurück zum Zitat Rahmati, H., Dragon, R., Aamo, O.M., Gool, L., Adde, L.: Motion segmentation with weak labeling priors. In: Jiang, X., Hornegger, J., Koch, R. (eds.) GCPR 2014. LNCS, vol. 8753, pp. 159–171. Springer, Heidelberg (2014). doi:10.1007/978-3-319-11752-2_13 Rahmati, H., Dragon, R., Aamo, O.M., Gool, L., Adde, L.: Motion segmentation with weak labeling priors. In: Jiang, X., Hornegger, J., Koch, R. (eds.) GCPR 2014. LNCS, vol. 8753, pp. 159–171. Springer, Heidelberg (2014). doi:10.​1007/​978-3-319-11752-2_​13
15.
Zurück zum Zitat Jain, S.D., Grauman, K.: Supervoxel-consistent foreground propagation in video. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part IV. LNCS, vol. 8692, pp. 656–671. Springer, Heidelberg (2014) Jain, S.D., Grauman, K.: Supervoxel-consistent foreground propagation in video. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part IV. LNCS, vol. 8692, pp. 656–671. Springer, Heidelberg (2014)
16.
Zurück zum Zitat Zamalieva, D., Yilmaz, A., Davis, J.W.: A multi-transformational model for background subtraction with moving cameras. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part I. LNCS, vol. 8689, pp. 803–817. Springer, Heidelberg (2014) Zamalieva, D., Yilmaz, A., Davis, J.W.: A multi-transformational model for background subtraction with moving cameras. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part I. LNCS, vol. 8689, pp. 803–817. Springer, Heidelberg (2014)
17.
Zurück zum Zitat Papazoglou, A., Ferrari, V.: Fast object segmentation in unconstrained video. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 1777–1784. IEEE (2013) Papazoglou, A., Ferrari, V.: Fast object segmentation in unconstrained video. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 1777–1784. IEEE (2013)
18.
Zurück zum Zitat Keuper, M., Andres, B., Brox, T.: Motion trajectory segmentation via minimum cost multicuts. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3271–3279 (2015) Keuper, M., Andres, B., Brox, T.: Motion trajectory segmentation via minimum cost multicuts. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3271–3279 (2015)
19.
Zurück zum Zitat Taylor, B., Karasev, V., Soatto, S.: Causal video object segmentation from persistence of occlusions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4268–4276 (2015) Taylor, B., Karasev, V., Soatto, S.: Causal video object segmentation from persistence of occlusions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4268–4276 (2015)
20.
Zurück zum Zitat Fragkiadaki, K., Zhang, G., Shi, J.: Video segmentation by tracing discontinuities in a trajectory embedding. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1846–1853. IEEE (2012) Fragkiadaki, K., Zhang, G., Shi, J.: Video segmentation by tracing discontinuities in a trajectory embedding. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1846–1853. IEEE (2012)
21.
Zurück zum Zitat Sawhney, H.S., Guo, Y., Asmuth, J., Kumar, R.: Independent motion detection in 3D scenes. In: The Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 1, pp. 612–619. IEEE (1999) Sawhney, H.S., Guo, Y., Asmuth, J., Kumar, R.: Independent motion detection in 3D scenes. In: The Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 1, pp. 612–619. IEEE (1999)
22.
Zurück zum Zitat Dey, S., Reilly, V., Saleemi, I., Shah, M.: Detection of independently moving objects in non-planar scenes via multi-frame monocular epipolar constraint. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 860–873. Springer, Heidelberg (2012)CrossRef Dey, S., Reilly, V., Saleemi, I., Shah, M.: Detection of independently moving objects in non-planar scenes via multi-frame monocular epipolar constraint. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 860–873. Springer, Heidelberg (2012)CrossRef
23.
Zurück zum Zitat Namdev, R.K., Kundu, A., Krishna, K.M., Jawahar, C.V.: Motion segmentation of multiple objects from a freely moving monocular camera. In: International Conference on Robotics and Automation (2012) Namdev, R.K., Kundu, A., Krishna, K.M., Jawahar, C.V.: Motion segmentation of multiple objects from a freely moving monocular camera. In: International Conference on Robotics and Automation (2012)
24.
Zurück zum Zitat Csurka, G., Bouthemy, P.: Direct identification of moving objects and background from 2D motion models. In: The Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 1, pp. 566–571. IEEE (1999) Csurka, G., Bouthemy, P.: Direct identification of moving objects and background from 2D motion models. In: The Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 1, pp. 566–571. IEEE (1999)
25.
Zurück zum Zitat Sharma, R., Aloimonos, Y.: Early detection of independent motion from active control of normal image flow patterns. IEEE Trans. Syst. Man Cybernet. B (Cybernetics) 26(1), 42–52 (1996)CrossRef Sharma, R., Aloimonos, Y.: Early detection of independent motion from active control of normal image flow patterns. IEEE Trans. Syst. Man Cybernet. B (Cybernetics) 26(1), 42–52 (1996)CrossRef
26.
Zurück zum Zitat Elhamifar, E., Vidal, R.: Sparse subspace clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 2790–2797. IEEE (2009) Elhamifar, E., Vidal, R.: Sparse subspace clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 2790–2797. IEEE (2009)
27.
Zurück zum Zitat Horn, B.K.: Projective geometry considered harmful (1999) Horn, B.K.: Projective geometry considered harmful (1999)
28.
Zurück zum Zitat Ogale, A.S., Fermüller, C., Aloimonos, Y.: Motion segmentation using occlusions. IEEE Trans. Pattern Anal. Mach. Intell. 27(6), 988–992 (2005)CrossRef Ogale, A.S., Fermüller, C., Aloimonos, Y.: Motion segmentation using occlusions. IEEE Trans. Pattern Anal. Mach. Intell. 27(6), 988–992 (2005)CrossRef
29.
Zurück zum Zitat Horn, B.: Robot Vision. MIT Press, Cambridge (1986) Horn, B.: Robot Vision. MIT Press, Cambridge (1986)
30.
Zurück zum Zitat Bruss, A.R., Horn, B.K.: Passive navigation. Comput. Vis. Graph. Image Process. 21(1), 3–20 (1983)CrossRef Bruss, A.R., Horn, B.K.: Passive navigation. Comput. Vis. Graph. Image Process. 21(1), 3–20 (1983)CrossRef
31.
Zurück zum Zitat Sun, D., Roth, S., Black, M.J.: Secrets of optical flow estimation and their principles. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2432–2439. IEEE (2010) Sun, D., Roth, S., Black, M.J.: Secrets of optical flow estimation and their principles. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2432–2439. IEEE (2010)
32.
Zurück zum Zitat Narayana, M., Hanson, A., Learned-Miller, E.G.: Background subtraction: separating the modeling and the inference. Mach. Vis. Appl. 25(5), 1163–1174 (2014)CrossRef Narayana, M., Hanson, A., Learned-Miller, E.G.: Background subtraction: separating the modeling and the inference. Mach. Vis. Appl. 25(5), 1163–1174 (2014)CrossRef
33.
Zurück zum Zitat Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)CrossRefMathSciNet Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)CrossRefMathSciNet
34.
Zurück zum Zitat Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Susstrunk, S.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2274–2282 (2012)CrossRef Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Susstrunk, S.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2274–2282 (2012)CrossRef
35.
Zurück zum Zitat Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man, Cybernet. 9, 62–66 (1979)CrossRef Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man, Cybernet. 9, 62–66 (1979)CrossRef
36.
Zurück zum Zitat Wang, J.Y., Adelson, E.H.: Representing moving images with layers. IEEE Trans. Image Process 3(5), 625–638 (1994)CrossRef Wang, J.Y., Adelson, E.H.: Representing moving images with layers. IEEE Trans. Image Process 3(5), 625–638 (1994)CrossRef
37.
Zurück zum Zitat Powers, D.M.: Evaluation: from precision, recall and f-measure to ROC, informedness, markedness and correlation. Technical report SIE-07-001. Flinders University, Adelaide (2007) Powers, D.M.: Evaluation: from precision, recall and f-measure to ROC, informedness, markedness and correlation. Technical report SIE-07-001. Flinders University, Adelaide (2007)
Metadaten
Titel
It’s Moving! A Probabilistic Model for Causal Motion Segmentation in Moving Camera Videos
verfasst von
Pia Bideau
Erik Learned-Miller
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46484-8_26