Published in: International Journal of Computer Vision 2/2015

April 1, 2015

Labeling Complete Surfaces in Scene Understanding

Authors: Ruiqi Guo, Derek Hoiem

Abstract

Scene understanding requires reasoning about both what we can see and what is occluded. We offer a simple and general approach to infer the labels of occluded background regions. Our approach combines estimates of the visible surrounding background, detected objects, and shape priors from transferred training regions. Using the same approach throughout, we demonstrate label inference for occluded background regions on three datasets: the outdoor StreetScenes dataset, the IndoorScene dataset, and the SUN09 dataset. We further extend the approach to 3D space to find layered support surfaces in RGB-Depth scenes. Our experiments and analysis show that our method outperforms competent baselines.
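The abstract names three cues (visible surrounding background, detected objects, and shape priors) without stating how they are combined. The following is only an illustrative sketch of one generic way to combine such cues for a single occluded pixel, using a weighted product of per-cue label distributions (log-linear fusion). It is not the authors' method: the function names, the equal weights, the label set, and all probability values are assumptions made up for this example.

```python
import math


def fuse_cues(cues, weights=None, eps=1e-6):
    """Combine per-cue label distributions with a weighted product.

    `cues` is a list of dicts mapping label -> probability.
    This is a generic log-linear fusion, assumed here for illustration.
    """
    if weights is None:
        weights = [1.0] * len(cues)
    labels = sorted(set().union(*(c.keys() for c in cues)))
    # Sum weighted log-probabilities; eps guards against log(0).
    log_scores = {
        label: sum(w * math.log(c.get(label, 0.0) + eps)
                   for c, w in zip(cues, weights))
        for label in labels
    }
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(log_scores.values())
    unnorm = {label: math.exp(s - peak) for label, s in log_scores.items()}
    total = sum(unnorm.values())
    return {label: v / total for label, v in unnorm.items()}


def infer_occluded_label(cues, weights=None):
    """Return the most probable background label under the fused distribution."""
    fused = fuse_cues(cues, weights)
    return max(fused, key=fused.get)


# Hypothetical cue outputs for one pixel hidden behind a detected object.
surrounding = {"road": 0.6, "sidewalk": 0.3, "building": 0.1}
object_context = {"road": 0.8, "sidewalk": 0.15, "building": 0.05}
shape_prior = {"road": 0.5, "sidewalk": 0.4, "building": 0.1}

cues = [surrounding, object_context, shape_prior]
fused = fuse_cues(cues)
label = infer_occluded_label(cues)
```

A weighted product is used here because it lets any single cue veto a label it considers very unlikely; a weighted average would be the obvious alternative if cues should only vote.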

Metadata
Title
Labeling Complete Surfaces in Scene Understanding
Authors
Ruiqi Guo
Derek Hoiem
Publication date
April 1, 2015
Publisher
Springer US
Published in
International Journal of Computer Vision / Issue 2/2015
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-014-0776-7
