Stacked Hierarchical Labeling

Munoz, Daniel; Bagnell, J. Andrew; Hebert, Martial

doi:10.1007/978-3-642-15567-3_5

Daniel Munoz¹⁹,
J. Andrew Bagnell¹⁹ &
Martial Hebert¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6316))

Included in the following conference series:

European Conference on Computer Vision

6696 Accesses
61 Citations

Abstract

In this work we propose a hierarchical approach for labeling semantic objects and regions in scenes. Our approach is reminiscent of early vision literature in that we use a decomposition of the image in order to encode relational and spatial information. In contrast to much existing work on structured prediction for scene understanding, we bypass a global probabilistic model and instead directly train a hierarchical inference procedure inspired by the message passing mechanics of some approximate inference procedures in graphical models. This approach mitigates both the theoretical and empirical difficulties of learning probabilistic models when exact inference is intractable. In particular, we draw from recent work in machine learning and break the complex inference process into a hierarchical series of simple machine learning subproblems. Each subproblem in the hierarchy is designed to capture the image and contextual statistics in the scene. This hierarchy spans coarse-to-fine regions and explicitly models the mixtures of semantic labels that may be present due to imperfect segmentation. To avoid cascading of errors and overfitting, we train the learning problems in sequence to ensure robustness to likely errors earlier in the inference sequence and leverage the stacking approach developed by Cohen et al

Download to read the full chapter text

Chapter PDF

Fast, Exact and Multi-scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

Unified Perceptual Parsing for Scene Understanding

Learning deep representations for semantic image parsing: a comprehensive overview

Article 30 August 2018

Lili Huang, Jiefeng Peng, … Liang Lin

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: From contours to regions: An empirical evaluation. In: CVPR (2009)
Google Scholar
Barbu, A.: Training an active random field for real-time image denoising. IEEE Trans. on Image Processing 18(11) (2009)
Google Scholar
Bouman, C.A., Shapiro, M.: A multiscale random field model for bayesian image segmentation. IEEE Trans. on Image Processing 3(2) (1994)
Google Scholar
Cohen, W.W., Carvalho, V.R.: Stacked sequential learning. In: IJCAI (2005)
Google Scholar
Daume III, H., Langford, J., Marcu, D.: Search-based structured prediction. Machine Learning Journal 75(3) (2009)
Google Scholar
Feng, X., Williams, C.K.I., Felderhof, S.N.: Combining belief networks and neural networks for scene segmentation. IEEE T-PAMI 24(4) (2002)
Google Scholar
Gould, S., Rodgers, J., Cohen, D., Elidan, G., Koller, D.: Multi-class segmentation with relative location prior. IJCV 80(3) (2008)
Google Scholar
Gould, S., Fulton, R., Koller, D.: Decomposing a scene into geometric and semantically consistent regions. In: ICCV (2009)
Google Scholar
Gould, S., Russakovsky, O., Goodfellow, I., Baumstarck, P., Ng, A.Y., Koller, D.: The stair vision library, v2.3 (2009), http://ai.stanford.edu/~sgould/svl
Heitz, G., Gould, S., Saxena, A., Koller, D.: Cascaded classification models: Combining models for holistic scene understanding. In: NIPS (2008)
Google Scholar
Kakade, S., Teh, Y.W., Roweis, S.: An alternate objective function for markovian fields. In: ICML (2002)
Google Scholar
Kohli, P., Ladicky, L., Torr, P.H.: Robust higher order potentials for enforcing label consistency. IJCV 82(3) (2009)
Google Scholar
Kolmogorov, V., Zabih, R.: What energy functions can be minimized via graph cuts? IEEE T-PAMI 26(2) (2004)
Google Scholar
Komodakis, N., Paragios, N., Tziritas, G.: Mrf energy minimization and beyond via dual decomposition. IEEE T-PAMI (in press)
Google Scholar
Kou, Z., Cohen, W.W.: Stacked graphical models for efficient inference in markov random fields. In: SDM (2007)
Google Scholar
Kulesza, A., Pereira, F.: Structured learning with approximate inference. In: NIPS (2007)
Google Scholar
Kumar, S., August, J., Hebert, M.: Exploiting inference for approximate parameter learning in discriminative fields: An empirical study. In: Rangarajan, A., Vemuri, B.C., Yuille, A.L. (eds.) EMMCVPR 2005. LNCS, vol. 3757, pp. 153–168. Springer, Heidelberg (2005)
Chapter Google Scholar
Kumar, S., Hebert, M.: A hierarchical field framework for unified context-based classification. In: ICCV (2005)
Google Scholar
Kumar, S., Hebert, M.: Discriminative random fields. IJCV 68(2) (2006)
Google Scholar
Ladicky, L., Russell, C., Kohli, P., Torr, P.: Associative hierarchical crfs for object class image segmentation. In: ICCV (2009)
Google Scholar
Lim, J.J., Arbelaez, P., Gu, C., Malik, J.: Context by region ancestry. In: ICCV (2009)
Google Scholar
Maire, M., Arbelaez, P., Fowlkes, C., Malik, J.: Using contours to detect and localize junctions in natural images. In: CVPR (2008)
Google Scholar
Ohta, Y., Kanade, T., Sakai, T.: An analysis system for scenes containing objects with substructures. In: Int’l. Joint Conference on Pattern Recognitions (1978)
Google Scholar
Ratliff, N., Silver, D., Bagnell, J.A.: Learning to search: Functional gradient techniques for imitation learning. Autonomous Robots 27(1) (2009)
Google Scholar
Ross, S., Bagnell, J.A.: Efficient reductions for imitation learning. In: AIStats (2010)
Google Scholar
Shotton, J., Winn, J., Rother, C., Criminisi, A.: Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context. IJCV 81(1) (2009)
Google Scholar
Tu, Z., Bai, X.: Auto-context and its application to high-level vision tasks and 3d brain image segmentation. T-PAMI 18(11) (2009)
Google Scholar
Viola, P., Jones, M.J.: Robust real-time face detection. IJCV 57(2) (2004)
Google Scholar
Wainwright, M.J.: Estimating the “wrong” graphical model: Benefits in the computation-limited setting. JMLR 7(11) (2006)
Google Scholar
Wolpert, D.H.: Stacked generalization. Neural Networks 5(2) (1992)
Google Scholar
Zhang, L., Ji, Q.: Image segmentation with a unified graphical model. T-PAMI 32(8) (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

The Robotics Institute, Carnegie Mellon University,
Daniel Munoz, J. Andrew Bagnell & Martial Hebert

Authors

Daniel Munoz
View author publications
You can also search for this author in PubMed Google Scholar
J. Andrew Bagnell
View author publications
You can also search for this author in PubMed Google Scholar
Martial Hebert
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

GRASP Laboratory, University of Pennsylvania, 3330 Walnut Street, 19104, Philadelphia, PA, USA
Kostas Daniilidis
School of Electrical and Computer Engineering, National Technical University of Athens, 15773, Athens, Greece
Petros Maragos
Department of Applied Mathematics, Ecole Centrale de Paris, Grande Voie des Vignes, 92295, Chatenay-Malabry, France
Nikos Paragios

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Munoz, D., Bagnell, J.A., Hebert, M. (2010). Stacked Hierarchical Labeling. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15567-3_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-15567-3_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15566-6
Online ISBN: 978-3-642-15567-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Stacked Hierarchical Labeling

Abstract

Chapter PDF

Similar content being viewed by others

Fast, Exact and Multi-scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

Unified Perceptual Parsing for Scene Understanding

Learning deep representations for semantic image parsing: a comprehensive overview

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Stacked Hierarchical Labeling

Abstract

Chapter PDF

Similar content being viewed by others

Fast, Exact and Multi-scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

Unified Perceptual Parsing for Scene Understanding

Learning deep representations for semantic image parsing: a comprehensive overview

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation