Skip to main content
Top

2015 | OriginalPaper | Chapter

Interactive RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance

Authors : Ling Ge, Ran Ju, Tongwei Ren, Gangshan Wu

Published in: Advances in Multimedia Information Processing -- PCM 2015

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we propose a novel interactive image segmentation method for RGB-D images using hierarchical Graph Cut. Considering the characteristics of RGB channels and depth channel in RGB-D image, we utilize Euclidean distance on RGB space and geodesic distance on 3D space to measure how likely a pixel belongs to foreground or background in color and depth respectively, and integrate the color cue and depth cue into a unified Graph Cut framework to obtain the optimal segmentation result. Moreover, to overcome the low efficiency problem of Graph Cut in handling high resolution images, we accelerate the proposed method with hierarchical strategy. The experimental results show that our method outperforms the state-of-the-art methods with high efficiency.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Li, S., Ju, R., Ren, T., Wu, G.: Saliency cuts based on adaptive triple threshoding. In: International Conference on Image Processing, pp. 1–4. IEEE (2015) Li, S., Ju, R., Ren, T., Wu, G.: Saliency cuts based on adaptive triple threshoding. In: International Conference on Image Processing, pp. 1–4. IEEE (2015)
2.
go back to reference Nguyen, T.N.A., Cai, J., Zhang, J., Zheng, J.: Robust interactive image segmentation using convex active contours. IEEE Trans. Image Process. 21(8), 3734–3743 (2012)MathSciNetCrossRef Nguyen, T.N.A., Cai, J., Zhang, J., Zheng, J.: Robust interactive image segmentation using convex active contours. IEEE Trans. Image Process. 21(8), 3734–3743 (2012)MathSciNetCrossRef
3.
go back to reference Delgado-Gonzalo, R., Chenouard, N., Unser, M.: Spline-based deforming ellipsoids for interactive 3D bioimage segmentation. IEEE Trans. Image Process. 22(10), 3926–3940 (2013)MathSciNetCrossRef Delgado-Gonzalo, R., Chenouard, N., Unser, M.: Spline-based deforming ellipsoids for interactive 3D bioimage segmentation. IEEE Trans. Image Process. 22(10), 3926–3940 (2013)MathSciNetCrossRef
4.
go back to reference Cheng, M.M., Mitra, N.J., Huang, X., Torr, P.H.S., Hu, S.M.: Global contrast based salient region detection. IEEE Trans. Pattern Anal. Mach. Intell. 37(3), 569–582 (2014)CrossRef Cheng, M.M., Mitra, N.J., Huang, X., Torr, P.H.S., Hu, S.M.: Global contrast based salient region detection. IEEE Trans. Pattern Anal. Mach. Intell. 37(3), 569–582 (2014)CrossRef
5.
go back to reference Ren, T., Liu, Y., Wu, G.: Image retargeting based on global energy optimization. In: IEEE International Conference on Multimedia and Expo, pp. 406–409 (2009) Ren, T., Liu, Y., Wu, G.: Image retargeting based on global energy optimization. In: IEEE International Conference on Multimedia and Expo, pp. 406–409 (2009)
6.
go back to reference Xu, X., Geng, W., Ju, R., Yang, Y., Ren, T., Wu, G.: OBSIR: object-based stereo image retrieval. In: IEEE International Conference on Multimedia and Expo, pp. 1–6 (2014) Xu, X., Geng, W., Ju, R., Yang, Y., Ren, T., Wu, G.: OBSIR: object-based stereo image retrieval. In: IEEE International Conference on Multimedia and Expo, pp. 1–6 (2014)
7.
go back to reference Greig, D., Porteous, B., Seheult, A.H.: Exact maximum a posteriori estimation for binary images. J. Roy. Stat. Soc. Ser. B (Methodol.) 51, 271–279 (1989) Greig, D., Porteous, B., Seheult, A.H.: Exact maximum a posteriori estimation for binary images. J. Roy. Stat. Soc. Ser. B (Methodol.) 51, 271–279 (1989)
8.
go back to reference Boykov, Y.Y., Jolly, M.P.: Interactive graph cuts for optimal boundary & region segmentation of objects in ND images. In: IEEE International Conference on Computer Vision, pp. 105–112 (2001) Boykov, Y.Y., Jolly, M.P.: Interactive graph cuts for optimal boundary & region segmentation of objects in ND images. In: IEEE International Conference on Computer Vision, pp. 105–112 (2001)
9.
go back to reference Yatziv, L., Bartesaghi, A., Sapiro, G.: O(n) implementation of the fast marching algorithm. J. Comput. Phys. 212(2), 393–399 (2006)CrossRefMATH Yatziv, L., Bartesaghi, A., Sapiro, G.: O(n) implementation of the fast marching algorithm. J. Comput. Phys. 212(2), 393–399 (2006)CrossRefMATH
10.
go back to reference Diebold, J., Demmel, N., Hazırbaş, C., Moeller, M., Cremers, D.: Interactive multi-label segmentation of RGB-D images. In: Aujol, J.-F., Nikolova, M., Papadakis, N. (eds.) SSVM 2015. LNCS, vol. 9087, pp. 294–306. Springer, Heidelberg (2015) Diebold, J., Demmel, N., Hazırbaş, C., Moeller, M., Cremers, D.: Interactive multi-label segmentation of RGB-D images. In: Aujol, J.-F., Nikolova, M., Papadakis, N. (eds.) SSVM 2015. LNCS, vol. 9087, pp. 294–306. Springer, Heidelberg (2015)
11.
go back to reference Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1124–1137 (2004)CrossRefMATH Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1124–1137 (2004)CrossRefMATH
12.
go back to reference Ju, R., Ge, L., Geng, W., Ren, T., Wu, G.: Depth saliency based on anisotropic center-surround difference. In: IEEE International Conference on Image Processing, pp. 1115–1119 (2014) Ju, R., Ge, L., Geng, W., Ren, T., Wu, G.: Depth saliency based on anisotropic center-surround difference. In: IEEE International Conference on Image Processing, pp. 1115–1119 (2014)
13.
go back to reference Peng, H., Li, B., Xiong, W., Hu, W., Ji, R.: RGBD salient object detection: a benchmark and algorithms. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part III. LNCS, vol. 8691, pp. 92–109. Springer, Heidelberg (2014) Peng, H., Li, B., Xiong, W., Hu, W., Ji, R.: RGBD salient object detection: a benchmark and algorithms. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part III. LNCS, vol. 8691, pp. 92–109. Springer, Heidelberg (2014)
14.
go back to reference Sang, J., Mei, T., Xu, Y.Q., Zhao, C., Xu, C., Li, S.: Interaction design for mobile visual search. IEEE Trans. Multimedia 15(7), 1665–1676 (2013)CrossRef Sang, J., Mei, T., Xu, Y.Q., Zhao, C., Xu, C., Li, S.: Interaction design for mobile visual search. IEEE Trans. Multimedia 15(7), 1665–1676 (2013)CrossRef
15.
go back to reference Sang, J.: User-centric social multimedia computing. Springer, Heidelberg (2014)CrossRef Sang, J.: User-centric social multimedia computing. Springer, Heidelberg (2014)CrossRef
16.
go back to reference Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)CrossRef Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)CrossRef
17.
go back to reference Grady, L.: Random walks for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 28(11), 1768–1783 (2006)CrossRef Grady, L.: Random walks for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 28(11), 1768–1783 (2006)CrossRef
18.
go back to reference Gulshan, V., Rother, C., Criminisi, A., Blake, A., Zisserman, A.: Geodesic star convexity for interactive image segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3129–3136 (2010) Gulshan, V., Rother, C., Criminisi, A., Blake, A., Zisserman, A.: Geodesic star convexity for interactive image segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3129–3136 (2010)
19.
go back to reference Lombaert, H., Sun, Y., Grady, L., Xu, C.: A multilevel banded graph cuts method for fast image segmentation. In: IEEE International Conference on Computer Vision, pp. 259–265 (2005) Lombaert, H., Sun, Y., Grady, L., Xu, C.: A multilevel banded graph cuts method for fast image segmentation. In: IEEE International Conference on Computer Vision, pp. 259–265 (2005)
20.
go back to reference Vaudrey, T., Gruber, D., Wedel, A., Klappstein, J.: Space-time multi-resolution banded graph-cut for fast segmentation. In: Rigoll, G. (ed.) DAGM 2008. LNCS, vol. 5096, pp. 203–213. Springer, Heidelberg (2008) CrossRef Vaudrey, T., Gruber, D., Wedel, A., Klappstein, J.: Space-time multi-resolution banded graph-cut for fast segmentation. In: Rigoll, G. (ed.) DAGM 2008. LNCS, vol. 5096, pp. 203–213. Springer, Heidelberg (2008) CrossRef
21.
go back to reference Kolmogorov, V., Criminisi, A., Blake, A., Cross, G., Rother, C.: Bi-layer segmentation of binocular stereo video. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp. 407–414 (2005) Kolmogorov, V., Criminisi, A., Blake, A., Cross, G., Rother, C.: Bi-layer segmentation of binocular stereo video. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp. 407–414 (2005)
22.
go back to reference Harville, M., Gordon, G., Woodfill, J.: Foreground segmentation using adaptive mixture models in color and depth. In: IEEE Workshop on Detection and Recognition of Events in Video, pp. 3–11 (2001) Harville, M., Gordon, G., Woodfill, J.: Foreground segmentation using adaptive mixture models in color and depth. In: IEEE Workshop on Detection and Recognition of Events in Video, pp. 3–11 (2001)
23.
go back to reference Ahn, J.H., Kim, K., Byun, H.: Robust object segmentation using graph cut with object and background seed estimation. In: International Conference on Pattern Recognition, pp. 361–364. IEEE (2006) Ahn, J.H., Kim, K., Byun, H.: Robust object segmentation using graph cut with object and background seed estimation. In: International Conference on Pattern Recognition, pp. 361–364. IEEE (2006)
Metadata
Title
Interactive RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance
Authors
Ling Ge
Ran Ju
Tongwei Ren
Gangshan Wu
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-24075-6_12