Skip to main content

2017 | OriginalPaper | Buchkapitel

What Is the Best Depth-Map Compression for Depth Image Based Rendering?

verfasst von : Jens Ogniewski, Per-Erik Forssén

Erschienen in: Computer Analysis of Images and Patterns

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many of the latest smart phones and tablets come with integrated depth sensors, that make depth-maps freely available, thus enabling new forms of applications like rendering from different view points. However, efficient compression exploiting the characteristics of depth-maps as well as the requirements of these new applications is still an open issue. In this paper, we evaluate different depth-map compression algorithms, with a focus on tree-based methods and view projection as application.
The contributions of this paper are the following: 1. extensions of existing geometric compression trees, 2. a comparison of a number of different trees, 3. a comparison of them to a state-of-the-art video coder, 4. an evaluation using ground-truth data that considers both depth-maps and predicted frames with arbitrary camera translation and rotation.
Despite our best efforts, and contrary to earlier results, current video depth-map compression outperforms tree-based methods in most cases. The reason for this is likely that previous evaluations focused on low-quality, low-resolution depth maps, while high-resolution depth (as needed in the DIBR setting) has been ignored up until now. We also demonstrate that PSNR on depth-maps is not always a good measure of their utility.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Newcombe, R.A., Izadi, S., Hilliges, O., Molyneaux, D., Kim, D., Davison, A.J., Kohi, P., Shotton, J., Hodges, S., Fitzgibbon, A.: KinectFusion: real-time dense surface mapping and tracking. In: Proceedings of the 10th IEEE International Symposium on Mixed and Augmented Reality, pp. 127–136 (2011) Newcombe, R.A., Izadi, S., Hilliges, O., Molyneaux, D., Kim, D., Davison, A.J., Kohi, P., Shotton, J., Hodges, S., Fitzgibbon, A.: KinectFusion: real-time dense surface mapping and tracking. In: Proceedings of the 10th IEEE International Symposium on Mixed and Augmented Reality, pp. 127–136 (2011)
2.
Zurück zum Zitat Kim, S.-Y., Ho, Y.-S.: Mesh-based depth coding for 3D video using hierarchical decomposition of depth maps. In: Proceedings of IEEE International Conference on Image Processing, pp. 117–120 (2007) Kim, S.-Y., Ho, Y.-S.: Mesh-based depth coding for 3D video using hierarchical decomposition of depth maps. In: Proceedings of IEEE International Conference on Image Processing, pp. 117–120 (2007)
3.
Zurück zum Zitat Jingjing, F., Miao, D., Weiren, Y., Wang, S., Yan, L., Li, S.: Kinect-like depth data compression. IEEE Trans. Multimed. 15, 1340–1352 (2013)CrossRef Jingjing, F., Miao, D., Weiren, Y., Wang, S., Yan, L., Li, S.: Kinect-like depth data compression. IEEE Trans. Multimed. 15, 1340–1352 (2013)CrossRef
4.
Zurück zum Zitat Merkle, P., Múller, K., Marpe, D., Wiegand, T.: Depth intra coding for 3d video based on geometric primitives. IEEE Trans. Circ. Syst. Video Technol. 99, 570–582 (2015) Merkle, P., Múller, K., Marpe, D., Wiegand, T.: Depth intra coding for 3d video based on geometric primitives. IEEE Trans. Circ. Syst. Video Technol. 99, 570–582 (2015)
5.
Zurück zum Zitat Chai, B.B., Sethuraman, S., Sawhney, H.S., Hatrack, P.: Depth map compression for real-time view-based rendering. Pattern Recogn. Lett. 25, 755–766 (2004)CrossRef Chai, B.B., Sethuraman, S., Sawhney, H.S., Hatrack, P.: Depth map compression for real-time view-based rendering. Pattern Recogn. Lett. 25, 755–766 (2004)CrossRef
6.
Zurück zum Zitat Morvan, Y., Farin, D., de With, P.H.N.: Multiview video coding using depth based 3D warping. In: Proceedings of IEEE International Conference on Image Processing, vol. 5, pp. 105–108 (2007) Morvan, Y., Farin, D., de With, P.H.N.: Multiview video coding using depth based 3D warping. In: Proceedings of IEEE International Conference on Image Processing, vol. 5, pp. 105–108 (2007)
7.
Zurück zum Zitat Colleu, T., Pateux, S., Morinc, L., Labit, C.: A polygon soup representation for multiview coding. J. Vis. Commun. Image Represent. 21, 561–576 (2010)CrossRef Colleu, T., Pateux, S., Morinc, L., Labit, C.: A polygon soup representation for multiview coding. J. Vis. Commun. Image Represent. 21, 561–576 (2010)CrossRef
8.
Zurück zum Zitat Sandberg, D., Forssén, P.E., Ogniewski, J.: Model-based video coding using colour and depth cameras. In: Proceedings of International Conference on Digital Image Computing Techniques and Applications, pp. 158–163 (2011) Sandberg, D., Forssén, P.E., Ogniewski, J.: Model-based video coding using colour and depth cameras. In: Proceedings of International Conference on Digital Image Computing Techniques and Applications, pp. 158–163 (2011)
9.
Zurück zum Zitat Chai, B.-B., Sethuraman, S., Sawhney, H.S.: A depth map representation for real-time transmission and view-based rendering of a dynamic 3D scene. In: Proceedings of First International Symposium on 3D Data Processing Visualization and Transmission, pp. 107–114 (2002) Chai, B.-B., Sethuraman, S., Sawhney, H.S.: A depth map representation for real-time transmission and view-based rendering of a dynamic 3D scene. In: Proceedings of First International Symposium on 3D Data Processing Visualization and Transmission, pp. 107–114 (2002)
10.
Zurück zum Zitat Sarkis, M., Zia, W., Diepold, K.: Fast depth map compression and meshing with compressed tritree. In: Zha, H., Taniguchi, R., Maybank, S. (eds.) ACCV 2009. LNCS, vol. 5995, pp. 44–55. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12304-7_5 CrossRef Sarkis, M., Zia, W., Diepold, K.: Fast depth map compression and meshing with compressed tritree. In: Zha, H., Taniguchi, R., Maybank, S. (eds.) ACCV 2009. LNCS, vol. 5995, pp. 44–55. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-12304-7_​5 CrossRef
11.
Zurück zum Zitat Oh, B.T., Lee, J., Park, D.-S.: Binary tree decomposition depth coding for 3D video applications. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1–6 (2011) Oh, B.T., Lee, J., Park, D.-S.: Binary tree decomposition depth coding for 3D video applications. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1–6 (2011)
12.
Zurück zum Zitat Byung Tae, O., Lee, J., Park, D.-S.: Depth map coding based on synthesized view distortion function. IEEE J. Sel. Topics Signal Process. 5, 1344–1352 (2011)CrossRef Byung Tae, O., Lee, J., Park, D.-S.: Depth map coding based on synthesized view distortion function. IEEE J. Sel. Topics Signal Process. 5, 1344–1352 (2011)CrossRef
13.
Zurück zum Zitat Wang, L., Yu, L.: Rate-distortion optimization for depth map coding with distortion estimation of synthesized view. In: Proceedings of IEEE International Symposium on Circuits and Systems, pp. 17–20 (2013) Wang, L., Yu, L.: Rate-distortion optimization for depth map coding with distortion estimation of synthesized view. In: Proceedings of IEEE International Symposium on Circuits and Systems, pp. 17–20 (2013)
14.
Zurück zum Zitat Müller, K., Schwarz, H., Marpe, D., Bartnik, C., Bosse, S., Brust, H., Hinz, T., Lakshman, H., Merkle, P., Rhee, F.H., Tech, G., Winken, M., Wiegand, T.: 3D high-efficiency video coding for multi-view video and depth data. IEEE Trans. Image Process. 22, 2266–2278 (2013)MathSciNetCrossRef Müller, K., Schwarz, H., Marpe, D., Bartnik, C., Bosse, S., Brust, H., Hinz, T., Lakshman, H., Merkle, P., Rhee, F.H., Tech, G., Winken, M., Wiegand, T.: 3D high-efficiency video coding for multi-view video and depth data. IEEE Trans. Image Process. 22, 2266–2278 (2013)MathSciNetCrossRef
15.
Zurück zum Zitat Ogniewski, J., Forssén, P.E.: Pushing the limits for view prediction in video coding. In: 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (2017) Ogniewski, J., Forssén, P.E.: Pushing the limits for view prediction in video coding. In: 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (2017)
16.
Zurück zum Zitat Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multi-scale structural similarity for image quality assessment. In: 37th IEEE Asilomar Conference on Signals, Systems and Computers (2003) Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multi-scale structural similarity for image quality assessment. In: 37th IEEE Asilomar Conference on Signals, Systems and Computers (2003)
17.
Zurück zum Zitat Sullivan, G.J., Ohm, J.R., Han, W.J., Wiegand, T.: Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circ. Syst. Video Technol. 22, 1649–1668 (2012)CrossRef Sullivan, G.J., Ohm, J.R., Han, W.J., Wiegand, T.: Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circ. Syst. Video Technol. 22, 1649–1668 (2012)CrossRef
18.
Zurück zum Zitat Iyer, K.N., Maiti, K., Navathe, B., Kannan, H., Sharma, A.: Multiview video coding using depth based 3D warping. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1108–1113 (2010) Iyer, K.N., Maiti, K., Navathe, B., Kannan, H., Sharma, A.: Multiview video coding using depth based 3D warping. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 1108–1113 (2010)
19.
Zurück zum Zitat Butler, D.J., Wulff, J., Stanley, G.B., Black, M.J.: A naturalistic open source movie for optical flow evaluation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7577, pp. 611–625. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33783-3_44 CrossRef Butler, D.J., Wulff, J., Stanley, G.B., Black, M.J.: A naturalistic open source movie for optical flow evaluation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7577, pp. 611–625. Springer, Heidelberg (2012). doi:10.​1007/​978-3-642-33783-3_​44 CrossRef
20.
Zurück zum Zitat Solh, M., Al Regib, G.: Hierarchical Hole-Filling(HHF): depth image based rendering without depth map filtering for 3D-TV. In: IEEE International Workshop on Multimedia and Signal Processing (2010) Solh, M., Al Regib, G.: Hierarchical Hole-Filling(HHF): depth image based rendering without depth map filtering for 3D-TV. In: IEEE International Workshop on Multimedia and Signal Processing (2010)
21.
Zurück zum Zitat Chao, Y.-H., Ortega, A., Wei, H., Cheung, G.: Edge-adaptive depth map coding with lifting transform on graphs. In: Picture Coding Symposium (2015) Chao, Y.-H., Ortega, A., Wei, H., Cheung, G.: Edge-adaptive depth map coding with lifting transform on graphs. In: Picture Coding Symposium (2015)
Metadaten
Titel
What Is the Best Depth-Map Compression for Depth Image Based Rendering?
verfasst von
Jens Ogniewski
Per-Erik Forssén
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-64698-5_34