Skip to main content
Erschienen in: Multimedia Systems 5/2015

01.10.2015 | Regular Paper

2D to 3D conversion with motion-type adaptive depth estimation

verfasst von: Cheolkon Jung, Lei Wang, Xiaohua Zhu, Licheng Jiao

Erschienen in: Multimedia Systems | Ausgabe 5/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

2D to 3D conversion is an important task for 3DTV broadcasting services due to the lack of stereoscopic 3D contents. In this paper, we propose 2D to 3D conversion with motion-type adaptive depth estimation. Because the most important depth cue is motion parallax in our method, we first perform motion estimation between sequential video frames. Then, we adopt a motion-type adaptive approach to depth map estimation because videos have different depth structures according to the type of motion. To be specific, depth from motion is exploited to estimate depth maps in the case of global motion while the depth maps are generated based on the depth from template with the local motion-guided refinement in the case of local motion. Finally, we employ depth image-based rendering (DIBR) to generate stereoscopic virtual views from the depth maps. Experimental results demonstrate that the proposed 2D to 3D conversion is very effective in generating accurate depth maps and providing realistic 3D effects.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Tam, W.J., Zhang, L.: “3D-TV content generation: 2d-to-3d conversion”. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 1869–1872 (2006) Tam, W.J., Zhang, L.: “3D-TV content generation: 2d-to-3d conversion”. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 1869–1872 (2006)
2.
Zurück zum Zitat Wiegand, T., Sullivan, G.J., Bjontegaard, G., Luthra, A.: Overview of the H. 264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 560–576 (2003)CrossRef Wiegand, T., Sullivan, G.J., Bjontegaard, G., Luthra, A.: Overview of the H. 264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 560–576 (2003)CrossRef
3.
Zurück zum Zitat Varodayan, D., Chen, D., Flierl, M., Girod, B.: “Wyner–Ziv coding of video with unsupervised motion vector learning,” Signal Processing: Image Communication, pp. 369–378 (2008) Varodayan, D., Chen, D., Flierl, M., Girod, B.: “Wyner–Ziv coding of video with unsupervised motion vector learning,” Signal Processing: Image Communication, pp. 369–378 (2008)
4.
Zurück zum Zitat Chen, D., Tsai, S., Chandrasekhar, V., Takacs, G., Vedantham, R., Grzeszczuk, R., Girod, B.: Residual enhanced visual vector as a compact signature for mobile visual search. Sig. Process. 93(8), 2316–2327 (2013)CrossRef Chen, D., Tsai, S., Chandrasekhar, V., Takacs, G., Vedantham, R., Grzeszczuk, R., Girod, B.: Residual enhanced visual vector as a compact signature for mobile visual search. Sig. Process. 93(8), 2316–2327 (2013)CrossRef
5.
Zurück zum Zitat Wang, H., Schuster, G.M., Katsaggelos, A.K.: Rate-distortion optimal bit allocation for object-based video coding. IEEE Trans. Circuits Syst. Video Technol. 15(9), 1113–1123 (2005)CrossRef Wang, H., Schuster, G.M., Katsaggelos, A.K.: Rate-distortion optimal bit allocation for object-based video coding. IEEE Trans. Circuits Syst. Video Technol. 15(9), 1113–1123 (2005)CrossRef
6.
Zurück zum Zitat Zhu, L., Fan, Z., Aggelos, K.K.: “Joint video summarization and transmission adaptation for energy-efficient wireless video streaming,” EURASIP J Adv Signal Process, vol. 2008, Article ID 657032 (2008) Zhu, L., Fan, Z., Aggelos, K.K.: “Joint video summarization and transmission adaptation for energy-efficient wireless video streaming,” EURASIP J Adv Signal Process, vol. 2008, Article ID 657032 (2008)
7.
Zurück zum Zitat Smolic, A., Kauff, P., Knorr, S., Hornung, A., Kunter, M., Muller, M., Lang, M.: Three-dimensional video postproduction and processing. Proc. IEEE 99(4), 607–625 (2011)CrossRef Smolic, A., Kauff, P., Knorr, S., Hornung, A., Kunter, M., Muller, M., Lang, M.: Three-dimensional video postproduction and processing. Proc. IEEE 99(4), 607–625 (2011)CrossRef
8.
Zurück zum Zitat Pitas, I., Nikolaidis, N.:“Anthropocentric video analysis for film and games postproduction”. In: Proceedings of the 11th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing on International Conference on Computer Systems and Technologies pp. 11–18 (2010) Pitas, I., Nikolaidis, N.:“Anthropocentric video analysis for film and games postproduction”. In: Proceedings of the 11th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing on International Conference on Computer Systems and Technologies pp. 11–18 (2010)
9.
Zurück zum Zitat Daribo, I., Saito, H.: A novel inpainting-based layered depth video for 3DTV. IEEE Trans. Broadcast. 57(2), 533–541 (2011)CrossRef Daribo, I., Saito, H.: A novel inpainting-based layered depth video for 3DTV. IEEE Trans. Broadcast. 57(2), 533–541 (2011)CrossRef
10.
Zurück zum Zitat Holte, M.B., Moeslund, T.B., Nikolaidis, N., Pitas, I.: “3D human action recognition for multi-view camera systems”. In: Proceedings of International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT), pp. 342–349 (2011) Holte, M.B., Moeslund, T.B., Nikolaidis, N., Pitas, I.: “3D human action recognition for multi-view camera systems”. In: Proceedings of International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT), pp. 342–349 (2011)
11.
Zurück zum Zitat Liao, M., Gao, J., Yang, R., Gong, M.: Video stereolization: combining motion analysis with user interaction. IEEE Trans. Visual Comput. Graphics 18(7), 1079–1088 (2012)CrossRef Liao, M., Gao, J., Yang, R., Gong, M.: Video stereolization: combining motion analysis with user interaction. IEEE Trans. Visual Comput. Graphics 18(7), 1079–1088 (2012)CrossRef
12.
Zurück zum Zitat Yang, N.E., Lee, J.W., Park, R-H.: “Depth map generation from a single image using local depth hypothesis”. In: Proceedings of IEEE International Consumer Electronics (ICCE), pp. 311–312 (2012) Yang, N.E., Lee, J.W., Park, R-H.: “Depth map generation from a single image using local depth hypothesis”. In: Proceedings of IEEE International Consumer Electronics (ICCE), pp. 311–312 (2012)
13.
Zurück zum Zitat Zhang, L., Vazquez, C., Knorr, S.: 3D-TV content creation: automatic 2D-to-3D video conversion. IEEE Trans. Broadcast. 99, 1–12 (2011) Zhang, L., Vazquez, C., Knorr, S.: 3D-TV content creation: automatic 2D-to-3D video conversion. IEEE Trans. Broadcast. 99, 1–12 (2011)
14.
Zurück zum Zitat Kim, D., Min, D., Sohn, K.: A stereoscopic video generation method using stereoscopic display characterization and motion analysis. IEEE Trans. Broadcast. 54, 188–197 (2008)CrossRef Kim, D., Min, D., Sohn, K.: A stereoscopic video generation method using stereoscopic display characterization and motion analysis. IEEE Trans. Broadcast. 54, 188–197 (2008)CrossRef
15.
Zurück zum Zitat Pourazad, M.T., Nasiopoulos, P., Ward, R.K.: An H. 264-based scheme for 2D to 3D video conversion. IEEE Trans. Consum. Electron. 55, 742–748 (2008)CrossRef Pourazad, M.T., Nasiopoulos, P., Ward, R.K.: An H. 264-based scheme for 2D to 3D video conversion. IEEE Trans. Consum. Electron. 55, 742–748 (2008)CrossRef
16.
Zurück zum Zitat Lai, Y.K., Lai, Y.F., Chen, Y.C.: “An effective hybrid depth-perception algorithm for 2D-to-3D conversion in 3D display systems”. In: Proceedings IEEE ICCE, pp. 612–613 (2012) Lai, Y.K., Lai, Y.F., Chen, Y.C.: “An effective hybrid depth-perception algorithm for 2D-to-3D conversion in 3D display systems”. In: Proceedings IEEE ICCE, pp. 612–613 (2012)
17.
Zurück zum Zitat Yu, F., Liu, J., Ren, Y., Sun, J., Gao, Y., Liu, W.: “Depth generation method for 2D to 3D conversion”. In: Proceedings 3DTV-Con (2011) Yu, F., Liu, J., Ren, Y., Sun, J., Gao, Y., Liu, W.: “Depth generation method for 2D to 3D conversion”. In: Proceedings 3DTV-Con (2011)
18.
Zurück zum Zitat Jung, Y.J., Baik, A., Kim, J., Park, D.: “A novel 2D-to-3D conversion technique based on relative height depth cue”. In: Proceedings SPIE Electronics Imaging, Stereoscopic Displays and Applications (2009) Jung, Y.J., Baik, A., Kim, J., Park, D.: “A novel 2D-to-3D conversion technique based on relative height depth cue”. In: Proceedings SPIE Electronics Imaging, Stereoscopic Displays and Applications (2009)
19.
Zurück zum Zitat Cheng, C.C., Li, C.T., Chen, L.-G.: A novel 2d-to-3d conversion system using edge information. IEEE Trans. Consum. Electron. 56(3), 1739–1745 (2010)CrossRef Cheng, C.C., Li, C.T., Chen, L.-G.: A novel 2d-to-3d conversion system using edge information. IEEE Trans. Consum. Electron. 56(3), 1739–1745 (2010)CrossRef
20.
Zurück zum Zitat Zhang, Z., Wang, Y., Jiang, T., Gao, W.: “Visual pertinent 2D-to-3D video conversion by multi-cue fusion”. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 909–912 (2011) Zhang, Z., Wang, Y., Jiang, T., Gao, W.: “Visual pertinent 2D-to-3D video conversion by multi-cue fusion”. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 909–912 (2011)
21.
Zurück zum Zitat Kraemer, P., Benois-Pineau, J.: “Camera motion detection in the rough indexing paradigm”. In: TREC Video Retrieval Evaluation Online Proceedings, TRECVID05 (2005) Kraemer, P., Benois-Pineau, J.: “Camera motion detection in the rough indexing paradigm”. In: TREC Video Retrieval Evaluation Online Proceedings, TRECVID05 (2005)
22.
Zurück zum Zitat Tao, M.W., Bai, J., Kohli, P., Paris, S.: SimpleFlow: a non-iterative, sublinear optical flow algorithm. Comput Graphics Forum 31(2), 345–353 (2012)CrossRef Tao, M.W., Bai, J., Kohli, P., Paris, S.: SimpleFlow: a non-iterative, sublinear optical flow algorithm. Comput Graphics Forum 31(2), 345–353 (2012)CrossRef
23.
Zurück zum Zitat Liu, C., Christopher, L.: “Depth map estimation from motion for 2D to 3D conversion”. In: Proceedings of IEEE International Conference on Electro/Information Technology (EIT), pp. 1–4 (2012) Liu, C., Christopher, L.: “Depth map estimation from motion for 2D to 3D conversion”. In: Proceedings of IEEE International Conference on Electro/Information Technology (EIT), pp. 1–4 (2012)
24.
Zurück zum Zitat Han, K., Hong, K.: “Geometric and texture cue based depth-map estimation for 2D to 3D image conversion”. In: Proceedings of IEEE International Conference on Consumer Electronics (ICCE), (2011) Han, K., Hong, K.: “Geometric and texture cue based depth-map estimation for 2D to 3D image conversion”. In: Proceedings of IEEE International Conference on Consumer Electronics (ICCE), (2011)
25.
Zurück zum Zitat Duda, R.O., Hart, P.E.: “Use of the hough transformation to detect lines and curves in pictures”, In: AI Center, SRI International (1971) Duda, R.O., Hart, P.E.: “Use of the hough transformation to detect lines and curves in pictures”, In: AI Center, SRI International (1971)
26.
Zurück zum Zitat Zhang, X., Yang, Y.: “Minimum spanning tree and color image segmentation”. In: Proceedings of IEEE International Conference on Networking, Sensing and Control, pp. 900–904 (2008) Zhang, X., Yang, Y.: “Minimum spanning tree and color image segmentation”. In: Proceedings of IEEE International Conference on Networking, Sensing and Control, pp. 900–904 (2008)
27.
Zurück zum Zitat Tomasi, C., Manduchi, R.: “Bilateral filtering for gray and color images”. In: Proceedings of International Conference on Computer Vision (ICCV) (1998) Tomasi, C., Manduchi, R.: “Bilateral filtering for gray and color images”. In: Proceedings of International Conference on Computer Vision (ICCV) (1998)
28.
Zurück zum Zitat Petschnigg, G., Szeliski, R., Agrawala, M., Cohen, M., Hoppe, H., Toyama, K.: “Digital photography with flash and no-flash image pairs”. In: Proceedings of ACM SIGGRAPH, pp. 664–672 (2004) Petschnigg, G., Szeliski, R., Agrawala, M., Cohen, M., Hoppe, H., Toyama, K.: “Digital photography with flash and no-flash image pairs”. In: Proceedings of ACM SIGGRAPH, pp. 664–672 (2004)
29.
Zurück zum Zitat Fehn, C.: “A 3D-TV approach using depth-image-based rendering (DIBR)”. In: Proceedings Of Visualization, Imaging, and Image Processing, pp. 482–487 (2003) Fehn, C.: “A 3D-TV approach using depth-image-based rendering (DIBR)”. In: Proceedings Of Visualization, Imaging, and Image Processing, pp. 482–487 (2003)
30.
Zurück zum Zitat Fehn, C.: “Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3D-TV”. In: Proceedings SPIE 5291, Stereoscopic Displays and Virtual Reality Systems XI (2004) Fehn, C.: “Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3D-TV”. In: Proceedings SPIE 5291, Stereoscopic Displays and Virtual Reality Systems XI (2004)
31.
Zurück zum Zitat Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)CrossRef Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)CrossRef
32.
Zurück zum Zitat Wang, Z., Bovik, A.C.: A universal image quality index. IEEE Signal Process. Lett. 9(3), 81–84 (2001)CrossRef Wang, Z., Bovik, A.C.: A universal image quality index. IEEE Signal Process. Lett. 9(3), 81–84 (2001)CrossRef
33.
Zurück zum Zitat Sheikh, H.R., Bovik, A.C.: Image information and visual quality. IEEE Trans. Image Process. 15(2), 430–444 (2006)CrossRef Sheikh, H.R., Bovik, A.C.: Image information and visual quality. IEEE Trans. Image Process. 15(2), 430–444 (2006)CrossRef
34.
Zurück zum Zitat Moorthy, A.K., Bovik, A.C.: A two-step framework for constructing blind image quality indices. IEEE Signal Process. Lett. 17(5), 513–516 (2010)CrossRef Moorthy, A.K., Bovik, A.C.: A two-step framework for constructing blind image quality indices. IEEE Signal Process. Lett. 17(5), 513–516 (2010)CrossRef
Metadaten
Titel
2D to 3D conversion with motion-type adaptive depth estimation
verfasst von
Cheolkon Jung
Lei Wang
Xiaohua Zhu
Licheng Jiao
Publikationsdatum
01.10.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 5/2015
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-014-0375-z

Weitere Artikel der Ausgabe 5/2015

Multimedia Systems 5/2015 Zur Ausgabe

Neuer Inhalt