nach oben

Multimedia Systems

Erschienen in:

01.10.2015 | Regular Paper

2D to 3D conversion with motion-type adaptive depth estimation

verfasst von: Cheolkon Jung, Lei Wang, Xiaohua Zhu, Licheng Jiao

Erschienen in: Multimedia Systems | Ausgabe 5/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

2D to 3D conversion is an important task for 3DTV broadcasting services due to the lack of stereoscopic 3D contents. In this paper, we propose 2D to 3D conversion with motion-type adaptive depth estimation. Because the most important depth cue is motion parallax in our method, we first perform motion estimation between sequential video frames. Then, we adopt a motion-type adaptive approach to depth map estimation because videos have different depth structures according to the type of motion. To be specific, depth from motion is exploited to estimate depth maps in the case of global motion while the depth maps are generated based on the depth from template with the local motion-guided refinement in the case of local motion. Finally, we employ depth image-based rendering (DIBR) to generate stereoscopic virtual views from the depth maps. Experimental results demonstrate that the proposed 2D to 3D conversion is very effective in generating accurate depth maps and providing realistic 3D effects.

Vorheriger Artikel Modeling performing arts metadata and relationships in content service for institutions

Nächster Artikel ALD: adaptive layer distribution for scalable video

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Tam, W.J., Zhang, L.: “3D-TV content generation: 2d-to-3d conversion”. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 1869–1872 (2006)

Wiegand, T., Sullivan, G.J., Bjontegaard, G., Luthra, A.: Overview of the H. 264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 560–576 (2003)CrossRef

Varodayan, D., Chen, D., Flierl, M., Girod, B.: “Wyner–Ziv coding of video with unsupervised motion vector learning,” Signal Processing: Image Communication, pp. 369–378 (2008)

Chen, D., Tsai, S., Chandrasekhar, V., Takacs, G., Vedantham, R., Grzeszczuk, R., Girod, B.: Residual enhanced visual vector as a compact signature for mobile visual search. Sig. Process. 93(8), 2316–2327 (2013)CrossRef

Wang, H., Schuster, G.M., Katsaggelos, A.K.: Rate-distortion optimal bit allocation for object-based video coding. IEEE Trans. Circuits Syst. Video Technol. 15(9), 1113–1123 (2005)CrossRef

Zhu, L., Fan, Z., Aggelos, K.K.: “Joint video summarization and transmission adaptation for energy-efficient wireless video streaming,” EURASIP J Adv Signal Process, vol. 2008, Article ID 657032 (2008)

Smolic, A., Kauff, P., Knorr, S., Hornung, A., Kunter, M., Muller, M., Lang, M.: Three-dimensional video postproduction and processing. Proc. IEEE 99(4), 607–625 (2011)CrossRef

Pitas, I., Nikolaidis, N.:“Anthropocentric video analysis for film and games postproduction”. In: Proceedings of the 11th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing on International Conference on Computer Systems and Technologies pp. 11–18 (2010)

Daribo, I., Saito, H.: A novel inpainting-based layered depth video for 3DTV. IEEE Trans. Broadcast. 57(2), 533–541 (2011)CrossRef

10.

Holte, M.B., Moeslund, T.B., Nikolaidis, N., Pitas, I.: “3D human action recognition for multi-view camera systems”. In: Proceedings of International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT), pp. 342–349 (2011)

11.

Liao, M., Gao, J., Yang, R., Gong, M.: Video stereolization: combining motion analysis with user interaction. IEEE Trans. Visual Comput. Graphics 18(7), 1079–1088 (2012)CrossRef

12.

Yang, N.E., Lee, J.W., Park, R-H.: “Depth map generation from a single image using local depth hypothesis”. In: Proceedings of IEEE International Consumer Electronics (ICCE), pp. 311–312 (2012)

13.

Zhang, L., Vazquez, C., Knorr, S.: 3D-TV content creation: automatic 2D-to-3D video conversion. IEEE Trans. Broadcast. 99, 1–12 (2011)

14.

Kim, D., Min, D., Sohn, K.: A stereoscopic video generation method using stereoscopic display characterization and motion analysis. IEEE Trans. Broadcast. 54, 188–197 (2008)CrossRef

15.

Pourazad, M.T., Nasiopoulos, P., Ward, R.K.: An H. 264-based scheme for 2D to 3D video conversion. IEEE Trans. Consum. Electron. 55, 742–748 (2008)CrossRef

16.

Lai, Y.K., Lai, Y.F., Chen, Y.C.: “An effective hybrid depth-perception algorithm for 2D-to-3D conversion in 3D display systems”. In: Proceedings IEEE ICCE, pp. 612–613 (2012)

17.

Yu, F., Liu, J., Ren, Y., Sun, J., Gao, Y., Liu, W.: “Depth generation method for 2D to 3D conversion”. In: Proceedings 3DTV-Con (2011)

18.

Jung, Y.J., Baik, A., Kim, J., Park, D.: “A novel 2D-to-3D conversion technique based on relative height depth cue”. In: Proceedings SPIE Electronics Imaging, Stereoscopic Displays and Applications (2009)

19.

Cheng, C.C., Li, C.T., Chen, L.-G.: A novel 2d-to-3d conversion system using edge information. IEEE Trans. Consum. Electron. 56(3), 1739–1745 (2010)CrossRef

20.

Zhang, Z., Wang, Y., Jiang, T., Gao, W.: “Visual pertinent 2D-to-3D video conversion by multi-cue fusion”. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 909–912 (2011)

21.

Kraemer, P., Benois-Pineau, J.: “Camera motion detection in the rough indexing paradigm”. In: TREC Video Retrieval Evaluation Online Proceedings, TRECVID05 (2005)

22.

Tao, M.W., Bai, J., Kohli, P., Paris, S.: SimpleFlow: a non-iterative, sublinear optical flow algorithm. Comput Graphics Forum 31(2), 345–353 (2012)CrossRef

23.

Liu, C., Christopher, L.: “Depth map estimation from motion for 2D to 3D conversion”. In: Proceedings of IEEE International Conference on Electro/Information Technology (EIT), pp. 1–4 (2012)

24.

Han, K., Hong, K.: “Geometric and texture cue based depth-map estimation for 2D to 3D image conversion”. In: Proceedings of IEEE International Conference on Consumer Electronics (ICCE), (2011)

25.

Duda, R.O., Hart, P.E.: “Use of the hough transformation to detect lines and curves in pictures”, In: AI Center, SRI International (1971)

26.

Zhang, X., Yang, Y.: “Minimum spanning tree and color image segmentation”. In: Proceedings of IEEE International Conference on Networking, Sensing and Control, pp. 900–904 (2008)

27.

Tomasi, C., Manduchi, R.: “Bilateral filtering for gray and color images”. In: Proceedings of International Conference on Computer Vision (ICCV) (1998)

28.

Petschnigg, G., Szeliski, R., Agrawala, M., Cohen, M., Hoppe, H., Toyama, K.: “Digital photography with flash and no-flash image pairs”. In: Proceedings of ACM SIGGRAPH, pp. 664–672 (2004)

29.

Fehn, C.: “A 3D-TV approach using depth-image-based rendering (DIBR)”. In: Proceedings Of Visualization, Imaging, and Image Processing, pp. 482–487 (2003)

30.

Fehn, C.: “Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3D-TV”. In: Proceedings SPIE 5291, Stereoscopic Displays and Virtual Reality Systems XI (2004)

31.

Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)CrossRef

32.

Wang, Z., Bovik, A.C.: A universal image quality index. IEEE Signal Process. Lett. 9(3), 81–84 (2001)CrossRef

33.

Sheikh, H.R., Bovik, A.C.: Image information and visual quality. IEEE Trans. Image Process. 15(2), 430–444 (2006)CrossRef

34.

Moorthy, A.K., Bovik, A.C.: A two-step framework for constructing blind image quality indices. IEEE Signal Process. Lett. 17(5), 513–516 (2010)CrossRef

Titel: 2D to 3D conversion with motion-type adaptive depth estimation
verfasst von: Cheolkon Jung
Lei Wang
Xiaohua Zhu
Licheng Jiao
Publikationsdatum: 01.10.2015
Verlag: Springer Berlin Heidelberg
Erschienen in: Multimedia Systems / Ausgabe 5/2015
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI: https://doi.org/10.1007/s00530-014-0375-z

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Kryptowährungen/© gopixa / Getty Images / iStock, MG4 aus China auf dem Prüfstand im ADAC-Technik-Zentrum in Landsberg am Lech/© ADAC e.V., Chassis eines Elektrofahrzeugs/© chesky / stock.adobe.com, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Sustainibility Finance/© Robert Kneschke / stock.adobe.com / Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 5/2015

Context-based environmental audio event recognition for scene understanding

ALD: adaptive layer distribution for scalable video

Behaviour recognition using multivariate m-mediod based modelling of motion trajectories

Modeling performing arts metadata and relationships in content service for institutions

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.