Skip to main content

2015 | OriginalPaper | Buchkapitel

Fixed Point Learning Based 3D Conversion of 2D Videos

verfasst von : Nidhi Chahal, Santanu Chaudhury

Erschienen in: Pattern Recognition and Machine Intelligence

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The depth cues which are also called monocular cues from single still image are more versatile while depth cues of multiple images gives more accurate depth extraction. Machine learning is a promising and new research direction for this type of conversion in today scenario. In our paper, a fast automatic 2D to 3D conversion technique is proposed which utilizes a fixed point learning framework for the accurate estimation of depth maps of query images using model trained from a training database of 2D color and depth images. The depth maps obtained from monocular and motion depth cues of input images/video and ground truth depths are used in training database for the fixed point iteration. The results produces with fixed point model are more accurate and reliable than MRF fusion of both types of depth cues. The stereo pairs are generated then using input video frames and their corresponding depth maps obtained from fixed point learning framework. These stereo pairs are put together to get the final 3D video which can be displayed on any 3DTV and seen using 3D glasses.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Saxena, A., Sun, M., Ng, A.Y.: Make3d: learning 3d scene structure from a single still image. Trans. Pattern Anal. Mach. Intell. 31, 824–840 (2009). IEEECrossRef Saxena, A., Sun, M., Ng, A.Y.: Make3d: learning 3d scene structure from a single still image. Trans. Pattern Anal. Mach. Intell. 31, 824–840 (2009). IEEECrossRef
2.
Zurück zum Zitat Konrad, J., Wang, M., Ishwar, P.: 2d-to-3d image conversion by learning depth from examples. In: Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 16–22. IEEE (2012) Konrad, J., Wang, M., Ishwar, P.: 2d-to-3d image conversion by learning depth from examples. In: Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 16–22. IEEE (2012)
3.
Zurück zum Zitat Liu, C., Yuen, J., Torralba, A.: Nonparametric scene parsing: label transfer via dense scene alignment. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1972–1979. IEEE (2009) Liu, C., Yuen, J., Torralba, A.: Nonparametric scene parsing: label transfer via dense scene alignment. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1972–1979. IEEE (2009)
4.
Zurück zum Zitat Konrad, J., Wang, M., Ishwar, P., Wu, C., Mukherjee, D.: Learning-based, automatic 2d-to-3d image and video conversion. Trans. Image Process. 22(9), 3485–3496 (2013). IEEECrossRef Konrad, J., Wang, M., Ishwar, P., Wu, C., Mukherjee, D.: Learning-based, automatic 2d-to-3d image and video conversion. Trans. Image Process. 22(9), 3485–3496 (2013). IEEECrossRef
5.
Zurück zum Zitat Zhuo, S., Sim, T.: Defocus map estimation from a single defocused image. Pattern Recogn. 44(9), 1852–1858 (2011)CrossRef Zhuo, S., Sim, T.: Defocus map estimation from a single defocused image. Pattern Recogn. 44(9), 1852–1858 (2011)CrossRef
6.
Zurück zum Zitat Sun, D., Roth, S., Black, M.J.: Secrets of optical flow estimation and their principles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2432–2439 (2010) Sun, D., Roth, S., Black, M.J.: Secrets of optical flow estimation and their principles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2432–2439 (2010)
7.
Zurück zum Zitat Xu, M., Chen, H., Varshney, P.K.: An image fusion approach based on markov random fields. Trans. Geosci. Remote Sens. 49(12), 5116–5127 (2011)CrossRef Xu, M., Chen, H., Varshney, P.K.: An image fusion approach based on markov random fields. Trans. Geosci. Remote Sens. 49(12), 5116–5127 (2011)CrossRef
8.
Zurück zum Zitat Li, Q., Wang, J., Tu, Z.: Fixed-point model for structured labeling. In: Proceedings of the 30th International Conference on Machine Learning, Atlanta, Georgia, USA, vol. 28 (2013) Li, Q., Wang, J., Tu, Z.: Fixed-point model for structured labeling. In: Proceedings of the 30th International Conference on Machine Learning, Atlanta, Georgia, USA, vol. 28 (2013)
9.
Zurück zum Zitat Kang, Y., Lai, Y., Chen, Y.: An effective hybrid depth-generation algorithm for 2D-to-3D conversion in 3D displays. J. Disp. Technol. 9(3), 154–161 (2013)CrossRef Kang, Y., Lai, Y., Chen, Y.: An effective hybrid depth-generation algorithm for 2D-to-3D conversion in 3D displays. J. Disp. Technol. 9(3), 154–161 (2013)CrossRef
Metadaten
Titel
Fixed Point Learning Based 3D Conversion of 2D Videos
verfasst von
Nidhi Chahal
Santanu Chaudhury
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-19941-2_9

Premium Partner