Skip to main content

2019 | OriginalPaper | Buchkapitel

Intrinsic Decomposition by Learning from Varying Lighting Conditions

verfasst von : Gregoire Nieto, Mohammad Rouhani, Philippe Robert

Erschienen in: Advances in Visual Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Intrinsic image decomposition describes an image based on its reflectance and shading components. In this paper we tackle the problem of estimating the diffuse reflectance from a sequence of images captured from a fixed viewpoint under various illuminations. To this end we propose a deep learning approach to avoid heuristics and strong assumptions on the reflectance prior. We compare two network architectures: one classic ‘U’ shaped Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) composed of Convolutional Gated Recurrent Units (CGRU). We train our networks on a new dataset specifically designed for the task of intrinsic decomposition from sequences. We test our networks on MIT and BigTime datasets and outperform state-of-the-art algorithms both qualitatively and quantitatively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Baslamisli, A.S., Le, H.-A., Gevers, T.: CNN based learning using reflection and retinex models for intrinsic image decomposition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018 Baslamisli, A.S., Le, H.-A., Gevers, T.: CNN based learning using reflection and retinex models for intrinsic image decomposition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018
2.
Zurück zum Zitat Bell, S., Bala, K., Snavely, N.: Intrinsic images in the wild. ACM Trans. Graph. (SIGGRAPH) 33(4), 159 (2014)CrossRef Bell, S., Bala, K., Snavely, N.: Intrinsic images in the wild. ACM Trans. Graph. (SIGGRAPH) 33(4), 159 (2014)CrossRef
3.
Zurück zum Zitat Butler, D., Wulff, J., Stanley, G., Black, M.: MPI-Sintel optical flow benchmark: supplemental material. Technical report, MPI-IS-TR-006, MPI for Intelligent Systems (2012) Butler, D., Wulff, J., Stanley, G., Black, M.: MPI-Sintel optical flow benchmark: supplemental material. Technical report, MPI-IS-TR-006, MPI for Intelligent Systems (2012)
4.
Zurück zum Zitat Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014) Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:​1406.​1078 (2014)
5.
Zurück zum Zitat Duchêne, S., et al.: Multi-view intrinsic images of outdoors scenes with an application to relighting. ACM Trans. Graph. 34(5) (2015) Duchêne, S., et al.: Multi-view intrinsic images of outdoors scenes with an application to relighting. ACM Trans. Graph. 34(5) (2015)
6.
Zurück zum Zitat Fan, Q., Yang, J., Hua, G., Chen, B., Wipf, D.: Revisiting deep intrinsic image decompositions. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018 Fan, Q., Yang, J., Hua, G., Chen, B., Wipf, D.: Revisiting deep intrinsic image decompositions. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018
7.
Zurück zum Zitat Goldluecke, B., Cremers, D.: An approach to vectorial total variation based on geometric measure theory. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 327–333, June 2010 Goldluecke, B., Cremers, D.: An approach to vectorial total variation based on geometric measure theory. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 327–333, June 2010
8.
Zurück zum Zitat Grosse, R., Johnson, M.K., Adelson, E.H., Freeman, W.T.: Ground truth dataset and baseline evaluations for intrinsic image algorithms. In: 2009 IEEE 12th International Conference on Computer Vision, pp. 2335–2342, September 2009 Grosse, R., Johnson, M.K., Adelson, E.H., Freeman, W.T.: Ground truth dataset and baseline evaluations for intrinsic image algorithms. In: 2009 IEEE 12th International Conference on Computer Vision, pp. 2335–2342, September 2009
9.
Zurück zum Zitat Hochreiter, S., Bengio, Y., Frasconi, P.: Gradient flow in recurrent nets: the difficulty of learning long-term dependencies. In: Kolen, J., Kremer, S. (eds.) Field Guide to Dynamical Recurrent Networks. IEEE Press, Piscataway (2001) Hochreiter, S., Bengio, Y., Frasconi, P.: Gradient flow in recurrent nets: the difficulty of learning long-term dependencies. In: Kolen, J., Kremer, S. (eds.) Field Guide to Dynamical Recurrent Networks. IEEE Press, Piscataway (2001)
10.
Zurück zum Zitat Janner, M., Wu, J., Kulkarni, T.D., Yildirim, I., Tenenbaum, J.: Self-supervised intrinsic image decomposition. In: Advances in Neural Information Processing Systems, pp. 5938–5948 (2017) Janner, M., Wu, J., Kulkarni, T.D., Yildirim, I., Tenenbaum, J.: Self-supervised intrinsic image decomposition. In: Advances in Neural Information Processing Systems, pp. 5938–5948 (2017)
12.
Zurück zum Zitat Kovacs, B., Bell, S., Snavely, N., Bala, K.: Shading annotations in the wild. In: Computer Vision and Pattern Recognition (CVPR) (2017) Kovacs, B., Bell, S., Snavely, N., Bala, K.: Shading annotations in the wild. In: Computer Vision and Pattern Recognition (CVPR) (2017)
13.
Zurück zum Zitat Laffont, P., Bazin, J.: Intrinsic decomposition of image sequences from local temporal variations. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 433–441, December 2015 Laffont, P., Bazin, J.: Intrinsic decomposition of image sequences from local temporal variations. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 433–441, December 2015
14.
Zurück zum Zitat Laffont, P.-Y.: Intrinsic image decomposition from multiple photographs. Ph.D. thesis, Inria/University of Nice Sophia-Antipolis, October 2012 Laffont, P.-Y.: Intrinsic image decomposition from multiple photographs. Ph.D. thesis, Inria/University of Nice Sophia-Antipolis, October 2012
16.
Zurück zum Zitat Lettry, L., Vanhoey, K., Gool, L.V.: DARN: a deep adversarial residual network for intrinsic image decomposition. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1359–1367 (2018) Lettry, L., Vanhoey, K., Gool, L.V.: DARN: a deep adversarial residual network for intrinsic image decomposition. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1359–1367 (2018)
17.
Zurück zum Zitat Li, Z., Snavely, N.: Learning intrinsic image decomposition from watching the world. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018 Li, Z., Snavely, N.: Learning intrinsic image decomposition from watching the world. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018
19.
Zurück zum Zitat Meka, A., Fox, G., Zollhöfer, M., Richardt, C., Theobalt, C.: Live user-guided intrinsic video for static scenes. IEEE Trans. Vis. Comput. Graph. 23(11), 2447–2454 (2017)CrossRef Meka, A., Fox, G., Zollhöfer, M., Richardt, C., Theobalt, C.: Live user-guided intrinsic video for static scenes. IEEE Trans. Vis. Comput. Graph. 23(11), 2447–2454 (2017)CrossRef
20.
Zurück zum Zitat Meka, A., Zollhöfer, M., Richardt, C., Theobalt, C.: Live intrinsic video. ACM Trans. Graph. 35(4), 109:1–109:14 (2016)CrossRef Meka, A., Zollhöfer, M., Richardt, C., Theobalt, C.: Live intrinsic video. ACM Trans. Graph. 35(4), 109:1–109:14 (2016)CrossRef
21.
Zurück zum Zitat Narihira, T., Maire, M., Yu, S.X.: Direct intrinsics: learning albedo-shading decomposition by convolutional regression. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), ICCV 2015, pp. 2992–2992. IEEE Computer Society, Washington, DC (2015) Narihira, T., Maire, M., Yu, S.X.: Direct intrinsics: learning albedo-shading decomposition by convolutional regression. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), ICCV 2015, pp. 2992–2992. IEEE Computer Society, Washington, DC (2015)
22.
Zurück zum Zitat Narihira, T., Maire, M., Yu, S.X.: Learning lightness from human judgement on relative reflectance. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2965–2973 (2015) Narihira, T., Maire, M., Yu, S.X.: Learning lightness from human judgement on relative reflectance. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2965–2973 (2015)
24.
Zurück zum Zitat Shi, J., Dong, Y., Su, H., Stella, X.Y.: Learning non-lambertian object intrinsics across shapenet categories. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5844–5853. IEEE (2017) Shi, J., Dong, Y., Su, H., Stella, X.Y.: Learning non-lambertian object intrinsics across shapenet categories. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5844–5853. IEEE (2017)
25.
Zurück zum Zitat Weiss, Y.: Deriving intrinsic images from image sequences. In: Eighth IEEE International Conference on Computer Vision, ICCV 2001, Proceedings, vol. 2, pp. 68–75. IEEE (2001) Weiss, Y.: Deriving intrinsic images from image sequences. In: Eighth IEEE International Conference on Computer Vision, ICCV 2001, Proceedings, vol. 2, pp. 68–75. IEEE (2001)
26.
Zurück zum Zitat Xingjian, S., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-K., Woo, W.-C.: Convolutional LSTM network: a machine learning approach for precipitation nowcasting. In: Advances in Neural Information Processing Systems, pp. 802–810 (2015) Xingjian, S., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-K., Woo, W.-C.: Convolutional LSTM network: a machine learning approach for precipitation nowcasting. In: Advances in Neural Information Processing Systems, pp. 802–810 (2015)
27.
Zurück zum Zitat Ye, G., Garces, E., Liu, Y., Dai, Q., Gutierrez, D.: Intrinsic video and applications. ACM Trans. Graph. 33(4), 80:1–80:11 (2014)CrossRef Ye, G., Garces, E., Liu, Y., Dai, Q., Gutierrez, D.: Intrinsic video and applications. ACM Trans. Graph. 33(4), 80:1–80:11 (2014)CrossRef
28.
Zurück zum Zitat Zhao, Q., Tan, P., Dai, Q., Shen, L., Wu, E., Lin, S.: A closed-form solution to retinex with nonlocal texture constraints. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1437–1444 (2012)CrossRef Zhao, Q., Tan, P., Dai, Q., Shen, L., Wu, E., Lin, S.: A closed-form solution to retinex with nonlocal texture constraints. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1437–1444 (2012)CrossRef
29.
Zurück zum Zitat Zhou, T., Krähenbühl, P., Efros, A.A.: Learning data-driven reflectance priors for intrinsic image decomposition. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 3469–3477, December 2015 Zhou, T., Krähenbühl, P., Efros, A.A.: Learning data-driven reflectance priors for intrinsic image decomposition. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 3469–3477, December 2015
Metadaten
Titel
Intrinsic Decomposition by Learning from Varying Lighting Conditions
verfasst von
Gregoire Nieto
Mohammad Rouhani
Philippe Robert
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-33720-9_50

Premium Partner