Published in: International Journal of Computer Vision 2-3/2015

1 September 2015

A Bimodal Co-sparse Analysis Model for Image Processing

Authors: Martin Kiechle, Tim Habigt, Simon Hawe, Martin Kleinsteuber



Abstract

The success of many computer vision tasks lies in the ability to exploit the interdependency between different image modalities such as intensity and depth. Fusing corresponding information can be achieved on several levels, and one promising approach is the integration at a low level. Moreover, sparse signal models have successfully been used in many vision applications. Within this area of research, the so-called co-sparse analysis model has attracted considerably less attention than its well-known counterpart, the sparse synthesis model, although it has been proven to be very useful in various image processing applications. In this paper, we propose a bimodal co-sparse analysis model that is able to capture the interdependency of two image modalities. It is based on the assumption that a pair of analysis operators exists, so that the co-supports of the corresponding bimodal image structures have a large overlap. We propose an algorithm that is able to learn such a coupled pair of operators from registered and noise-free training data. Furthermore, we explain how this model can be applied to solve linear inverse problems in image processing and how it can be used as a prior in bimodal image registration tasks. This paper extends the work of some of the authors by two major contributions. Firstly, a modification of the learning process is proposed that a priori guarantees unit norm and zero-mean of the rows of the operator. This accounts for the intuition that local texture carries the most important information in image modalities independent of brightness and contrast. Secondly, the model is used in a novel bimodal image registration algorithm, which estimates the transformation parameters of unregistered images of different modalities.
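
The two ideas the abstract relies on, operator rows with zero mean and unit norm and a large overlap between the co-supports of the two modalities, can be illustrated with a minimal NumPy sketch. This is not the learning algorithm of the paper. The function names (`normalize_rows`, `cosupport`, `cosupport_overlap`), the tolerance `tol`, and the use of one finite-difference operator for both modalities are assumptions made purely for illustration.

```python
import numpy as np


def normalize_rows(omega):
    """Give every row zero mean, then unit Euclidean norm.

    Assumes no row is constant (a constant row would have zero norm
    after centering).
    """
    centered = omega - omega.mean(axis=1, keepdims=True)
    norms = np.linalg.norm(centered, axis=1, keepdims=True)
    return centered / norms


def cosupport(omega, patch, tol=1e-8):
    """Indices of operator rows whose response to the patch is (numerically) zero."""
    return set(np.flatnonzero(np.abs(omega @ patch) <= tol))


def cosupport_overlap(omega_a, patch_a, omega_b, patch_b, tol=1e-8):
    """Number of row indices that lie in the co-supports of both modalities."""
    shared = cosupport(omega_a, patch_a, tol) & cosupport(omega_b, patch_b, tol)
    return len(shared)


# Toy stand-in for a learned operator pair: first-order differences on a
# 6-sample signal, used here for both modalities (illustrative assumption).
diff_op = normalize_rows(np.diff(np.eye(6), axis=0))

# Two registered 1-D "patches" with an edge at the same position but with
# different brightness and contrast.
intensity = np.array([1.0, 1.0, 1.0, 5.0, 5.0, 5.0])
depth = np.array([2.0, 2.0, 2.0, 9.0, 9.0, 9.0])

print("shared co-support size:",
      cosupport_overlap(diff_op, intensity, diff_op, depth))

# Zero-mean rows make the response independent of a brightness offset.
print("offset invariant:",
      np.allclose(diff_op @ (intensity + 10.0), diff_op @ intensity))
```

Because every row sums to zero, adding a constant to a patch leaves the analysed vector unchanged, which matches the intuition stated in the abstract that local texture matters independently of brightness. In the toy data the two patches share an edge location, so their co-supports coincide even though their values differ.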

Metadata
Title
A Bimodal Co-sparse Analysis Model for Image Processing
Authors
Martin Kiechle
Tim Habigt
Simon Hawe
Martin Kleinsteuber
Publication date
1 September 2015
Publisher
Springer US
Published in
International Journal of Computer Vision / Issue 2-3/2015
Print ISSN: 0920-5691
Electronic ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-014-0786-5
