2016 | OriginalPaper | Chapter

Dense Segmentation-Aware Descriptors

Authors: Eduard Trulls, Iasonas Kokkinos, Alberto Sanfeliu, Francesc Moreno-Noguer

Published in: Dense Image Correspondences for Computer Vision

Publisher: Springer International Publishing

Abstract

Dense descriptors are becoming increasingly popular in a host of tasks, such as dense image correspondence, bag-of-words image classification, and label transfer. However, the extraction of descriptors on generic image points, rather than selecting geometric features, requires rethinking how to achieve invariance to nuisance parameters. In this work we pursue invariance to occlusions and background changes by introducing segmentation information within dense feature construction. The core idea is to use the segmentation cues to downplay the features coming from image areas that are unlikely to belong to the same region as the feature point. We show how to integrate this idea with dense SIFT, as well as with the dense scale- and rotation-invariant descriptor (SID). We thereby deliver dense descriptors that are invariant to background changes, rotation, and/or scaling. We explore the merit of our technique in conjunction with large displacement motion estimation and wide-baseline stereo, and demonstrate that exploiting segmentation information yields clear improvements.
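
The core idea stated above, downplaying features from image areas unlikely to share a region with the feature point, can be illustrated with a minimal sketch. It assumes each pixel carries a soft-segmentation embedding vector and that a descriptor cell is weighted by an exponential of its squared embedding distance to the feature point. The function names, the exponential form, and the `lam` parameter are illustrative assumptions, not details given in this abstract.

```python
import numpy as np

def segmentation_weights(center_embedding, cell_embeddings, lam=1.0):
    """Return one weight in (0, 1] per descriptor cell.

    Cells whose (assumed) segmentation embedding is far from the
    embedding at the feature point receive a weight close to 0.
    """
    diffs = cell_embeddings - center_embedding   # shape (n_cells, emb_dim)
    sq_dist = np.sum(diffs ** 2, axis=1)         # squared Euclidean distance
    return np.exp(-lam * sq_dist)

def mask_descriptor(cell_features, weights):
    """Scale each cell's feature vector by its weight, then flatten."""
    return (cell_features * weights[:, None]).ravel()

# Toy example: three cells with 4-dimensional features each.
center = np.array([0.0, 0.0])            # embedding at the feature point
cells = np.array([[0.0, 0.0],            # same region as the feature point
                  [0.1, 0.0],            # very likely the same region
                  [3.0, 4.0]])           # very likely a different region
feats = np.ones((3, 4))                  # placeholder cell features

w = segmentation_weights(center, cells, lam=1.0)
desc = mask_descriptor(feats, w)
```

In this toy setup the third cell is suppressed almost entirely, while the first two keep nearly their full contribution, which is the soft-masking behaviour the abstract describes in words.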

Footnotes
1
We were unaware of this work when first publishing [43].
 
Literature
1.
Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373–1396 (2003)
2.
Berg, A.C., Malik, J.: Geometric blur for template matching. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, vol. 1. IEEE, New York (2001)
3.
Borgefors, G.: Distance transformations in digital images. Comput. Vis. Graphics Image Process. 34(3), 344–371 (1986)
4.
Bovik, A.C., Clark, M., Geisler, W.S.: Multichannel texture analysis using localized spatial filters. Trans. Pattern Anal. Mach. Intell. 12(1), 55–73 (1990)
6.
Casasent, D., Psaltis, D.: Position, rotation, and scale invariant optical correlation. Appl. Opt. 15(7), 1795–1799 (1976)
7.
Deriche, R.: Using Canny’s criteria to derive a recursively implemented optimal edge detector. Int. J. Comput. Vis. 1(2), 167–187 (1987)
8.
Dice, L.R.: Measures of the amount of ecologic association between species. Ecology 26(3), 297–302 (1945)
9.
Dollár, P., Zitnick, C.L.: Structured forests for fast edge detection. In: Proceedings of the International Conference on Computer Vision, pp. 1841–1848. IEEE, New York (2013)
10.
Freeman, W.T., Adelson, E.H.: The design and use of steerable filters. Trans. Pattern Anal. Mach. Intell. 13(9), 891–906 (1991)
11.
Fulkerson, B., Vedaldi, A., Soatto, S.: Localizing objects with smart dictionaries. In: European Conference on Computer Vision, pp. 179–192. Springer, New York (2008)
12.
Hassner, T., Mayzels, V., Zelnik-Manor, L.: On SIFTs and their scales. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 1522–1528. IEEE, New York (2012)
13.
Kokkinos, I., Yuille, A.: Scale invariance without scale selection. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, New York (2008)
14.
Kokkinos, I., Bronstein, M., Yuille, A.: Dense scale invariant descriptors for images and surfaces (2012). INRIA Research Report 7914
15.
Kokkinos, I., Bronstein, M.M., Litman, R., Bronstein, A.M.: Intrinsic shape context descriptors for deformable shapes. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 159–166. IEEE, New York (2012)
16.
Kolmogorov, V.: Convergent tree-reweighted message passing for energy minimization. Trans. Pattern Anal. Mach. Intell. 28(10), 1568–1583 (2006)
17.
Leordeanu, M., Sukthankar, R., Sminchisescu, C.: Efficient closed-form solution to generalized boundary detection. In: European Conference on Computer Vision, pp. 516–529. Springer, New York (2012)
18.
Liu, C., Yuen, J., Torralba, A.: SIFT flow: Dense correspondence across scenes and its applications. Trans. Pattern Anal. Mach. Intell. 33(5), 978–994 (2011)
19.
Liu, K., Skibbe, H., Schmidt, T., Blein, T., Palme, K., Brox, T., Ronneberger, O.: Rotation-invariant HOG descriptors using Fourier analysis in polar and spherical coordinates. Int. J. Comput. Vis. 106(3), 342–364 (2014)
20.
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
21.
Maire, M., Arbeláez, P., Fowlkes, C., Malik, J.: Using contours to detect and localize junctions in natural images. In: Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, New York (2008)
22.
Maire, M., Yu, S.X., Perona, P.: Object detection and segmentation from joint embedding of parts and pixels. In: Proceedings of the International Conference on Computer Vision, pp. 2142–2149. IEEE, New York (2011)
24.
Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Van Gool, L.: A comparison of affine region detectors. Int. J. Comput. Vis. 65(1–2), 43–72 (2005)
25.
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: European Conference on Computer Vision, pp. 490–503. Springer, New York (2006)
26.
Perona, P., Malik, J.: Scale-space and edge detection using anisotropic diffusion. Trans. Pattern Anal. Mach. Intell. 12(7), 629–639 (1990)
27.
Porat, M., Zeevi, Y.Y.: The generalized Gabor scheme of image representation in biological and machine vision. Trans. Pattern Anal. Mach. Intell. 10(4), 452–468 (1988)
28.
Ren, X., Malik, J.: Learning a classification model for segmentation. In: Proceedings of the International Conference on Computer Vision, pp. 10–17. IEEE, New York (2003)
29.
Rudin, L.I., Osher, S., Fatemi, E.: Nonlinear total variation based noise removal algorithms. Phys. D: Nonlinear Phenom. 60(1), 259–268 (1992)
30.
Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. Trans. Pattern Anal. Mach. Intell. 19(5), 530–534 (1997)
31.
Schmidt, U., Roth, S.: Learning rotation-aware features: from invariant priors to equivariant descriptors. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 2050–2057. IEEE, New York (2012)
32.
Shechtman, E., Irani, M.: Matching local self-similarities across images and videos. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, New York (2007)
33.
Shi, J., Malik, J.: Normalized cuts and image segmentation. Trans. Pattern Anal. Mach. Intell. 22(8), 888–905 (2000)
34.
Simonyan, K., Vedaldi, A., Zisserman, A.: Descriptor learning using convex optimisation. In: European Conference on Computer Vision, pp. 243–256. Springer, New York (2012)
35.
Simonyan, K., Vedaldi, A., Zisserman, A.: Learning local feature descriptors using convex optimisation. Trans. Pattern Anal. Mach. Intell. 12, 25–70 (2014)
36.
Stein, A., Hebert, M.: Incorporating background invariance into feature-based object recognition. In: Application of Computer Vision, vol. 1, pp. 37–44. IEEE, 7th IEEE Workshop on Applications of Computer Vision (WACV), New York (2005)
37.
Strecha, C., Tuytelaars, T., Van Gool, L.: Dense matching of multiple wide-baseline views. In: Proceedings of the International Conference on Computer Vision, pp. 1194–1201. IEEE, New York (2003)
38.
Strecha, C., von Hansen, W., Van Gool, L., Fua, P., Thoennessen, U.: On benchmarking camera calibration and multi-view stereo for high resolution imagery. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, New York (2008)
39.
Strecha, C., Bronstein, A.M., Bronstein, M.M., Fua, P.: LDA-hash: improved matching with smaller descriptors. Trans. Pattern Anal. Mach. Intell. 34(1), 66–78 (2012)
40.
Tola, E., Lepetit, V., Fua, P.: Daisy: An efficient dense descriptor applied to wide-baseline stereo. Trans. Pattern Anal. Mach. Intell. 32(5), 815–830 (2010)
41.
Tomasi, C., Manduchi, R.: Bilateral filtering for gray and color images. In: Proceedings of the International Conference on Computer Vision, pp. 839–846. IEEE, New York (1998)
43.
Trulls, E., Kokkinos, I., Sanfeliu, A., Moreno-Noguer, F.: Dense segmentation-aware descriptors. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 2890–2897. IEEE, New York (2013)
44.
Trulls, E., Tsogkas, S., Kokkinos, I., Sanfeliu, A., Moreno-Noguer, F.: Segmentation-aware deformable part models. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 168–175. IEEE, New York (2014)
46.
Winder, S., Hua, G., Brown, M.: Picking the best daisy. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 178–185. IEEE, New York (2009)
47.
Wolberg, G., Zokai, S.: Robust image registration using log-polar transform. In: International Conference on Pattern Recognition, vol. 1, pp. 493–496. IEEE, New York (2000)
48.
Yao, J., Cham, W.K.: 3D modeling and rendering from multiple wide-baseline images by match propagation. Signal Process. Image Commun. 21(6), 506–518 (2006)
Metadata
Title: Dense Segmentation-Aware Descriptors
Authors: Eduard Trulls, Iasonas Kokkinos, Alberto Sanfeliu, Francesc Moreno-Noguer
Copyright Year: 2016
DOI: https://doi.org/10.1007/978-3-319-23048-1_5