2016 | OriginalPaper | Book chapter

Facilitating and Exploring Planar Homogeneous Texture for Indoor Scene Understanding

Authors: Shahzor Ahmad, Loong-Fah Cheong

Published in: Computer Vision – ECCV 2016

Publisher: Springer International Publishing

Abstract

Indoor scenes tend to abound with planar homogeneous texture, which manifests as regularly repeating scene elements along a plane. In this work, we propose to exploit such structure to facilitate high-level scene understanding. By robustly fitting a texture projection model to optimal dominant frequency estimates in image patches, we arrive at a projective-invariant method to localize such semantically meaningful regions in multi-planar scenes. The recovered projective parameters also allow an affine-ambiguous rectification in real-world images marred by outliers, room clutter, and photometric severities. Qualitative and quantitative results show that our method outperforms existing representative work in both rectification and detection. We then explore the potential of homogeneous texture for two indoor scene understanding tasks. First, in scenes where vanishing points cannot be reliably detected, or where the Manhattan assumption is not satisfied, homogeneous texture detected by the proposed approach provides alternative cues for obtaining the geometric layout of an indoor scene. Second, low-level feature descriptors extracted after affine rectification of the detected texture are found to be not only class-discriminative but also complementary to features extracted without rectification, improving recognition performance on the MIT Indoor67 benchmark.
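
As a rough illustration of what an affine-ambiguous rectification involves, the sketch below uses the standard textbook construction (see Hartley and Zisserman [35]): a homography whose last row is the plane's vanishing line sends that line to infinity, which removes projective distortion up to an affine transformation. It is not the authors' pipeline. The vanishing line is assumed to be already known, and the function names and the example line are hypothetical.

```python
from typing import List, Sequence, Tuple


def affine_rectifying_homography(vanishing_line: Sequence[float]) -> List[List[float]]:
    """Build H = [[1,0,0],[0,1,0],[l1/l3, l2/l3, 1]] for a vanishing line (l1, l2, l3).

    Assumption: the line is given in homogeneous image coordinates and does not
    pass through the image origin (l3 != 0).
    """
    l1, l2, l3 = vanishing_line
    if abs(l3) < 1e-12:
        raise ValueError("vanishing line passes through the origin; shift the coordinates first")
    return [
        [1.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [l1 / l3, l2 / l3, 1.0],
    ]


def apply_homography(H: List[List[float]], point: Sequence[float]) -> Tuple[float, ...]:
    """Apply a 3x3 homography to a homogeneous point (x, y, w)."""
    return tuple(sum(H[r][c] * point[c] for c in range(3)) for r in range(3))


# Hypothetical example: the horizon of a floor plane lies at y = 100 pixels,
# i.e. the line 0*x + 1*y - 100 = 0.
line = (0.0, 1.0, -100.0)
H = affine_rectifying_homography(line)

on_horizon = apply_homography(H, (50.0, 100.0, 1.0))  # a point on the vanishing line
below_horizon = apply_homography(H, (50.0, 200.0, 1.0))  # an ordinary point on the plane

print(on_horizon)
print(below_horizon)
```

After the mapping, a point on the vanishing line has a zero third coordinate (it lies at infinity), while other points stay finite. Lines that are parallel on the scene plane therefore become parallel in the rectified image, although angles and length ratios remain distorted by an unknown affine map.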

Footnotes
1
Since our detector is not “trained” to produce an exact bounding box, our definitions of these parameters differ somewhat from those used in object detection [38]. Object detection methodology counts every detection beyond the first for a given ground truth as a false positive (FP), whereas in our scenario all such detections are counted as true positives (TPs).
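
The difference between the two counting conventions can be made concrete with a small sketch. It assumes each detection has already been matched to a ground-truth index, or to `None` if it matches nothing. The matching rule is not specified here, and the function name and the example data are hypothetical.

```python
from typing import Optional, Sequence, Tuple


def count_tp_fp(matches: Sequence[Optional[int]], duplicates_are_fp: bool) -> Tuple[int, int]:
    """Count true and false positives for a list of detections.

    matches[i] is the index of the ground truth that detection i was matched to,
    or None if it matched no ground truth (always a false positive).
    duplicates_are_fp=True  -> object-detection convention: only the first
                               detection per ground truth is a TP.
    duplicates_are_fp=False -> convention described in the footnote: every
                               matched detection is a TP.
    """
    seen = set()
    tp = fp = 0
    for gt in matches:
        if gt is None:
            fp += 1
        elif duplicates_are_fp and gt in seen:
            fp += 1
        else:
            tp += 1
            seen.add(gt)
    return tp, fp


# Hypothetical example: three detections hit ground truth 0, one hits ground
# truth 1, and one hits nothing.
matches = [0, 0, 1, None, 0]
print(count_tp_fp(matches, duplicates_are_fp=True))   # (2, 3)
print(count_tp_fp(matches, duplicates_are_fp=False))  # (4, 1)
```

With the same detections, the object-detection convention yields 2 TPs and 3 FPs, while the convention in the footnote yields 4 TPs and 1 FP, so precision figures from the two conventions are not directly comparable.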
 
References
1. Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: CVPR (2009)
2. Picard, R.W.: A society of models for video and image libraries. IBM Syst. J. 35(3.4), 292–312 (2010)
3. Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Gool, L.V.: A comparison of affine region detectors. IJCV 65(1–2), 43–72 (2005)
4. Coughlan, J.M., Yuille, A.L.: Manhattan world: compass direction from a single image by Bayesian inference. In: ICCV (1999)
5. Hedau, V., Hoiem, D., Forsyth, D.: Recovering the spatial layout of cluttered rooms. In: ICCV (2009)
6. Aiger, D., Cohen-Or, D., Mitra, N.J.: Repetition maximization based texture rectification. EUROGRAPHICS 31(2pt2), 439–448 (2012)
7. Chum, O., Matas, J.: Planar affine rectification from change of scale. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part IV. LNCS, vol. 6495, pp. 347–360. Springer, Heidelberg (2011)
8. Leung, T., Malik, J.: Detecting, localizing and grouping repeated scene elements from an image. In: Buxton, B., Cipolla, R. (eds.) Computer Vision – ECCV 1996. LNCS, vol. 1064, pp. 546–555. Springer, Heidelberg (1996)
9. Pritts, J., Chum, O., Matas, J.: Detection, rectification and segmentation of coplanar repeated patterns. In: CVPR (2014)
10. Schaffalitzky, F., Zisserman, A.: Geometric grouping of repeated elements within images. In: BMVC (1998)
11. Wu, C., Frahm, J.-M., Pollefeys, M.: Detecting large repetitive structures with salient boundaries. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 142–155. Springer, Heidelberg (2010)
12. Wu, C., Frahm, J.M., Pollefeys, M.: Repetition-based dense single-view reconstruction. In: CVPR (2011)
13. Hong, W., Yang, A.Y., Huang, K., Ma, Y.: On symmetry and multiple-view geometry: Structure, pose, and calibration from a single image. IJCV 60(3), 241–265 (2004)
14. Tuytelaars, T., Turina, A., Gool, L.V.: Noncombinatorial detection of regular repetitions under perspective skew. TPAMI 25(4), 418–432 (2003)
15. Zhang, Z., Liang, X., Ganesh, A., Ma, Y.: TILT: transform invariant low-rank textures. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 314–328. Springer, Heidelberg (2010)
16. Super, B.J., Bovik, A.C.: Planar surface orientation from texture spatial frequencies. Pattern Recogn. 28(5), 729–743 (1995)
17. Rosenholtz, R., Malik, J.: Surface orientation from texture: isotropy or homogeneity (or both)? Vis. Res. 37(16), 2283–2293 (1997)
18. Ribeiro, E., Hancock, E.R.: Estimating the 3D orientation of texture planes using local spectral analysis. Image Vis. Comput. 18(8), 619–631 (2000)
19. Super, B.J., Bovik, A.C.: Three-dimensional orientation from texture using Gabor wavelets. In: Proceedings of the SPIE Visual Communications and Image Processing 1991: Image Processing (1991)
20. Havlicek, J.P., Bovik, A.C., Maragos, P.: Modulation models for image processing and wavelet-based image demodulation. In: Proceedings of the Asilomar Conference on Signals, Systems and Computers (1992)
21. Collins, T., Durou, J., Gurdjos, P., Bartoli, A.: Single-view perspective shape-from-texture with focal length estimation: a piecewise affine approach. In: Proceedings of the 3D Data Processing, Visualization and Transmission (3DPVT) (2010)
22. Criminisi, A., Zisserman, A.: Shape from texture: homogeneity revisited. In: BMVC (2000)
23. Shaw, D., Barnes, N.: Perspective rectangle detection. In: European Conference on Computer Vision Workshop on Applications of Computer Vision (2006)
24. Kosecka, J., Zhang, W.: Extraction, matching and pose recovery based on dominant rectangular structures. In: First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003 (2003)
25. Stella, X.Y., Zhang, H., Malik, J.: Inferring spatial layout from a single image via depth-ordered grouping. In: CVPR Workshop (2008)
26. Pandey, M., Lazebnik, S.: Scene recognition and weakly supervised object localization with deformable part-based models. In: ICCV (2011)
27. Singh, S., Gupta, A., Efros, A.A.: Unsupervised discovery of mid-level discriminative patches. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part II. LNCS, vol. 7573, pp. 73–86. Springer, Heidelberg (2012)
28. Juneja, M., Vedaldi, A., Jawahar, C.V., Zisserman, A.: Blocks that shout: distinctive parts for scene classification. In: CVPR (2013)
29. Doersch, C., Gupta, A., Efros, A.A.: Mid-level visual element discovery as discriminative mode seeking. In: Proceedings of the Neural Information Processing Systems (2013)
30. Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. IJCV 73(2), 213–238 (2007)
31. Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., Vedaldi, A.: Describing textures in the wild. In: CVPR (2014)
32. Patterson, G., Xu, C., Su, H., Hays, J.: The SUN attribute database: beyond categories for deeper scene understanding. IJCV 108(1), 59–81 (2014)
33. Super, B.J., Bovik, A.C.: Shape from texture using local spectral moments. TPAMI 17(4), 333–343 (1995)
34. Krumm, J., Shafer, S.: Shape from periodic texture using the spectrogram. In: CVPR (1992)
35. Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2004). ISBN 0521540518
36. Boykov, Y., Veksler, O., Zabih, R.: Fast approximate energy minimization via graph cuts. TPAMI 23(11), 1222–1239 (2001)
37. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
38. Everingham, M., Eslami, S.M.A., Gool, L.V., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL visual object classes challenge: a retrospective. IJCV 111(1), 98–136 (2014)
39. Rother, C.: A new approach for vanishing point detection in architectural environments. In: BMVC (2000)
40. Hoiem, D., Efros, A.A., Hebert, M.: Recovering surface layout from an image. IJCV 75(1), 151–172 (2007)
41. Chatfield, K., Lempitsky, V., Vedaldi, A., Zisserman, A.: The devil is in the details: an evaluation of recent feature encoding methods. In: BMVC (2011)
44. Wu, J., Rehg, J.M.: CENTRIST: a visual descriptor for scene categorization. TPAMI 33(8), 1489–1501 (2011)
45. Ojala, T., Pietikäinen, M., Mäenpää, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. TPAMI 24(7), 971–987 (2002)
46. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 60(2), 91–110 (2004)
47. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
48. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: ICCV (2005)
49. Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. TPAMI 32(9), 1627–1645 (2010)
50. Xie, L., Wang, J., Guo, B., Zhang, B., Tian, Q.: Orientational pyramid matching for recognizing indoor scenes. In: CVPR (2014)
51. Zuo, Z., Wang, G., Shuai, B., Zhao, L., Yang, Q., Jiang, X.: Learning discriminative and shareable features for scene classification. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part I. LNCS, vol. 8689, pp. 552–568. Springer, Heidelberg (2014)
52. Gong, Y., Wang, L., Guo, R., Lazebnik, S.: Multi-scale orderless pooling of deep convolutional activation features. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part VII. LNCS, vol. 8695, pp. 392–407. Springer, Heidelberg (2014)
53. Lin, D., Lu, C., Liao, R., Jia, J.: Learning important spatial pooling regions for scene classification. In: CVPR (2014)
Metadata
Title
Facilitating and Exploring Planar Homogeneous Texture for Indoor Scene Understanding
Authors
Shahzor Ahmad
Loong-Fah Cheong
Copyright year
2016
DOI
https://doi.org/10.1007/978-3-319-46475-6_3