
2020 | Original Paper | Book Chapter

Deep Hough Transform for Semantic Line Detection

Authors: Qi Han, Kai Zhao, Jun Xu, Ming-Ming Cheng

Published in: Computer Vision – ECCV 2020

Publisher: Springer International Publishing


Abstract

In this paper, we put forward a simple yet effective method to detect meaningful straight lines, a.k.a. semantic lines, in given scenes. Prior methods treat line detection as a special case of object detection while neglecting the inherent characteristics of lines, leading to less efficient and suboptimal results. We propose a one-shot end-to-end framework that incorporates the classical Hough transform into deeply learned representations. By parameterizing lines with slopes and biases, we perform the Hough transform to translate deep representations into the parametric space and then detect lines directly in that space. More concretely, we aggregate features along candidate lines on the feature map plane and then assign the aggregated features to the corresponding locations in the parametric domain. Consequently, the problem of detecting semantic lines in the spatial domain is transformed into spotting individual points in the parametric domain, making the post-processing steps, i.e. non-maximal suppression, more efficient. Furthermore, our method makes it easy to extract contextual line features, which are critical to accurate line detection. Experimental results on a public dataset demonstrate the advantages of our method over state-of-the-art methods. Code is available at https://mmcheng.net/dhtline/.
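
The aggregation step described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation. It assumes an angle/offset line parameterization (a common Hough-transform choice that avoids infinite slopes, whereas the abstract speaks of slopes and biases), a plain array in place of learned deep features, and a 3x3 local-maximum test as a stand-in for non-maximal suppression. The function names `hough_aggregate` and `find_peaks`, and the bin counts, are hypothetical.

```python
import numpy as np


def hough_aggregate(features, num_angles=60, num_offsets=60):
    """Sum features along candidate lines.

    features: array of shape (C, H, W).
    Returns an accumulator of shape (C, num_angles, num_offsets) and the
    angle (in radians) that belongs to each angle bin.
    """
    c, h, w = features.shape
    # Pixel coordinates measured from the centre of the feature map.
    ys, xs = np.mgrid[0:h, 0:w]
    xs = xs.ravel() - (w - 1) / 2.0
    ys = ys.ravel() - (h - 1) / 2.0
    flat = features.reshape(c, -1)

    # Largest possible distance of a line from the centre.
    max_r = np.hypot(h, w) / 2.0
    thetas = np.arange(num_angles) * np.pi / num_angles
    accumulator = np.zeros((c, num_angles, num_offsets), dtype=np.float64)

    for ti, theta in enumerate(thetas):
        # Signed offset of the line with normal angle theta through each pixel.
        r = xs * np.cos(theta) + ys * np.sin(theta)
        # Quantise the offset into one of num_offsets bins.
        ri = np.round((r + max_r) / (2 * max_r) * (num_offsets - 1)).astype(int)
        for ch in range(c):
            # All pixels on the same quantised line add up in one bin.
            accumulator[ch, ti] = np.bincount(
                ri, weights=flat[ch], minlength=num_offsets
            )
    return accumulator, thetas


def find_peaks(score_map, threshold):
    """Return (angle_idx, offset_idx) pairs that are 3x3 local maxima
    of a 2D score map and exceed the threshold."""
    h, w = score_map.shape
    padded = np.pad(score_map, 1, mode="constant", constant_values=-np.inf)
    # Maximum over each 3x3 neighbourhood (wrap-around of the angle axis
    # is ignored in this sketch).
    shifted = [padded[dy:dy + h, dx:dx + w] for dy in range(3) for dx in range(3)]
    neighbourhood_max = np.max(shifted, axis=0)
    keep = (score_map >= neighbourhood_max) & (score_map > threshold)
    return list(zip(*np.nonzero(keep)))
```

For example, a 32x32 single-channel map whose only non-zero entries form one horizontal row of ones yields a single peak at the angle bin for π/2, because all 32 pixels of that row fall into the same (angle, offset) bin. Detecting the line therefore reduces to finding one point in the parametric domain.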

Metadata
Title
Deep Hough Transform for Semantic Line Detection
Authors
Qi Han
Kai Zhao
Jun Xu
Ming-Ming Cheng
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-58545-7_15
