Skip to main content
Top

2018 | OriginalPaper | Chapter

A Fine-Grained Filtered Viewpoint Informed Keypoint Prediction from 2D Images

Authors : Qingnan Li, Ruimin Hu, Yixin Chen, Jingwen Yan, Jing Xiao

Published in: Advances in Multimedia Information Processing – PCM 2017

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Viewpoint informed keypoint prediction from 2D images is an essential task in computer vision, which captures the fine details of rigid objects, however, the cases of ambiguous viewpoint predicted by the convolutional neural network, especially for two peaks of high confidence viewpoint proposals, may specify a set of erroneous keypoints. To address the above issue, we present multiscale convolutional neural networks and propose a filter to ensure high confidence viewpoint informed, which provides a global perspective for keypoint prediction. Leveraging the global precedence, we combine multiscale local appearance based keypoint likelihood with filtered viewpoint conditioned likelihood to induce a considerable performance gain. Experimentally, we show that our framework outperforms state-of-the-art methods on PASCAL 3D benchmark.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Long, J., Zhang, N., Darrell, T.: Do convnets learn correspondence? In: Advances in Neural Information Processing Systems, vol. 2, pp. 1601–1609 (2014) Long, J., Zhang, N., Darrell, T.: Do convnets learn correspondence? In: Advances in Neural Information Processing Systems, vol. 2, pp. 1601–1609 (2014)
2.
go back to reference Su, H., Qi, C.R., Li, Y., Guibas, L.J.: Render for CNN: viewpoint estimation in images using CNNs trained with rendered 3D model views. In: IEEE International Conference on Computer Vision, pp. 2686–2694 (2014) Su, H., Qi, C.R., Li, Y., Guibas, L.J.: Render for CNN: viewpoint estimation in images using CNNs trained with rendered 3D model views. In: IEEE International Conference on Computer Vision, pp. 2686–2694 (2014)
3.
go back to reference Yang, Y., Ramanan, D.: Articulated pose estimation using flexible mixtures of parts. In: Computer Vision & Pattern Recognition, vol. 32, no. 14, pp. 1385–1392 (2011) Yang, Y., Ramanan, D.: Articulated pose estimation using flexible mixtures of parts. In: Computer Vision & Pattern Recognition, vol. 32, no. 14, pp. 1385–1392 (2011)
4.
go back to reference Gkioxari, G., Hariharan, B., Girshick, R., Malik, J.: Using k-poselets for detecting people and localizing their keypoints. In: Computer Vision & Pattern Recognition, pp. 3582–3589 (2014) Gkioxari, G., Hariharan, B., Girshick, R., Malik, J.: Using k-poselets for detecting people and localizing their keypoints. In: Computer Vision & Pattern Recognition, pp. 3582–3589 (2014)
5.
go back to reference Tulsiani, S., Malik, J.: Viewpoints and keypoints. In: Computer Vision & Pattern Recognition, pp. 1510–1519 (2015) Tulsiani, S., Malik, J.: Viewpoints and keypoints. In: Computer Vision & Pattern Recognition, pp. 1510–1519 (2015)
6.
go back to reference Zhang, N., Shelhamer, E., Gao, Y., Darrell, T.: Fine-grained pose prediction, normalization, and recognition. Comput. Sci. 69(2), 207–221 (2016) Zhang, N., Shelhamer, E., Gao, Y., Darrell, T.: Fine-grained pose prediction, normalization, and recognition. Comput. Sci. 69(2), 207–221 (2016)
7.
go back to reference Gkioxari, G., Arbelaez, P., Bourdev, L., Malik, J.: Articulated pose estimation using discriminative armlet classifiers. In: IEEE International Conference on Computer Vision, vol. 9, no. 4, pp. 3342–3349 (2013) Gkioxari, G., Arbelaez, P., Bourdev, L., Malik, J.: Articulated pose estimation using discriminative armlet classifiers. In: IEEE International Conference on Computer Vision, vol. 9, no. 4, pp. 3342–3349 (2013)
8.
go back to reference Bourdev, L., Maji, S., Brox, T., Malik, J.: Detecting people using mutually consistent poselet activations. In: European Conference on Computer Vision, pp. 168–181 (2010)CrossRef Bourdev, L., Maji, S., Brox, T., Malik, J.: Detecting people using mutually consistent poselet activations. In: European Conference on Computer Vision, pp. 168–181 (2010)CrossRef
9.
go back to reference Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results (2012) Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results (2012)
10.
go back to reference Felzenszwalb, P.F., Girshick, R.B., Mcallester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)CrossRef Felzenszwalb, P.F., Girshick, R.B., Mcallester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)CrossRef
11.
go back to reference Chabot, F., Chaouch, M., Rabarisoa, J., Teulire, C., Chateau, T.: Deep MANTA: a coarse-to-fine many-task network for joint 2D and 3D vehicle analysis from monocular image. In: Computer Vision & Pattern Recognition (2017) Chabot, F., Chaouch, M., Rabarisoa, J., Teulire, C., Chateau, T.: Deep MANTA: a coarse-to-fine many-task network for joint 2D and 3D vehicle analysis from monocular image. In: Computer Vision & Pattern Recognition (2017)
12.
go back to reference Mousavian, A., Anguelov, D., Flynn, J., Kosecka, J.: 3D bounding box estimation using deep learning and geometry. In: Computer Vision & Pattern Recognition (2017) Mousavian, A., Anguelov, D., Flynn, J., Kosecka, J.: 3D bounding box estimation using deep learning and geometry. In: Computer Vision & Pattern Recognition (2017)
13.
go back to reference Xiang, Y., Choi, W., Lin, Y., Savarese, S.: Subcategory-aware convolutional neural networks for object proposals and detection. In: IEEE Winter Conference on Applications of Computer Vision (2017) Xiang, Y., Choi, W., Lin, Y., Savarese, S.: Subcategory-aware convolutional neural networks for object proposals and detection. In: IEEE Winter Conference on Applications of Computer Vision (2017)
14.
go back to reference Toshev, A., Szegedy, C.: DeepPose: human pose estimation via deep neural networks. In: Computer Vision & Pattern Recognition, pp. 1653–1660 (2014) Toshev, A., Szegedy, C.: DeepPose: human pose estimation via deep neural networks. In: Computer Vision & Pattern Recognition, pp. 1653–1660 (2014)
15.
go back to reference Tompson, J., Jain, A., Lecun, Y., Bregler, C.: Joint training of a convolutional network and a graphical model for human pose estimation. In: Eprint Arxiv, pp. 1799–1807 (2014) Tompson, J., Jain, A., Lecun, Y., Bregler, C.: Joint training of a convolutional network and a graphical model for human pose estimation. In: Eprint Arxiv, pp. 1799–1807 (2014)
Metadata
Title
A Fine-Grained Filtered Viewpoint Informed Keypoint Prediction from 2D Images
Authors
Qingnan Li
Ruimin Hu
Yixin Chen
Jingwen Yan
Jing Xiao
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-77383-4_17