Skip to main content
Top

2021 | OriginalPaper | Chapter

Scale-Aware Network with Attentional Selection for Human Pose Estimation

Authors : Tianqi Lv, Lingrui Wu, Junhua Zhou, Zhonghua Liao, Xiang Zhai

Published in: Human Centered Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Human pose estimation is a fundamental yet challenging task in computer vision. Human pose estimation from a single image is a challenging problem due to the limited information of 2D images and the large variations in configuration and appearance of body parts. Recent works has largely improved the result of human pose estimation because of the development of convolutional neural network. However, there still exists many difficult cases, such as occluded keypoints, complex background and scale variations of human body keypoints, which cannot be well dealt with. In this paper, we design a novel scale-aware network with attentional selection that extracts multi-scale semantic information and meaningful features. Specifically, we propose a Feature Pyramid Supervision Module (FPSM), which can improve the estimation accuracy of scale variations. Meanwhile, a Spatial and Channel Attention Module (SCAM) is designed for recalibrating the spatial and channel features. Based on the proposed algorithm, we achieve state-of-the-art result on LSP dataset and make competitive performance on MPII Human Pose dataset.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Toshev, A., Szegedy, C.: Deeppose: human pose estimation via deep neural networks (2013) Toshev, A., Szegedy, C.: Deeppose: human pose estimation via deep neural networks (2013)
2.
go back to reference Wei, S., Ramakrishna, V., Kanade, T., Sheikh, Y.: Convolutional pose machines (2016) Wei, S., Ramakrishna, V., Kanade, T., Sheikh, Y.: Convolutional pose machines (2016)
5.
go back to reference Chou, C.J., Chien, J.T., Chen, H.T.: Self adversarial training for human pose estimation (2017) Chou, C.J., Chien, J.T., Chen, H.T.: Self adversarial training for human pose estimation (2017)
6.
go back to reference Ke, L., Chang, M.C., Qi, H., Lyu, S.: Multi-scale structure-aware network for human pose estimation (2018) Ke, L., Chang, M.C., Qi, H., Lyu, S.: Multi-scale structure-aware network for human pose estimation (2018)
7.
go back to reference Yang, W., Li, S., Ouyang, W., Li, H., Wang, X.: Learning feature pyramids for human pose estimation. In: 2017 Computer Vision and Pattern Recognition (2014) Yang, W., Li, S., Ouyang, W., Li, H., Wang, X.: Learning feature pyramids for human pose estimation. In: 2017 Computer Vision and Pattern Recognition (2014)
8.
go back to reference Andriluka, M., Pishchulin, L., Gehler, P., Schiele, B.: 2d human pose estimation: new benchmark and state of the art analysis. In: Computer Vision and Pattern Recognition (2014) Andriluka, M., Pishchulin, L., Gehler, P., Schiele, B.: 2d human pose estimation: new benchmark and state of the art analysis. In: Computer Vision and Pattern Recognition (2014)
9.
go back to reference Johnson, S., Everingham, M.: Clustered pose and nonlinear appearance models for human pose estimation. In: British Machine Vision Conference (2010) Johnson, S., Everingham, M.: Clustered pose and nonlinear appearance models for human pose estimation. In: British Machine Vision Conference (2010)
10.
11.
go back to reference Chu, X., Yang, W., Ouyang, W., Ma, C., Yuille, A.L., Wang, X.: Multi-context attention for human pose estimation. In: Computer Vision and Pattern Recognition (2017) Chu, X., Yang, W., Ouyang, W., Ma, C., Yuille, A.L., Wang, X.: Multi-context attention for human pose estimation. In: Computer Vision and Pattern Recognition (2017)
12.
go back to reference Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2D pose estimation using part affinity fields (2016) Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2D pose estimation using part affinity fields (2016)
13.
go back to reference Tsung, Y.L., Dollar, P., Girshick, R., He, K., Belongie, S.: Feature pyramid networks for object detection (2016) Tsung, Y.L., Dollar, P., Girshick, R., He, K., Belongie, S.: Feature pyramid networks for object detection (2016)
14.
go back to reference Itti, L., Koch, C.: Computational modelling of visual attention. Nat. Rev. Neurosci. 2(3), 194–203 (2001) Itti, L., Koch, C.: Computational modelling of visual attention. Nat. Rev. Neurosci. 2(3), 194–203 (2001)
15.
go back to reference Zhang, H., Goodfellow, I., Metaxas, D., Odena, A.: Self-attention generative adversarial networks (2018) Zhang, H., Goodfellow, I., Metaxas, D., Odena, A.: Self-attention generative adversarial networks (2018)
16.
go back to reference Li, H., Xiong, P., An, J., Wang, L.: Pyramid attention network for semantic segmentation (2018) Li, H., Xiong, P., An, J., Wang, L.: Pyramid attention network for semantic segmentation (2018)
17.
go back to reference Fu, J., Liu, J., Tian, H., Fang, Z., Lu, H.: Dual attention network for scene segmentation (2018) Fu, J., Liu, J., Tian, H., Fang, Z., Lu, H.: Dual attention network for scene segmentation (2018)
18.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
19.
go back to reference Tieleman, T., Hinton, G.: Lecture 6.5-rmsprop: divide the gradient by a running average of its recent magnitude. COURSERA: Neural Netw. Mach. Learn. 4(2), 26–31 (2012) Tieleman, T., Hinton, G.: Lecture 6.5-rmsprop: divide the gradient by a running average of its recent magnitude. COURSERA: Neural Netw. Mach. Learn. 4(2), 26–31 (2012)
20.
go back to reference Ke, S., Lan, C., Xing, J., Zeng, W., Dong, L., Wang, J.: Human pose estimation using global and local normalization. In: IEEE International Conference on Computer Vision (2017) Ke, S., Lan, C., Xing, J., Zeng, W., Dong, L., Wang, J.: Human pose estimation using global and local normalization. In: IEEE International Conference on Computer Vision (2017)
21.
go back to reference Tang, W., Yu, P., Wu, Y.: Deeply learned compositional models for human pose estimation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 190–206 (2018) Tang, W., Yu, P., Wu, Y.: Deeply learned compositional models for human pose estimation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 190–206 (2018)
Metadata
Title
Scale-Aware Network with Attentional Selection for Human Pose Estimation
Authors
Tianqi Lv
Lingrui Wu
Junhua Zhou
Zhonghua Liao
Xiang Zhai
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-70626-5_35

Premium Partner