Skip to main content
Top

2019 | OriginalPaper | Chapter

Stacked Mixed-Scale Networks for Human Pose Estimation

Authors : Xuan Wang, Zhi Li, Yanan Chen, Peilin Jiang, Fei Wang

Published in: PRICAI 2019: Trends in Artificial Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Human pose estimation is an important problem in computer vision, which has been dominated by deep learning techniques in recent years. In this paper, we propose a novel model, named Mixed-Scale Dense Block, that exploits dilation convolution layers and dense concatenation connections to maximise the information flow through the block. Consequently, it captures the feature representation in different scales more effectively and efficiently. Comparing with the baseline method, Hourglass models, our model employs fewer learning parameters. Nevertheless, experiments demonstrate that the proposed model produces more accurate predictions. Meanwhile, our method achieves the comparable accuracy to state-of-the-art techniques. Especially in some indicators, our approach has better performance. In addition, this model is easy to implement and could be improved by most existing techniques that are adopted to promote the hourglass models.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Andriluka, M., Roth, S., Schiele, B.: Pictorial structures revisited: people detection and articulated pose estimation. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 1014–1021. IEEE, Miami (2009) Andriluka, M., Roth, S., Schiele, B.: Pictorial structures revisited: people detection and articulated pose estimation. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 1014–1021. IEEE, Miami (2009)
2.
go back to reference Andriluka, M., Pishchulin, L., Gehler, P., Schiele, B.: 2D human pose estimation: new benchmark and state of the art analysis. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 3686–3693. IEEE, Columbus (2014) Andriluka, M., Pishchulin, L., Gehler, P., Schiele, B.: 2D human pose estimation: new benchmark and state of the art analysis. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 3686–3693. IEEE, Columbus (2014)
4.
go back to reference Chen, Y., Shen, C., Wei, X.S., Liu, L., Yang, J.: Adversarial PoseNet: a structure-aware convolutional network for human pose estimation. In: Proceedings of International Conference on Computer Vision, pp. 1221–1230. IEEE, Venice (2017) Chen, Y., Shen, C., Wei, X.S., Liu, L., Yang, J.: Adversarial PoseNet: a structure-aware convolutional network for human pose estimation. In: Proceedings of International Conference on Computer Vision, pp. 1221–1230. IEEE, Venice (2017)
6.
go back to reference Chu, X., Yang, W., Ouyang, W., Ma, C., Yuille, A.L., Wang X.: Multi-context attention for human pose estimation. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 5669–5678. IEEE, Honolulu (2017) Chu, X., Yang, W., Ouyang, W., Ma, C., Yuille, A.L., Wang X.: Multi-context attention for human pose estimation. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 5669–5678. IEEE, Honolulu (2017)
7.
go back to reference Dantone, M., Gall, J., Leistner, C., van Gool, L.: Human pose estimation using body parts dependent joint regressors. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 3041–3048. IEEE, Portland (2013) Dantone, M., Gall, J., Leistner, C., van Gool, L.: Human pose estimation using body parts dependent joint regressors. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 3041–3048. IEEE, Portland (2013)
8.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 770–778. IEEE, Las Vegas (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 770–778. IEEE, Las Vegas (2016)
11.
go back to reference Huang, G., Liu, Z., Maaten, L.V.D., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 2261–2269. IEEE, Honolulu (2017) Huang, G., Liu, Z., Maaten, L.V.D., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 2261–2269. IEEE, Honolulu (2017)
12.
13.
go back to reference Larsson, G., Maire, M., Shakhnarovich, G.: FractalNet: ultra-deep neural networks without residuals. In: Proceedings of International Conference on Learning Representations, Toulon (2017) Larsson, G., Maire, M., Shakhnarovich, G.: FractalNet: ultra-deep neural networks without residuals. In: Proceedings of International Conference on Learning Representations, Toulon (2017)
14.
go back to reference Luvizon, D.C., Tabia, H., Picard, D.: Human pose regression by combining indirect part detection and contextual information. arXiv preprint arXiv:1710.02322 (2017) Luvizon, D.C., Tabia, H., Picard, D.: Human pose regression by combining indirect part detection and contextual information. arXiv preprint arXiv:​1710.​02322 (2017)
15.
go back to reference Mehta, S., Mercan, E., Bartlett, J., Weaver, D.L., Elmore, J.G., Shapiro, L.G.: Learning to segment breast biopsy whole slide images. arXiv preprint arXiv:1709.02554 (2017) Mehta, S., Mercan, E., Bartlett, J., Weaver, D.L., Elmore, J.G., Shapiro, L.G.: Learning to segment breast biopsy whole slide images. arXiv preprint arXiv:​1709.​02554 (2017)
17.
go back to reference Ning, G., Zhang, Z., He, Z.: Knowledge-guided deep fractal neural networks for human pose estimation. IEEE Trans. Multimedia 20(5), 1246–1259 (2018)CrossRef Ning, G., Zhang, Z., He, Z.: Knowledge-guided deep fractal neural networks for human pose estimation. IEEE Trans. Multimedia 20(5), 1246–1259 (2018)CrossRef
18.
go back to reference Srivastava, R.K., Greff, K., Schmidhuber, J.: Training very deep networks. In: Proceedings of Advances in Neural Information Processing Systems, pp. 2377–2385. Curran Associates, Montreal (2015) Srivastava, R.K., Greff, K., Schmidhuber, J.: Training very deep networks. In: Proceedings of Advances in Neural Information Processing Systems, pp. 2377–2385. Curran Associates, Montreal (2015)
19.
go back to reference Sun, D., Yang, X., Liu, M., Kautz, J.: PWC-net: CNNs for optical flow using pyramid, warping, and cost volume. arXiv preprint arXiv:1709.02371 (2017) Sun, D., Yang, X., Liu, M., Kautz, J.: PWC-net: CNNs for optical flow using pyramid, warping, and cost volume. arXiv preprint arXiv:​1709.​02371 (2017)
20.
go back to reference Wei, S., Ramakrishna, V., Kanade, T., Sheikh, Y.: Convolutional pose machines. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 4724–4732. IEEE, Las Vegas (2016) Wei, S., Ramakrishna, V., Kanade, T., Sheikh, Y.: Convolutional pose machines. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, pp. 4724–4732. IEEE, Las Vegas (2016)
21.
go back to reference Yang, W., Li, S., Ouyang, W., Li, H., Wang, X.: Learning feature pyramids for human pose estimation. In: Proceedings of International Conference on Computer Vision, pp. 1290–1299. IEEE, Venice (2017) Yang, W., Li, S., Ouyang, W., Li, H., Wang, X.: Learning feature pyramids for human pose estimation. In: Proceedings of International Conference on Computer Vision, pp. 1290–1299. IEEE, Venice (2017)
22.
go back to reference Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. In: Proceedings of International Conference on Learning Representations, San Juan (2016) Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. In: Proceedings of International Conference on Learning Representations, San Juan (2016)
Metadata
Title
Stacked Mixed-Scale Networks for Human Pose Estimation
Authors
Xuan Wang
Zhi Li
Yanan Chen
Peilin Jiang
Fei Wang
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-29908-8_18

Premium Partner