Skip to main content
Top

2018 | OriginalPaper | Chapter

Affine Transformation Capsule Net

Authors : Runkun Lu, Jianwei Liu, Siming Lian, Xin Zuo

Published in: Trends and Applications in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

CapsNet is a great attempt to relieve the drawback of CNN, where the routing by agreement method is tolerant to small changes in the viewpoint on one entity inside an image. This ground breaking has attracted the attention of many researchers, however, original CapsNet only utilizes the length of digit capsules in the classification task, which ignores the information of orientation. Based on this, we propose an Affine Transformation Capsule Net (AT-CapsNet) which we leverage both of the length and orientation information of digit capsules by adding a single-layer perceptron substitutes for the operation of computing length of vectors. In addition, we explain AT-CapsNet model’s architecture from five perspectives and further analyse model complexity and the difference between dynamic routing and attention mechanism. The experimental results outperform the efficiency of our proposed algorithm in real world data sets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Frosst, N., Hinton, G.E., Sabour, S.: Matrix capsules with em routing. In: International Conference on Learning Representations, accepted as poster, Vancouver, BC, Canada (2018) Frosst, N., Hinton, G.E., Sabour, S.: Matrix capsules with em routing. In: International Conference on Learning Representations, accepted as poster, Vancouver, BC, Canada (2018)
3.
go back to reference Sabour, S., Frosst, N., Hinton, G.E.: Dynamic routing between capsules. In: Advances in Neural Information Processing Systems, pp. 3859–3869, Long Beach, CA, USA (2017) Sabour, S., Frosst, N., Hinton, G.E.: Dynamic routing between capsules. In: Advances in Neural Information Processing Systems, pp. 3859–3869, Long Beach, CA, USA (2017)
4.
go back to reference Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: The all convolutional net. arXiv preprint arXiv:1412.6806 (2014) Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: The all convolutional net. arXiv preprint arXiv:​1412.​6806 (2014)
5.
go back to reference Zeiler, M.D., Fergus, R.: Stochastic pooling for regularization of deep convolutional neural networks. arXiv preprint arXiv:1301.3557 (2013) Zeiler, M.D., Fergus, R.: Stochastic pooling for regularization of deep convolutional neural networks. arXiv preprint arXiv:​1301.​3557 (2013)
6.
go back to reference LeCun, Y., Boser, B.E., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W.E., Jackel, L.D.: Handwritten digit recognition with a back-propagation network. In: Advances in Neural Information Processing Systems, pp. 396–404, Morgan Kaufmann, Denver, Colorado, USA (1990) LeCun, Y., Boser, B.E., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W.E., Jackel, L.D.: Handwritten digit recognition with a back-propagation network. In: Advances in Neural Information Processing Systems, pp. 396–404, Morgan Kaufmann, Denver, Colorado, USA (1990)
7.
go back to reference Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Advances in Neural Information Processing Systems, pp. 2017–2025, Montreal, Quebec, Canada (2015) Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Advances in Neural Information Processing Systems, pp. 2017–2025, Montreal, Quebec, Canada (2015)
8.
go back to reference Jia, X., De Brabandere, B., Tuytelaars, T., Gool, L.V.: Dynamic filter networks. In: Advances in Neural Information Processing Systems, pp. 667–675, Barcelona, Spain (2016) Jia, X., De Brabandere, B., Tuytelaars, T., Gool, L.V.: Dynamic filter networks. In: Advances in Neural Information Processing Systems, pp. 667–675, Barcelona, Spain (2016)
10.
go back to reference Cohen, T., Welling, M.: Group equivariant convolutional networks. In: International Conference on Machine Learning, pp. 2990–2999, New York City, NY, USA (2016) Cohen, T., Welling, M.: Group equivariant convolutional networks. In: International Conference on Machine Learning, pp. 2990–2999, New York City, NY, USA (2016)
11.
go back to reference Dieleman, S., De Fauw, J., Kavukcuoglu, K.: Exploiting cyclic symmetry in convolutional neural networks. arXiv preprint arXiv:1602.02660 (2016) Dieleman, S., De Fauw, J., Kavukcuoglu, K.: Exploiting cyclic symmetry in convolutional neural networks. arXiv preprint arXiv:​1602.​02660 (2016)
12.
go back to reference Oyallon, E., Mallat, S.: Deep roto-translation scattering for object classification in CVPR. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2865–2873, Boston, MA, USA (2015) Oyallon, E., Mallat, S.: Deep roto-translation scattering for object classification in CVPR. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2865–2873, Boston, MA, USA (2015)
15.
go back to reference Yao, L., et al.: Video description generation incorporating spatio-temporal features and a soft-attention mechanism. arXiv preprint arXiv:1502.08029 (2015) Yao, L., et al.: Video description generation incorporating spatio-temporal features and a soft-attention mechanism. arXiv preprint arXiv:​1502.​08029 (2015)
16.
go back to reference Xiao, H., Rasul, K., Vollgraf, R.: Fashionmnist: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747 (2017) Xiao, H., Rasul, K., Vollgraf, R.: Fashionmnist: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:​1708.​07747 (2017)
17.
go back to reference Goodfellow, I.J., et al.: Generative adversarial nets. In: International Conference on Neural Information Processing Systems MIT Press, pp. 2672–2680, Montreal, Quebec, Canada (2014) Goodfellow, I.J., et al.: Generative adversarial nets. In: International Conference on Neural Information Processing Systems MIT Press, pp. 2672–2680, Montreal, Quebec, Canada (2014)
18.
go back to reference Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015) Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:​1511.​06434 (2015)
Metadata
Title
Affine Transformation Capsule Net
Authors
Runkun Lu
Jianwei Liu
Siming Lian
Xin Zuo
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-04503-6_24

Premium Partner