Skip to main content
Erschienen in: The Journal of Supercomputing 6/2022

10.01.2022

Matrix-product neural network based on sequence block matrix product

verfasst von: Chuanhui Shan, Jun Ou, Xiumei Chen

Erschienen in: The Journal of Supercomputing | Ausgabe 6/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Convolution neural networks (CNNs) based on the discrete convolutional operation have achieved great success in image processing, voice and audio processing, natural language processing and other fields. However, it is still an open problem how to develop new models instead of CNNs. Using the idea of the sequence block matrix product, we propose a novel operation and its corresponding neural network, namely two-dimensional discrete matrix-product operation (TDDMPO) and matrix-product neural network (MPNN). We present the definition of the TDDMPO, a series of its properties and matrix-product theorem in detail, and then construct its corresponding MPNN. Experimental results on Fashion-MNIST, SVHN, FLOWER17 and FLOWER102 datasets show that MPNNs obtain 1.65–13.04% relative performance improvement in comparison with the corresponding CNNs, and the amount of calculation of matrix-product layers of MPNNs obtains 41× to 57× reduction in comparison with the corresponding convolutional layers of CNNs. Hence, it is a potential model that may open some new directions for deep neural networks, particularly alternatives to CNNs.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Hubel DH, Wiesel T (1962) Receptive fields, binocular interaction, and functional architecture in the cats visual cortex. J Physiol 160(1):106–154CrossRef Hubel DH, Wiesel T (1962) Receptive fields, binocular interaction, and functional architecture in the cats visual cortex. J Physiol 160(1):106–154CrossRef
2.
Zurück zum Zitat Wiesel T, Hubel DH (1959) Receptive fields of single neurons in the cats striate cortex. J Physiol 148(3):574–591CrossRef Wiesel T, Hubel DH (1959) Receptive fields of single neurons in the cats striate cortex. J Physiol 148(3):574–591CrossRef
3.
Zurück zum Zitat Fukushima K (1979) Neural network model for a mechanism of pattern recognition unaffected by shift in position-Neocognitron. IEICE Techn Rep 62(10):658–665 Fukushima K (1979) Neural network model for a mechanism of pattern recognition unaffected by shift in position-Neocognitron. IEICE Techn Rep 62(10):658–665
4.
Zurück zum Zitat Fukushima K (1980) Neocognitron: a self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36(4):193–202CrossRef Fukushima K (1980) Neocognitron: a self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36(4):193–202CrossRef
5.
Zurück zum Zitat Fukushima K (2013) Artificial vision by multi-layered neural networks: neocognitron and its advances. Neural Netw 37:103–119CrossRef Fukushima K (2013) Artificial vision by multi-layered neural networks: neocognitron and its advances. Neural Netw 37:103–119CrossRef
6.
Zurück zum Zitat LeCun Y, Boser B, Denker JS et al (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef LeCun Y, Boser B, Denker JS et al (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef
7.
Zurück zum Zitat LeCun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
8.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst, pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst, pp 1097–1105
9.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci
10.
Zurück zum Zitat Russakovsky O, Deng J, Su H et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef Russakovsky O, Deng J, Su H et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef
11.
Zurück zum Zitat Wengrowski E, Purri M, Dana K et al (2019) Deep CNNs as a method to classify rotating objects based on monostatic RCS. IET Radar Sonar Navig 13(7):1092–1100CrossRef Wengrowski E, Purri M, Dana K et al (2019) Deep CNNs as a method to classify rotating objects based on monostatic RCS. IET Radar Sonar Navig 13(7):1092–1100CrossRef
12.
Zurück zum Zitat Wu X, Zhang Z, Zhang W et al (2021) A convolutional neural network based on grouping structure for scene classification. Remote Sens 13(13):2457–2477CrossRef Wu X, Zhang Z, Zhang W et al (2021) A convolutional neural network based on grouping structure for scene classification. Remote Sens 13(13):2457–2477CrossRef
13.
Zurück zum Zitat Hagag A, Omara I, Alfarra ANK, Mekawy F (2021) Handwritten chemical formulas classification model using deep transfer convolutional neural networks. In: International Conference on Electronic Engineering (ICEEM), pp 1–6 Hagag A, Omara I, Alfarra ANK, Mekawy F (2021) Handwritten chemical formulas classification model using deep transfer convolutional neural networks. In: International Conference on Electronic Engineering (ICEEM), pp 1–6
14.
Zurück zum Zitat Teli MN (2021) TeliNet, a simple and shallow convolution neural network (CNN) to classify CT scans of COVID-19 patients. arXiv:2107.04930 Teli MN (2021) TeliNet, a simple and shallow convolution neural network (CNN) to classify CT scans of COVID-19 patients. arXiv:​2107.​04930
15.
Zurück zum Zitat Shawky OA, Hagag A, El-Dahshan E et al (2020) Remote sensing image scene classification using CNN-MLP with data augmentation. Optik Int J Light Electron Opt 165356 Shawky OA, Hagag A, El-Dahshan E et al (2020) Remote sensing image scene classification using CNN-MLP with data augmentation. Optik Int J Light Electron Opt 165356
16.
Zurück zum Zitat He K, Gkioxari G, Dollr P et al (2017) Mask r-cnn. In: 2017 IEEE International Conference on Computer Vision (ICCV). IEEE, pp 2980–2988 He K, Gkioxari G, Dollr P et al (2017) Mask r-cnn. In: 2017 IEEE International Conference on Computer Vision (ICCV). IEEE, pp 2980–2988
17.
Zurück zum Zitat Liu B, Liu Q, Zhang T et al (2019) MSSTResNet-TLD: a robust tracking method based on tracking-learning-detection framework by using multi-scale spatio-temporal residual network feature model. Neurocomputing 175–194 Liu B, Liu Q, Zhang T et al (2019) MSSTResNet-TLD: a robust tracking method based on tracking-learning-detection framework by using multi-scale spatio-temporal residual network feature model. Neurocomputing 175–194
18.
Zurück zum Zitat Liu Z, Waqas M, Yang J et al (2021) A multi-task CNN for maritime target detection. IEEE Signal Process Lett 28:434–438CrossRef Liu Z, Waqas M, Yang J et al (2021) A multi-task CNN for maritime target detection. IEEE Signal Process Lett 28:434–438CrossRef
19.
Zurück zum Zitat Fan M, Tian S, Liu K et al (2021) Infrared small target detection based on region proposal and CNN classifier. SIViP 1–10 Fan M, Tian S, Liu K et al (2021) Infrared small target detection based on region proposal and CNN classifier. SIViP 1–10
20.
Zurück zum Zitat Hou F, Lei W, Li S et al (2021) Deep learning-based subsurface target detection from GPR scans. IEEE Sens J 21(6):8161–8171CrossRef Hou F, Lei W, Li S et al (2021) Deep learning-based subsurface target detection from GPR scans. IEEE Sens J 21(6):8161–8171CrossRef
21.
Zurück zum Zitat Mnih V, Kavukcuoglu K, Silver D et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533CrossRef Mnih V, Kavukcuoglu K, Silver D et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533CrossRef
22.
Zurück zum Zitat Silver D, Schrittwieser J, Simonyan K et al (2017) Mastering the game of Go without human knowledge. Nature 550(7676):354–359CrossRef Silver D, Schrittwieser J, Simonyan K et al (2017) Mastering the game of Go without human knowledge. Nature 550(7676):354–359CrossRef
23.
Zurück zum Zitat Zoughi T, Homayounpour MM (2019) A gender-aware deep neural network structure for speech recognition, Iranian Journal of Science and Technology-Transactions of. Electr Eng 43(3):635–644 Zoughi T, Homayounpour MM (2019) A gender-aware deep neural network structure for speech recognition, Iranian Journal of Science and Technology-Transactions of. Electr Eng 43(3):635–644
24.
Zurück zum Zitat Perdana BBSP, Irawan B, Setianingsih C (2019) Hate speech detection in indonesian language on instagram comment section using deep neural network classification method. In: 2019 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob). IEEE Perdana BBSP, Irawan B, Setianingsih C (2019) Hate speech detection in indonesian language on instagram comment section using deep neural network classification method. In: 2019 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob). IEEE
25.
Zurück zum Zitat Krishnan PT, Balasubramanian P (2019) Detection of alphabets for machine translation of sign language using deep neural net. In: 2019 International Conference on Data Science and Communication (IconDSC) Krishnan PT, Balasubramanian P (2019) Detection of alphabets for machine translation of sign language using deep neural net. In: 2019 International Conference on Data Science and Communication (IconDSC)
26.
Zurück zum Zitat Hinton GE, Sabour S, Frosst N (2018) Matrix capsules with EM routing. In: International Conference on Representation Learning Hinton GE, Sabour S, Frosst N (2018) Matrix capsules with EM routing. In: International Conference on Representation Learning
27.
Zurück zum Zitat Gonzalez RC, Wintz P (1997) Digital image processing. Addison-Wesley, New YorkMATH Gonzalez RC, Wintz P (1997) Digital image processing. Addison-Wesley, New YorkMATH
28.
Zurück zum Zitat Bhabatosh C (1977) Digital image processing and analysis. PHI Learning Pvt Ltd, New Delhi Bhabatosh C (1977) Digital image processing and analysis. PHI Learning Pvt Ltd, New Delhi
29.
Zurück zum Zitat Zhang XD (2017) Matrix analysis and applications. Cambridge University Press, CambridgeCrossRef Zhang XD (2017) Matrix analysis and applications. Cambridge University Press, CambridgeCrossRef
30.
Zurück zum Zitat Bouvrie J (2006) Notes on convolutional neural networks. Center for Biological and Computational Learning, Massachusetts, pp 38–44 Bouvrie J (2006) Notes on convolutional neural networks. Center for Biological and Computational Learning, Massachusetts, pp 38–44
31.
Zurück zum Zitat Xiao H, Rasul K, Vollgraf R (2017) Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv:1708.07747 Xiao H, Rasul K, Vollgraf R (2017) Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv:​1708.​07747
32.
Zurück zum Zitat Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning. Adv Neural Inf Process Syst 4–12 Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning. Adv Neural Inf Process Syst 4–12
33.
Zurück zum Zitat Nilsback ME, Zisserman A (2008) Automated flower classification over a large number of classes. In: Sixth Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 2008, Bhubaneswar, India, 16–19 December 2008. IEEE Nilsback ME, Zisserman A (2008) Automated flower classification over a large number of classes. In: Sixth Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 2008, Bhubaneswar, India, 16–19 December 2008. IEEE
Metadaten
Titel
Matrix-product neural network based on sequence block matrix product
verfasst von
Chuanhui Shan
Jun Ou
Xiumei Chen
Publikationsdatum
10.01.2022
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 6/2022
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-021-04194-5

Weitere Artikel der Ausgabe 6/2022

The Journal of Supercomputing 6/2022 Zur Ausgabe

Premium Partner