nach oben

The Journal of Supercomputing

Erschienen in:

10.01.2022

Matrix-product neural network based on sequence block matrix product

verfasst von: Chuanhui Shan, Jun Ou, Xiumei Chen

Erschienen in: The Journal of Supercomputing | Ausgabe 6/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Convolution neural networks (CNNs) based on the discrete convolutional operation have achieved great success in image processing, voice and audio processing, natural language processing and other fields. However, it is still an open problem how to develop new models instead of CNNs. Using the idea of the sequence block matrix product, we propose a novel operation and its corresponding neural network, namely two-dimensional discrete matrix-product operation (TDDMPO) and matrix-product neural network (MPNN). We present the definition of the TDDMPO, a series of its properties and matrix-product theorem in detail, and then construct its corresponding MPNN. Experimental results on Fashion-MNIST, SVHN, FLOWER17 and FLOWER102 datasets show that MPNNs obtain 1.65–13.04% relative performance improvement in comparison with the corresponding CNNs, and the amount of calculation of matrix-product layers of MPNNs obtains 41× to 57× reduction in comparison with the corresponding convolutional layers of CNNs. Hence, it is a potential model that may open some new directions for deep neural networks, particularly alternatives to CNNs.

Vorheriger Artikel An optimized hardware design of a two-dimensional guide filter and its application in image denoising

Nächster Artikel Energy optimization for CAN bus and media controls in electric vehicles using deep learning algorithms

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Nur mit Berechtigung zugänglich

Hubel DH, Wiesel T (1962) Receptive fields, binocular interaction, and functional architecture in the cats visual cortex. J Physiol 160(1):106–154CrossRef

Wiesel T, Hubel DH (1959) Receptive fields of single neurons in the cats striate cortex. J Physiol 148(3):574–591CrossRef

Fukushima K (1979) Neural network model for a mechanism of pattern recognition unaffected by shift in position-Neocognitron. IEICE Techn Rep 62(10):658–665

Fukushima K (1980) Neocognitron: a self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36(4):193–202CrossRef

Fukushima K (2013) Artificial vision by multi-layered neural networks: neocognitron and its advances. Neural Netw 37:103–119CrossRef

LeCun Y, Boser B, Denker JS et al (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef

LeCun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef

Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst, pp 1097–1105

Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci

10.

Russakovsky O, Deng J, Su H et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef

11.

Wengrowski E, Purri M, Dana K et al (2019) Deep CNNs as a method to classify rotating objects based on monostatic RCS. IET Radar Sonar Navig 13(7):1092–1100CrossRef

12.

Wu X, Zhang Z, Zhang W et al (2021) A convolutional neural network based on grouping structure for scene classification. Remote Sens 13(13):2457–2477CrossRef

13.

Hagag A, Omara I, Alfarra ANK, Mekawy F (2021) Handwritten chemical formulas classification model using deep transfer convolutional neural networks. In: International Conference on Electronic Engineering (ICEEM), pp 1–6

14.

Teli MN (2021) TeliNet, a simple and shallow convolution neural network (CNN) to classify CT scans of COVID-19 patients. arXiv:2107.04930

15.

Shawky OA, Hagag A, El-Dahshan E et al (2020) Remote sensing image scene classification using CNN-MLP with data augmentation. Optik Int J Light Electron Opt 165356

16.

He K, Gkioxari G, Dollr P et al (2017) Mask r-cnn. In: 2017 IEEE International Conference on Computer Vision (ICCV). IEEE, pp 2980–2988

17.

Liu B, Liu Q, Zhang T et al (2019) MSSTResNet-TLD: a robust tracking method based on tracking-learning-detection framework by using multi-scale spatio-temporal residual network feature model. Neurocomputing 175–194

18.

Liu Z, Waqas M, Yang J et al (2021) A multi-task CNN for maritime target detection. IEEE Signal Process Lett 28:434–438CrossRef

19.

Fan M, Tian S, Liu K et al (2021) Infrared small target detection based on region proposal and CNN classifier. SIViP 1–10

20.

Hou F, Lei W, Li S et al (2021) Deep learning-based subsurface target detection from GPR scans. IEEE Sens J 21(6):8161–8171CrossRef

21.

Mnih V, Kavukcuoglu K, Silver D et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533CrossRef

22.

Silver D, Schrittwieser J, Simonyan K et al (2017) Mastering the game of Go without human knowledge. Nature 550(7676):354–359CrossRef

23.

Zoughi T, Homayounpour MM (2019) A gender-aware deep neural network structure for speech recognition, Iranian Journal of Science and Technology-Transactions of. Electr Eng 43(3):635–644

24.

Perdana BBSP, Irawan B, Setianingsih C (2019) Hate speech detection in indonesian language on instagram comment section using deep neural network classification method. In: 2019 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob). IEEE

25.

Krishnan PT, Balasubramanian P (2019) Detection of alphabets for machine translation of sign language using deep neural net. In: 2019 International Conference on Data Science and Communication (IconDSC)

26.

Hinton GE, Sabour S, Frosst N (2018) Matrix capsules with EM routing. In: International Conference on Representation Learning

27.

Gonzalez RC, Wintz P (1997) Digital image processing. Addison-Wesley, New YorkMATH

28.

Bhabatosh C (1977) Digital image processing and analysis. PHI Learning Pvt Ltd, New Delhi

29.

Zhang XD (2017) Matrix analysis and applications. Cambridge University Press, CambridgeCrossRef

30.

Bouvrie J (2006) Notes on convolutional neural networks. Center for Biological and Computational Learning, Massachusetts, pp 38–44

31.

Xiao H, Rasul K, Vollgraf R (2017) Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv:1708.07747

32.

Netzer Y, Wang T, Coates A et al (2011) Reading digits in natural images with unsupervised feature learning. Adv Neural Inf Process Syst 4–12

33.

Nilsback ME, Zisserman A (2008) Automated flower classification over a large number of classes. In: Sixth Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 2008, Bhubaneswar, India, 16–19 December 2008. IEEE

Titel: Matrix-product neural network based on sequence block matrix product
verfasst von: Chuanhui Shan
Jun Ou
Xiumei Chen
Publikationsdatum: 10.01.2022
Verlag: Springer US
Erschienen in: The Journal of Supercomputing / Ausgabe 6/2022
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI: https://doi.org/10.1007/s11227-021-04194-5

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Weitere Artikel der Ausgabe 6/2022

Resource pricing and offloading decisions in mobile edge computing based on the Stackelberg game

Human pose, hand and mesh estimation using deep learning: a survey

Kinematic and dynamic control model of wheeled mobile robot under internet of things and neural network

Evaluating low-level software-based hardening techniques for configurable GPU architectures

Correction to: AnonSURP: an anonymous and secure ultralightweight RFID protocol for deployment in internet of vehicles systems

OHUQI: Mining on-shelf high-utility quantitative itemsets

Premium Partner