Skip to main content
Top
Published in: Neural Processing Letters 3/2022

21-04-2022

Multiview Objects Recognition Using Deep Learning-Based Wrap-CNN with Voting Scheme

Authors: D. Balamurugan, S. S. Aravinth, P. Chandra Shaker Reddy, Ajay Rupani, A. Manikandan

Published in: Neural Processing Letters | Issue 3/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Industrial automation effectively reduces the human effort in various activities of the industry. In many autonomous systems, object recognition plays a vital role. Thus, finding a solution for the accurate recognition of objection for the autonomous system is motivated among researchers. In this sense, various techniques have been designed with the support of classifiers and machine learning techniques. But those techniques lack their performance in the case of Multiview object recognition. It is found that a single classifier or machine learning algorithm is not enough to recognize Multiview objects accurately. In this paper, a Wrap Convolutional Neural Network (Wrap-CNN) with a voting scheme is proposed to solve the Multiview object recognition problem and attain better recognition accuracy. The proposed model consists of three phases such as pre-processing, pre-training CNNs and voting schemes. The pre-processing phase is done to remove the unwanted noise. These pre-trained CNN models are used as feature extractors and classify the images into their respective classes. Here, the Wrap-CNN, nine pre-trained CNN are used in parallels, such as Alex Net, VGGNet, GoogLeNet, Inceptionv3, SqueezeNet, ResNet v2, Xception, MobileNetV2 and ShuffleNet. Finally, the output class from the nine predicted classes is chosen based voting scheme. The system was tested in two scenarios, such as images without rotation and with rotation. The overall accuracy is 99% and 93% for without rotation and with rotation recognition, respectively. Ultimately the system proves the effectiveness for the Multiview object recognition, which can be used for the industrial automation system.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Li H, H Lu, Z Lin, X Shen, and B Price (2015) Lcnn: low-level feature embedded CNN for salient object detection. arXiv preprint Li H, H Lu, Z Lin, X Shen, and B Price (2015) Lcnn: low-level feature embedded CNN for salient object detection. arXiv preprint
2.
go back to reference Lowe DG (2001) Local feature view clustering for 3D object recognition. In: proceedings of the 2001 IEEE computer society conference on computer vision and pattern recognition. CVPR 2001, 1: I-I). IEEE Lowe DG (2001) Local feature view clustering for 3D object recognition. In: proceedings of the 2001 IEEE computer society conference on computer vision and pattern recognition. CVPR 2001, 1: I-I). IEEE
3.
go back to reference Thomas A, Ferrar V, Leibe B, Tuytelaars T, Schiel B. and Gool LV. (2006) Towards multiview object class detection. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), IEEE, 2: 1589–1596 Thomas A, Ferrar V, Leibe B, Tuytelaars T, Schiel B. and Gool LV. (2006) Towards multiview object class detection. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), IEEE, 2: 1589–1596
4.
go back to reference Pepik B, Stark M, Gehler P, Schiele B (2015) Multiview and 3d deformable part models. IEEE Trans Pattern Anal Mach Intell 37(11):2232–2245CrossRef Pepik B, Stark M, Gehler P, Schiele B (2015) Multiview and 3d deformable part models. IEEE Trans Pattern Anal Mach Intell 37(11):2232–2245CrossRef
5.
go back to reference Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
6.
go back to reference Simonyan K and Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint Simonyan K and Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint
7.
go back to reference Karthick S, Maniraj S (2019) Different medical image registration techniques: a comparative analysis. Curr Med Imaging 15(10):911–921CrossRef Karthick S, Maniraj S (2019) Different medical image registration techniques: a comparative analysis. Curr Med Imaging 15(10):911–921CrossRef
8.
go back to reference Wu Z, Song S, Khosla A, Yu F, Zhang L, Tang X and Xiao J (2015) 3d shapenets: a deep representation for volumetric shapes. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1912–1920 Wu Z, Song S, Khosla A, Yu F, Zhang L, Tang X and Xiao J (2015) 3d shapenets: a deep representation for volumetric shapes. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1912–1920
9.
go back to reference Johns E, Aodha OM and Brostow GJ (2015) Becoming the expert-interactive multi-class machine teaching. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2616–2624 Johns E, Aodha OM and Brostow GJ (2015) Becoming the expert-interactive multi-class machine teaching. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2616–2624
10.
go back to reference Su H, Maji S, Kalogerakis E and Learned-Miller E (2015) Multi-view convolutional neural networks for 3d shape recognition. In: proceedings of the IEEE international conference on computer vision, pp. 945–953 Su H, Maji S, Kalogerakis E and Learned-Miller E (2015) Multi-view convolutional neural networks for 3d shape recognition. In: proceedings of the IEEE international conference on computer vision, pp. 945–953
11.
go back to reference Muneeswaran K, Ganesan L, Arumugam S, Soundar KR (2005) Texture classification with combined rotation and scale invariant wavelet features. Pattern Recogn 38(10):1495–1506CrossRef Muneeswaran K, Ganesan L, Arumugam S, Soundar KR (2005) Texture classification with combined rotation and scale invariant wavelet features. Pattern Recogn 38(10):1495–1506CrossRef
12.
go back to reference Manipoonchelvi P, Muneeswaran K (2014) Multi region based image retrieval system. Sadhana 39(2):333–344CrossRef Manipoonchelvi P, Muneeswaran K (2014) Multi region based image retrieval system. Sadhana 39(2):333–344CrossRef
13.
go back to reference Manipoonchelvi P, Muneeswaran K (2015) Significant region-based image retrieval. SIViP 9(8):1795–1804CrossRef Manipoonchelvi P, Muneeswaran K (2015) Significant region-based image retrieval. SIViP 9(8):1795–1804CrossRef
14.
go back to reference Yang Y, Zhang W, Xie Y (2015) Image automatic annotation via multiview deep representation. J Vis Commun Image Represent 33:368–377CrossRef Yang Y, Zhang W, Xie Y (2015) Image automatic annotation via multiview deep representation. J Vis Commun Image Represent 33:368–377CrossRef
15.
go back to reference Shi B, Bai S, Zhou Z, Bai X (2015) Deeppano: deep panoramic representation for 3-d shape recognition. IEEE Signal Process Lett 22(12):2339–2343CrossRef Shi B, Bai S, Zhou Z, Bai X (2015) Deeppano: deep panoramic representation for 3-d shape recognition. IEEE Signal Process Lett 22(12):2339–2343CrossRef
16.
go back to reference Khan S, Hayat M, Bennamoun M, Sohel FA, Togneri R (2017) Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Trans Neural Netw Learn Syst 29(8):3573–3587 Khan S, Hayat M, Bennamoun M, Sohel FA, Togneri R (2017) Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Trans Neural Netw Learn Syst 29(8):3573–3587
17.
go back to reference Yan Y, Chen M, Shyu ML and Chen SC (2015) Deep learning for imbalanced multimedia data classification. In: 2015 IEEE international symposium on multimedia (ISM), IEEE, pp. 483–488 Yan Y, Chen M, Shyu ML and Chen SC (2015) Deep learning for imbalanced multimedia data classification. In: 2015 IEEE international symposium on multimedia (ISM), IEEE, pp. 483–488
18.
go back to reference Huang C, Li Y, Loy CC and Tang X (2016) Learning deep representation for imbalanced classification. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5375–5384 Huang C, Li Y, Loy CC and Tang X (2016) Learning deep representation for imbalanced classification. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5375–5384
19.
go back to reference Wu H, Prasad S (2017) Semi-supervised deep learning using pseudo labels for hyperspectral image classification. IEEE Trans Image Process 27(3):1259–1270MathSciNetCrossRef Wu H, Prasad S (2017) Semi-supervised deep learning using pseudo labels for hyperspectral image classification. IEEE Trans Image Process 27(3):1259–1270MathSciNetCrossRef
20.
go back to reference Tang C, Ling Y, Yang X, Jin W, Zheng C (2018) Multiview object detection based on deep learning. Appl Sci 8(9):1423CrossRef Tang C, Ling Y, Yang X, Jin W, Zheng C (2018) Multiview object detection based on deep learning. Appl Sci 8(9):1423CrossRef
21.
go back to reference Rocco I, Arandjelovic R and Sivic J (2017) Convolutional neural network architecture for geometric matching. In: proceedings of the IEEE conference on computer vision and pattern recognition pp. 6148–6157 Rocco I, Arandjelovic R and Sivic J (2017) Convolutional neural network architecture for geometric matching. In: proceedings of the IEEE conference on computer vision and pattern recognition pp. 6148–6157
22.
go back to reference Wang L, Wang L, Lu H, Zhang P, Ruan X (2018) Salient object detection with recurrent fully convolutional networks. IEEE Trans Pattern Anal Mach Intell 41(7):1734–1746CrossRef Wang L, Wang L, Lu H, Zhang P, Ruan X (2018) Salient object detection with recurrent fully convolutional networks. IEEE Trans Pattern Anal Mach Intell 41(7):1734–1746CrossRef
23.
go back to reference Shi W, van de Zedde R, Jiang H, Kootstra G (2019) Plant-part segmentation using deep learning and multiview vision. Biosyst Eng 187:81–95CrossRef Shi W, van de Zedde R, Jiang H, Kootstra G (2019) Plant-part segmentation using deep learning and multiview vision. Biosyst Eng 187:81–95CrossRef
24.
go back to reference Koohzadi M, Charkari NM, Ghaderi F (2020) Unsupervised representation learning based on the deep multiview ensemble learning. Appl Intell 50(2):562–581CrossRef Koohzadi M, Charkari NM, Ghaderi F (2020) Unsupervised representation learning based on the deep multiview ensemble learning. Appl Intell 50(2):562–581CrossRef
25.
go back to reference Gao Z, Wang DY, Xue YB, Xu GP, Zhang H, Wang YL (2018) 3D object recognition based on pairwise multiview convolutional neural networks. J Vis Commun Image Represent 56:305–315CrossRef Gao Z, Wang DY, Xue YB, Xu GP, Zhang H, Wang YL (2018) 3D object recognition based on pairwise multiview convolutional neural networks. J Vis Commun Image Represent 56:305–315CrossRef
27.
go back to reference Zhu C, Miao D, Wang Z, Zhou R, Wei L, Zhang X (2020) Global and local multiview multilabel learning. Neurocomputing 371:67–77CrossRef Zhu C, Miao D, Wang Z, Zhou R, Wei L, Zhang X (2020) Global and local multiview multilabel learning. Neurocomputing 371:67–77CrossRef
28.
go back to reference Zhu XF, Li XL, Zhang SC (2016) Block-row sparse multiview multilabel learning for image classification. IEEE Trans Cybern 46(2):450–461CrossRef Zhu XF, Li XL, Zhang SC (2016) Block-row sparse multiview multilabel learning for image classification. IEEE Trans Cybern 46(2):450–461CrossRef
29.
go back to reference Q.Y. Tan, G.X. Yu, C. Domeniconi, J. Wang, and Z.L. Zhang, (2018) Multi-view weak-label learning based on matrix completion. In: proceedings of the 2018 SIAM international conference on data mining (SIAM 2018), pp. 450–458 Q.Y. Tan, G.X. Yu, C. Domeniconi, J. Wang, and Z.L. Zhang, (2018) Multi-view weak-label learning based on matrix completion. In: proceedings of the 2018 SIAM international conference on data mining (SIAM 2018), pp. 450–458
30.
go back to reference Qian BY, Wang X, Ye JP, Davidson I (2015) A reconstruction error based framework for multilabel and multiview learning. IEEE Trans Knowl Data Eng 27(3):594–607CrossRef Qian BY, Wang X, Ye JP, Davidson I (2015) A reconstruction error based framework for multilabel and multiview learning. IEEE Trans Knowl Data Eng 27(3):594–607CrossRef
31.
go back to reference Nie FP, Tian L, Wang R, Li XL (2020) Multiview semi-supervised learning model for image classification. IEEE Trans Knowl Data Eng 32(12):2389–2400CrossRef Nie FP, Tian L, Wang R, Li XL (2020) Multiview semi-supervised learning model for image classification. IEEE Trans Knowl Data Eng 32(12):2389–2400CrossRef
32.
go back to reference Li H, Lin Z, Shen X, Brandt J. and Hua G (2015) A convolutional neural network cascade for face detection. In: procedings of the IEEE conference on computer vision and pattern recognition, pp. 5325–5334 Li H, Lin Z, Shen X, Brandt J. and Hua G (2015) A convolutional neural network cascade for face detection. In: procedings of the IEEE conference on computer vision and pattern recognition, pp. 5325–5334
33.
go back to reference Ding C, Tao D (2017) Trunk-branch ensemble convolutional neural networks for video-based face recognition. IEEE Trans Pattern Anal Mach Intell 40(4):1002–1014CrossRef Ding C, Tao D (2017) Trunk-branch ensemble convolutional neural networks for video-based face recognition. IEEE Trans Pattern Anal Mach Intell 40(4):1002–1014CrossRef
34.
go back to reference Szegedy C, Vanhoucke V, Ioffe S, Shlens J. and Wojna Z (2016) Rethinking the inception architecture for computer vision. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826 Szegedy C, Vanhoucke V, Ioffe S, Shlens J. and Wojna Z (2016) Rethinking the inception architecture for computer vision. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826
35.
go back to reference Iandola FN, Han S, Moskewicz MW, Ashraf K, Dally WJ and Keutzer K (2016) SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size. arXiv preprint Iandola FN, Han S, Moskewicz MW, Ashraf K, Dally WJ and Keutzer K (2016) SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size. arXiv preprint
36.
go back to reference He K, Zhang X, Ren S and Sun J. (2016). Deep residual learning for image recognition. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778 He K, Zhang X, Ren S and Sun J. (2016). Deep residual learning for image recognition. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778
37.
go back to reference Szegedy C, Ioffe S, Vanhoucke V and Alemi A. (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: proceedings of the AAAI conference on artificial intelligence 31(1) Szegedy C, Ioffe S, Vanhoucke V and Alemi A. (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: proceedings of the AAAI conference on artificial intelligence 31(1)
38.
go back to reference Chollet F (2017) Xception: deep learning with depthwise separable convolutions. In: proceedings of the EEE conference on computer vision and pattern recognition, pp. 1251–1258 Chollet F (2017) Xception: deep learning with depthwise separable convolutions. In: proceedings of the EEE conference on computer vision and pattern recognition, pp. 1251–1258
39.
go back to reference Sandler M, Howard A, Zhu M, Zhmoginov A and Chen LC (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510–4520 Sandler M, Howard A, Zhu M, Zhmoginov A and Chen LC (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510–4520
40.
go back to reference Zhang X, Zhou X, Lin M. and Sun J (2018) Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6848–6856 Zhang X, Zhou X, Lin M. and Sun J (2018) Shufflenet: an extremely efficient convolutional neural network for mobile devices. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6848–6856
41.
go back to reference Krogh A, Vedelsby J (1995) Validation, and active learning. Adv Neural Inf Process Syst 7(7):231 Krogh A, Vedelsby J (1995) Validation, and active learning. Adv Neural Inf Process Syst 7(7):231
42.
go back to reference Deng J, Dong W, Socher R. Li LJ, Li K and Fei-Fei L (2009). Imagenet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition, pp. 248–255. IEEE Deng J, Dong W, Socher R. Li LJ, Li K and Fei-Fei L (2009). Imagenet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition, pp. 248–255. IEEE
43.
go back to reference Wang J, Yang Y, Mao J, Huang Z, Huang C and Xu, W. (2016). Cnn-rnn: a unified framework for multilabel image classification. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294 Wang J, Yang Y, Mao J, Huang Z, Huang C and Xu, W. (2016). Cnn-rnn: a unified framework for multilabel image classification. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294
44.
go back to reference Wu XZ and Zhou ZH (2017) A unified view of multilabel performance measures. In: international conference on machine learning, pp. 3780–3788, PMLR Wu XZ and Zhou ZH (2017) A unified view of multilabel performance measures. In: international conference on machine learning, pp. 3780–3788, PMLR
47.
go back to reference Sengan S, Prabhu LAJ, Ramachandran V, Priya V, Ravi L, Subramaniyaswamy V (2020) Images super-resolution by optimal deep AlexNet architecture for medical application: a novel DOCALN. J Intell Fuzzy Syst 39(6):8259–8272CrossRef Sengan S, Prabhu LAJ, Ramachandran V, Priya V, Ravi L, Subramaniyaswamy V (2020) Images super-resolution by optimal deep AlexNet architecture for medical application: a novel DOCALN. J Intell Fuzzy Syst 39(6):8259–8272CrossRef
48.
go back to reference Özyurt F (2020) A fused CNN model for WBC detection with MRMR feature selection and extreme learning machine. Soft Comput 24(11):8163–8172CrossRef Özyurt F (2020) A fused CNN model for WBC detection with MRMR feature selection and extreme learning machine. Soft Comput 24(11):8163–8172CrossRef
50.
go back to reference Liu Y, B Fan, S Xiang, and C Pan (2019) Relation-shape convolutional neural network for point cloud analysis. In: proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8895–8904 Liu Y, B Fan, S Xiang, and C Pan (2019) Relation-shape convolutional neural network for point cloud analysis. In: proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8895–8904
Metadata
Title
Multiview Objects Recognition Using Deep Learning-Based Wrap-CNN with Voting Scheme
Authors
D. Balamurugan
S. S. Aravinth
P. Chandra Shaker Reddy
Ajay Rupani
A. Manikandan
Publication date
21-04-2022
Publisher
Springer US
Published in
Neural Processing Letters / Issue 3/2022
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-021-10679-4

Other articles of this Issue 3/2022

Neural Processing Letters 3/2022 Go to the issue