Pattern Recognition and Image Analysis

01-03-2023 | APPLIED PROBLEMS

Fine-Grained Object Recognition Using a Combination Model of Navigator–Teacher–Scrutinizer and Spinal Networks

Authors: Nurhasanah, Yulianto, Gede Putra Kusuma

Published in: Pattern Recognition and Image Analysis | Issue 1/2023

Abstract

Fine-grained object recognition aims to distinguish objects that show large intraclass variation and small interclass variation. A simple model may struggle to locate the most discriminative parts of such objects. We therefore propose a model that combines navigator–teacher–scrutinizer and spinal networks to improve accuracy. Using two feature extractors, residual networks with 50 and 101 layers, and replacing the basic fully connected layer with a spinal network outperforms the baseline results on the Stanford Cars, Fine-Grained Visual Classification of Aircraft, and 275 Bird Species datasets.
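
To make the idea of swapping a fully connected layer for a spinal network more concrete, the following is a minimal sketch of a SpinalNet-style classification head in plain Python. It follows the general SpinalNet scheme (the input is split into two parts, each sublayer sees one part plus the previous sublayer's output, and all sublayer outputs feed the final layer). It is not the authors' implementation: the class name `SpinalHead`, the width, the number of sublayers, and the random weights are all assumptions chosen for illustration, and the input list stands in for a feature vector produced by a backbone such as a residual network.

```python
import random


def dense(inputs, weights, biases, relu=True):
    """Apply one fully connected layer; `weights` holds one row per output unit."""
    outputs = []
    for row, bias in zip(weights, biases):
        z = sum(w * x for w, x in zip(row, inputs)) + bias
        outputs.append(max(z, 0.0) if relu else z)
    return outputs


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (illustrative initialisation only)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class SpinalHead:
    """Hypothetical SpinalNet-style replacement for a single fully connected layer."""

    def __init__(self, n_features, n_classes, width=8, n_sublayers=4, seed=0):
        rng = random.Random(seed)
        self.half = n_features // 2
        self.layers = []
        for i in range(n_sublayers):
            # Even sublayers read the first part of the input, odd ones the second.
            part = self.half if i % 2 == 0 else n_features - self.half
            # Every sublayer after the first also reads the previous sublayer's output.
            n_in = part + (width if i > 0 else 0)
            self.layers.append(init_layer(n_in, width, rng))
        # The output layer reads the concatenated outputs of all sublayers.
        self.out = init_layer(width * n_sublayers, n_classes, rng)

    def forward(self, features):
        features = list(features)
        parts = (features[:self.half], features[self.half:])
        hidden, prev = [], []
        for i, (weights, biases) in enumerate(self.layers):
            prev = dense(parts[i % 2] + prev, weights, biases)
            hidden.extend(prev)
        weights, biases = self.out
        return dense(hidden, weights, biases, relu=False)  # raw class scores


if __name__ == "__main__":
    head = SpinalHead(n_features=10, n_classes=3)
    print(head.forward([0.5] * 10))  # three unnormalised class scores
```

In a real system the same structure would be written with a deep learning framework and trained end to end; the sketch only shows how the input is fed in gradually instead of all at once.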

Title: Fine-Grained Object Recognition Using a Combination Model of Navigator–Teacher–Scrutinizer and Spinal Networks
Authors: Nurhasanah, Yulianto, Gede Putra Kusuma
Publication date: 01-03-2023
Publisher: Pleiades Publishing
Published in: Pattern Recognition and Image Analysis / Issue 1/2023
Print ISSN: 1054-6618
Electronic ISSN: 1555-6212
DOI: https://doi.org/10.1134/S1054661822040083
