Skip to main content
Top
Published in: Pattern Recognition and Image Analysis 1/2023

01-03-2023 | APPLIED PROBLEMS

Fine-Grained Object Recognition Using a Combination Model of Navigator–Teacher–Scrutinizer and Spinal Networks

Authors: Nurhasanah, Yulianto, Gede Putra Kusuma

Published in: Pattern Recognition and Image Analysis | Issue 1/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Fine-grained object recognition aims to recognize objects with a large variety of intraclass and low variations between classes. To overcome this problem, using a simple model may hard to find more discriminative parts. Thus, we proposed a combination model of navigator–teacher–scrutinizer and spinal networks to improve accuracy. Employing two feature extractors, residual networks with 50 and 101 layers deep, and replacing the basic fully connected layer with spinal network outperform the baseline results on Stanford Cars, Fine-Grained Visual Classification of Aircraft, and 275 Bird Species datasets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference P. Chopra, “ProgressiveSpinalNet architecture for FC layers,” (2021). arXiv:2103.11373 [cs.LG] P. Chopra, “ProgressiveSpinalNet architecture for FC layers,” (2021). arXiv:2103.11373 [cs.LG]
2.
go back to reference A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, N. Houlsby, “An image is worth 16x16 words: Transformers for image recognition at scale,” (2020). arXiv:2010.11929 [cs.CV] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, N. Houlsby, “An image is worth 16x16 words: Transformers for image recognition at scale,” (2020). arXiv:2010.11929 [cs.CV]
3.
go back to reference W. Geng, F. Han, J. Lin, L. Zhu, J. Bai, S. Wang, L. He, Q. Xiao, and Zh. Lai, “Fine-grained grocery product recognition by one-shot learning,” in MM ’18: Proc. 26th ACM Int. Conf. on Multimedia, Seoul, 2018 (Association for Computing Machinery, New York, 2018), Vol. 2, pp. 1706–1714. https://doi.org/10.1145/3240508.3240522 W. Geng, F. Han, J. Lin, L. Zhu, J. Bai, S. Wang, L. He, Q. Xiao, and Zh. Lai, “Fine-grained grocery product recognition by one-shot learning,” in MM ’18: Proc. 26th ACM Int. Conf. on Multimedia, Seoul, 2018 (Association for Computing Machinery, New York, 2018), Vol. 2, pp. 1706–1714. https://​doi.​org/​10.​1145/​3240508.​3240522
4.
go back to reference Gerry, “285 Bird Species - Classification,” (2019). https://www.kaggle.com/gpiosenka/100-bird-species. Gerry, “285 Bird Species - Classification,” (2019). https://​www.​kaggle.​com/​gpiosenka/​100-bird-species.​
6.
go back to reference M. Grandini, E. Bagli, and G. Visani, “Metrics for multi-class classification: An overview,” (2020). arXiv:2008.05756 [stat.ML] M. Grandini, E. Bagli, and G. Visani, “Metrics for multi-class classification: An overview,” (2020). arXiv:2008.05756 [stat.ML]
14.
go back to reference H. Li, P. Chaudhari, H. Yang, M. Lam, A. Ravichandran, R. Bhotika, and S. Soatto, “Rethinking the hyperparameters for fine-tuning,” in Interational Conf. on Learning Representations (ICLR), 2020 (2020), pp. 165–184. arXiv:2002.11770 [cs.CV] H. Li, P. Chaudhari, H. Yang, M. Lam, A. Ravichandran, R. Bhotika, and S. Soatto, “Rethinking the hyperparameters for fine-tuning,” in Interational Conf. on Learning Representations (ICLR), 2020 (2020), pp. 165–184. arXiv:2002.11770 [cs.CV]
15.
go back to reference X. Liu, T. Xia, J. Wang, Y. Yang, F. Zhou, and Y. Lin, “Fully convolutional attention networks for fine-grained recognition,” (2016). arXiv:1603.06765v4 [cs.CV] X. Liu, T. Xia, J. Wang, Y. Yang, F. Zhou, and Y. Lin, “Fully convolutional attention networks for fine-grained recognition,” (2016). arXiv:1603.06765v4 [cs.CV]
16.
go back to reference S. Maji, E. Rahtu, J. Kannala, M. Blaschko, and A. Vedaldi, “Fine-grained visual classification of aircraft,” (2013). arXiv:1306.5151v1 [cs.CV] S. Maji, E. Rahtu, J. Kannala, M. Blaschko, and A. Vedaldi, “Fine-grained visual classification of aircraft,” (2013). arXiv:1306.5151v1 [cs.CV]
20.
go back to reference C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, “The caltech-ucsd birds-200-2011 dataset,” (2011). C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, “The caltech-ucsd birds-200-2011 dataset,” (2011).
24.
go back to reference S. R. Young, D. C. Rose, T. P. Karnowski, S. H. Lim, and R. M. Patton, “Optimizing deep learning hyper-parameters through an evolutionary algorithm,” in Proc. Workshop on Machine Learning in High-Performance Computing Environments, Austin, Texas, 2015 (Association for Computing Machinery, New York, 2015), p. 4. https://doi.org/10.1145/2834892.2834896 S. R. Young, D. C. Rose, T. P. Karnowski, S. H. Lim, and R. M. Patton, “Optimizing deep learning hyper-parameters through an evolutionary algorithm,” in Proc. Workshop on Machine Learning in High-Performance Computing Environments, Austin, Texas, 2015 (Association for Computing Machinery, New York, 2015), p. 4. https://​doi.​org/​10.​1145/​2834892.​2834896
25.
Metadata
Title
Fine-Grained Object Recognition Using a Combination Model of Navigator–Teacher–Scrutinizer and Spinal Networks
Authors
Nurhasanah
Yulianto
Gede Putra Kusuma
Publication date
01-03-2023
Publisher
Pleiades Publishing
Published in
Pattern Recognition and Image Analysis / Issue 1/2023
Print ISSN: 1054-6618
Electronic ISSN: 1555-6212
DOI
https://doi.org/10.1134/S1054661822040083

Other articles of this Issue 1/2023

Pattern Recognition and Image Analysis 1/2023 Go to the issue

PATTERN RECOGNITION AND IMAGE ANALYSIS AUTOMATED SYSTEMS, HARDWARE AND SOFTWARE

Pattern-Recognition Tools and Their Applications

MATHEMATICAL THEORY OF IMAGES AND SIGNALS REPRESENTING, PROCESSING, ANALYSIS, RECOGNITION AND UNDERSTANDING

Controllable Image Caption Based on Adaptive Weight and Optimization Strategy

ARTIFICIAL INTELLIGENCE TECHNIQUES IN PATTERN RECOGNITION AND IMAGE ANALYSIS

Term Frequency and Estimating the Closeness of Short Texts to the Semantic Standard

MATHEMATICAL THEORY OF IMAGES AND SIGNALS REPRESENTING, PROCESSING, ANALYSIS, RECOGNITION AND UNDERSTANDING

Anisotropic Localized Wavelets for Image Processing

Premium Partner