Skip to main content
Top
Published in: Cognitive Computation 5/2023

25-05-2023

Few-Shot Object Detection with Weight Imprinting

Authors: Dingtian Yan, Jitao Huang, Hai Sun, Fuqiang Ding

Published in: Cognitive Computation | Issue 5/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The goal of few-shot learning is to learn a solution to a problem from limited training samples. In recent years, with the promotion and application of deep neural network–based vision algorithms, the problem of data scarcity has become increasingly prominent. This has prompted comprehensive study on few-shot learning algorithms among academic and industrial communities. This paper first analyzes the bias phenomenon of proposal estimation in the classic transfer learning few-shot object detection paradigm, and then proposes an improved scheme that combines weight imprinting and model decoupling. On the one hand, we extend the weight imprinting algorithm on the general Faster R-CNN framework to enhance the fine-tuning performance; on the other hand, we exploit model decoupling to minimize the over-fitting in data-scarce scenarios. Our proposed method achieves 12.3, 15.0, and 18.9 (nAP) top accuracy on novel set of COCO under 5-shot, 10-shot, and 30-shot settings, and achieves 57.7 and 60.2 (nAP50) top accuracy on novel set of VOC Split 3 under 5-shot and 10-shot settings. Compared with the latest published studies, our proposed method provides a competitive detection performance on novel categories only via fine-tuning. Moreover, it retains the original architecture of the network and is practical in real industrial scenarios.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Wang X, Huang TE, Darrell T, Gonzalez JE, Yu F. Frustratingly simple few-shot object detection. In: Proceedings of the 37th International Conference on Machine Learning. 2020 Jul 13. p. 9919–28. Wang X, Huang TE, Darrell T, Gonzalez JE, Yu F. Frustratingly simple few-shot object detection. In: Proceedings of the 37th International Conference on Machine Learning. 2020 Jul 13. p. 9919–28.
2.
go back to reference Han G, Huang S, Ma J, He Y, Chang SF. Meta faster r-cnn: towards accurate few-shot object detection with attentive feature alignment. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2022 Jun 28. Vol. 36, No. 1, p. 780–9. Han G, Huang S, Ma J, He Y, Chang SF. Meta faster r-cnn: towards accurate few-shot object detection with attentive feature alignment. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2022 Jun 28. Vol. 36, No. 1, p. 780–9.
3.
go back to reference Hsieh TI, Lo YC, Chen HT, Liu TL. One-shot object detection with co-attention and co-excitation. Adv Neural Inf Process Syst. 2019;32. Hsieh TI, Lo YC, Chen HT, Liu TL. One-shot object detection with co-attention and co-excitation. Adv Neural Inf Process Syst. 2019;32.
4.
go back to reference Chen X, Jiang M, Zhao Q. Leveraging bottom-up and top-down attention for few-shot object detection. arXiv preprint; 2020 Jul 23. arXiv:2007.12104. Chen X, Jiang M, Zhao Q. Leveraging bottom-up and top-down attention for few-shot object detection. arXiv preprint; 2020 Jul 23. arXiv:​2007.​12104.
5.
go back to reference Yan X, Chen Z, Xu A, Wang X, Liang X, Lin L. Meta r-cnn: towards general solver for instance-level low-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019. p. 9577–86. Yan X, Chen Z, Xu A, Wang X, Liang X, Lin L. Meta r-cnn: towards general solver for instance-level low-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019. p. 9577–86.
6.
7.
go back to reference Qiao L, Zhao Y, Li Z, Qiu X, Wu J, Zhang C. Defrcn: decoupled faster r-cnn for few-shot object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. p. 8681–90. Qiao L, Zhao Y, Li Z, Qiu X, Wu J, Zhang C. Defrcn: decoupled faster r-cnn for few-shot object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. p. 8681–90.
8.
go back to reference Chen WY, Liu YC, Kira Z, Wang YC, Huang JB. A closer look at few-shot classification. In: International Conference on Learning Representations. 2019 May. Chen WY, Liu YC, Kira Z, Wang YC, Huang JB. A closer look at few-shot classification. In: International Conference on Learning Representations. 2019 May.
9.
go back to reference Wang Y, Yao Q, Kwok JT, Ni LM. Generalizing from a few examples: a survey on few-shot learning. ACM Comput Surv (csur). 2020 Jun 12;53(3):1–34. Wang Y, Yao Q, Kwok JT, Ni LM. Generalizing from a few examples: a survey on few-shot learning. ACM Comput Surv (csur). 2020 Jun 12;53(3):1–34.
10.
go back to reference Wang YX, Ramanan D, Hebert M. Meta-learning to detect rare objects. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019. pp. 9925–34. Wang YX, Ramanan D, Hebert M. Meta-learning to detect rare objects. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019. pp. 9925–34.
11.
go back to reference Fan Z, Ma Y, Li Z, Sun J. Generalized few-shot object detection without forgetting. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 4527–36. Fan Z, Ma Y, Li Z, Sun J. Generalized few-shot object detection without forgetting. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 4527–36.
12.
go back to reference Qi H, Brown M, Lowe DG. Low-shot learning with imprinted weights. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. pp. 5822–30. Qi H, Brown M, Lowe DG. Low-shot learning with imprinted weights. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. pp. 5822–30.
13.
go back to reference Antonelli S, Avola D, Cinque L, Crisostomi D, Foresti GL, Galasso F, Marini MR, Mecca A, Pannone D. Few-shot object detection: a survey. ACM Comput Surv (CSUR). 2022;54(11s):1–37. Antonelli S, Avola D, Cinque L, Crisostomi D, Foresti GL, Galasso F, Marini MR, Mecca A, Pannone D. Few-shot object detection: a survey. ACM Comput Surv (CSUR). 2022;54(11s):1–37.
14.
go back to reference Wang YX, Hebert M. Learning to learn: model regression networks for easy small sample learning. In: European Conference on Computer Vision. Cham: Springer; 2016 Oct 8. pp. 616–34. Wang YX, Hebert M. Learning to learn: model regression networks for easy small sample learning. In: European Conference on Computer Vision. Cham: Springer; 2016 Oct 8. pp. 616–34.
15.
go back to reference Finn C, Abbeel P, Levine S. Model-agnostic meta-learning for fast adaptation of deep networks. In: International conference on machine learning. PMLR; 2017 Jul 17. p. 1126–35. Finn C, Abbeel P, Levine S. Model-agnostic meta-learning for fast adaptation of deep networks. In: International conference on machine learning. PMLR; 2017 Jul 17. p. 1126–35.
16.
go back to reference Kang B, Liu Z, Wang X, Yu F, Feng J, Darrell T. Few-shot object detection via feature reweighting. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019. p. 8420–9. Kang B, Liu Z, Wang X, Yu F, Feng J, Darrell T. Few-shot object detection via feature reweighting. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019. p. 8420–9.
17.
go back to reference Xiao Y, Marlet R. Few-shot object detection and viewpoint estimation for objects in the wild. In: European conference on computer vision. Cham: Springer; 2020 Aug 23. p. 192–210. Xiao Y, Marlet R. Few-shot object detection and viewpoint estimation for objects in the wild. In: European conference on computer vision. Cham: Springer; 2020 Aug 23. p. 192–210.
18.
go back to reference Liu L, Ma B, Zhang Y, Yi X, Li H. Afd-net: Adaptive fully-dual network for few-shot object detection. In: Proceedings of the 29th ACM International Conference on Multimedia. 2021 Oct 17. p. 2549–57. Liu L, Ma B, Zhang Y, Yi X, Li H. Afd-net: Adaptive fully-dual network for few-shot object detection. In: Proceedings of the 29th ACM International Conference on Multimedia. 2021 Oct 17. p. 2549–57.
19.
go back to reference Fan Q, Zhuo W, Tang CK, Tai YW. Few-shot object detection with attention-RPN and multi-relation detector. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020. p. 4013–22. Fan Q, Zhuo W, Tang CK, Tai YW. Few-shot object detection with attention-RPN and multi-relation detector. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020. p. 4013–22.
20.
go back to reference Li B, Yang B, Liu C, Liu F, Ji R, Ye Q. Beyond max-margin: Class margin equilibrium for few-shot object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. p. 7363–72. Li B, Yang B, Liu C, Liu F, Ji R, Ye Q. Beyond max-margin: Class margin equilibrium for few-shot object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. p. 7363–72.
21.
go back to reference Zhang G, Luo Z, Cui K, Lu S. Meta-detr: few-shot object detection via unified image-level meta-learning. arXiv preprint. 2021 Mar 22;2(6). arXiv:2103.11731. Zhang G, Luo Z, Cui K, Lu S. Meta-detr: few-shot object detection via unified image-level meta-learning. arXiv preprint. 2021 Mar 22;2(6). arXiv:​2103.​11731.
22.
go back to reference Lin W, Deng Y, Gao Y, Wang N, Zhou J, Liu L, Zhang L, Wang P. CAT: cross-attention transformer for one-shot object detection. arXiv preprint; 2021 Apr 30. arXiv:2104.14984. Lin W, Deng Y, Gao Y, Wang N, Zhou J, Liu L, Zhang L, Wang P. CAT: cross-attention transformer for one-shot object detection. arXiv preprint; 2021 Apr 30. arXiv:​2104.​14984.
23.
go back to reference Chen H, Wang Y, Wang G, Qiao Y. Lstd: a low-shot transfer detector for object detection. In: Proceedings of the AAAI conference on artificial intelligence. 2018 Apr 29. Vol. 32, No. 1.CrossRef Chen H, Wang Y, Wang G, Qiao Y. Lstd: a low-shot transfer detector for object detection. In: Proceedings of the AAAI conference on artificial intelligence. 2018 Apr 29. Vol. 32, No. 1.CrossRef
24.
25.
go back to reference Wu J, Liu S, Huang D, Wang Y. Multi-scale positive sample refinement for few-shot object detection. In: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVI 16. Springer International Publishing; 2020. pp. 456–72. Wu J, Liu S, Huang D, Wang Y. Multi-scale positive sample refinement for few-shot object detection. In: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVI 16. Springer International Publishing; 2020. pp. 456–72.
26.
go back to reference Sun B, Li B, Cai S, Yuan Y, Zhang C. Fsce: Few-shot object detection via contrastive proposal encoding. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 7352–62. Sun B, Li B, Cai S, Yuan Y, Zhang C. Fsce: Few-shot object detection via contrastive proposal encoding. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 7352–62.
27.
go back to reference Zhang W, Wang YX. Hallucination improves few-shot object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 13008–17. Zhang W, Wang YX. Hallucination improves few-shot object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 13008–17.
28.
go back to reference Wu A, Han Y, Zhu L, Yang Y. Universal-prototype enhancing for few-shot object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. p. 9567–76. Wu A, Han Y, Zhu L, Yang Y. Universal-prototype enhancing for few-shot object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021. p. 9567–76.
29.
go back to reference Khandelwal S, Goyal R, Sigal L. Unit: Unified knowledge transfer for any-shot object detection and segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 5951–61. Khandelwal S, Goyal R, Sigal L. Unit: Unified knowledge transfer for any-shot object detection and segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021. pp. 5951–61.
30.
go back to reference Gidaris S, Komodakis N. Dynamic few-shot visual learning without forgetting. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. pp. 4367–75. Gidaris S, Komodakis N. Dynamic few-shot visual learning without forgetting. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. pp. 4367–75.
31.
go back to reference Dhillon GS, Chaudhari P, Ravichandran A, Soatto S. A baseline for few-shot image classification. arXiv preprint; 2019 Sep 6. arXiv:1909.02729. Dhillon GS, Chaudhari P, Ravichandran A, Soatto S. A baseline for few-shot image classification. arXiv preprint; 2019 Sep 6. arXiv:​1909.​02729.
32.
go back to reference Jung HG, Lee SW. Few-shot learning with geometric constraints. IEEE Trans Neural Netw Learn Syst. 2020 Jan 1;31(11):4660–72. Jung HG, Lee SW. Few-shot learning with geometric constraints. IEEE Trans Neural Netw Learn Syst. 2020 Jan 1;31(11):4660–72.
33.
go back to reference Ren S, He K, Girshick R, Sun J. Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst. 2015;28. Ren S, He K, Girshick R, Sun J. Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst. 2015;28.
34.
go back to reference Yosinski J, Clune J, Bengio Y, Lipson H. How transferable are features in deep neural networks? Adv Neural Inf Process Syst. 2014;27. Yosinski J, Clune J, Bengio Y, Lipson H. How transferable are features in deep neural networks? Adv Neural Inf Process Syst. 2014;27.
35.
go back to reference Guirguis K, Hendawy A, Eskandar G, Abdelsamad M, Kayser M, Beyerer J. CFA: constraint-based finetuning approach for generalized few-shot object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022. p. 4039–49. Guirguis K, Hendawy A, Eskandar G, Abdelsamad M, Kayser M, Beyerer J. CFA: constraint-based finetuning approach for generalized few-shot object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022. p. 4039–49.
Metadata
Title
Few-Shot Object Detection with Weight Imprinting
Authors
Dingtian Yan
Jitao Huang
Hai Sun
Fuqiang Ding
Publication date
25-05-2023
Publisher
Springer US
Published in
Cognitive Computation / Issue 5/2023
Print ISSN: 1866-9956
Electronic ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-023-10152-5

Other articles of this Issue 5/2023

Cognitive Computation 5/2023 Go to the issue

Premium Partner