Skip to main content
Erschienen in: Pattern Analysis and Applications 3/2023

17.06.2023 | Theoretical Advances

CB-FPN: object detection feature pyramid network based on context information and bidirectional efficient fusion

verfasst von: Zhibo Liu, Jian Cheng

Erschienen in: Pattern Analysis and Applications | Ausgabe 3/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Feature pyramid network (FPN) is a typical structure in object detection. It can improve the accuracy of detection results by fusing feature information at different resolutions and enhancing the expression ability of different levels of features. Among them, the mismatch between the resolution of feature information and the receptive field and the limited way of feature fusion hinder the full exchange of feature information. To solve the above problems, this paper designs a new structure called an object detection feature pyramid network based on context information and an efficient bidirectional fusion (CB-FPN): (1) Before feature fusion, this study designs a context enhancement module with cross stage partial network (CSPNet) module (CEM-CSP). By using carefully designed dilated convolutions on high-level features, rich context information and receptive fields are obtained to match appropriate feature information. (2) In feature fusion, this study designed a bidirectional efficient feature pyramid network (BE-FPN) module to fuse features efficiently. After adding these two modified architectures to Faster R-CNN with ResNet-50, the average precision (AP) improves from 37.5 to 39.2 on COCO val-2017 data set. In addition, extensive experiments show the effectiveness of our methods on one-stage, two-stage, and anchor-free models.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Yang S, Luo P, Loy C, et al. (2016) Wider face: a face detection benchmark. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5525–5533 Yang S, Luo P, Loy C, et al. (2016) Wider face: a face detection benchmark. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5525–5533
4.
Zurück zum Zitat Leitner J, Förster A, Schmidhuber J (2014) Improving robot vision models for object detection through interaction. In: International joint conference on neural networks (IJCNN), pp 3355–3362 Leitner J, Förster A, Schmidhuber J (2014) Improving robot vision models for object detection through interaction. In: International joint conference on neural networks (IJCNN), pp 3355–3362
5.
Zurück zum Zitat Malburg L, Rieder M, Seiger R et al (2021) Object detection for smart factory processes by machine learning. Procedia Comput Sci 184:581–588CrossRef Malburg L, Rieder M, Seiger R et al (2021) Object detection for smart factory processes by machine learning. Procedia Comput Sci 184:581–588CrossRef
7.
Zurück zum Zitat Lin T, Dollár P, Girshick R, He K, et al. (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125 Lin T, Dollár P, Girshick R, He K, et al. (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
9.
Zurück zum Zitat Lin T, Goyal P, Girshick R, et al. (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988 Lin T, Goyal P, Girshick R, et al. (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
10.
Zurück zum Zitat Li Z, Peng C, Yu G, et al. (2018) Detnet: Design backbone for object detection. In: Proceedings of the European conference on computer vision (ECCV), pp 334–350 Li Z, Peng C, Yu G, et al. (2018) Detnet: Design backbone for object detection. In: Proceedings of the European conference on computer vision (ECCV), pp 334–350
11.
Zurück zum Zitat Lin T, Maire M, Belongie S, et al. (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, pp 740–755 Lin T, Maire M, Belongie S, et al. (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, pp 740–755
13.
Zurück zum Zitat Deng J, Dong W, Socher R, et al. (2009) Imagenet: A large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition, pp 248–255 Deng J, Dong W, Socher R, et al. (2009) Imagenet: A large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition, pp 248–255
14.
Zurück zum Zitat Liu S, Qi L, Qin H, et al. (2018) Path aggregation network for instance segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8759–8768 Liu S, Qi L, Qin H, et al. (2018) Path aggregation network for instance segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8759–8768
16.
Zurück zum Zitat Tan M, Pang R, Le QV Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10781–10790 Tan M, Pang R, Le QV Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10781–10790
17.
Zurück zum Zitat Pang J, Chen K, Shi J, et al. (2019) Libra R-CNN: Towards balanced learning for object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 821–830 Pang J, Chen K, Shi J, et al. (2019) Libra R-CNN: Towards balanced learning for object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 821–830
19.
Zurück zum Zitat He K, Gkioxari G, Dollár P, et al. (2019) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969. He K, Gkioxari G, Dollár P, et al. (2019) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969.
20.
Zurück zum Zitat Cai Z, Vasconcelos N. Cascade r-cnn: delving into high quality object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6154–6162. Cai Z, Vasconcelos N. Cascade r-cnn: delving into high quality object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6154–6162.
22.
Zurück zum Zitat Redmon J, Divvala S, Girshick R, et al. (2020) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788 Redmon J, Divvala S, Girshick R, et al. (2020) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
24.
Zurück zum Zitat Tian Z, Shen C, Chen H, et al. (2019) FCOS: Fully convolutional one-stage object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9627–9636 Tian Z, Shen C, Chen H, et al. (2019) FCOS: Fully convolutional one-stage object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9627–9636
28.
Zurück zum Zitat Chen X, Li LJ, Gupta A, et al. (2018) Iterative visual reasoning beyond convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7239–7248 Chen X, Li LJ, Gupta A, et al. (2018) Iterative visual reasoning beyond convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7239–7248
29.
Zurück zum Zitat Hu H, Gu J, Zhang Z, et al. (2018) Relation networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3588–3597. Hu H, Gu J, Zhang Z, et al. (2018) Relation networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3588–3597.
31.
Zurück zum Zitat Wang CY, Liao HYM, Wu YH, et al. (2020) CSPNet: A new backbone that can enhance learning capability of cnn. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 390–391 Wang CY, Liao HYM, Wu YH, et al. (2020) CSPNet: A new backbone that can enhance learning capability of cnn. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 390–391
32.
Zurück zum Zitat Wang P, Chen P, Yuan Y, et al. (2018) Understanding convolution for semantic segmentation. In: IEEE winter conference on applications of computer vision (WACV), pp 1451–1460 Wang P, Chen P, Yuan Y, et al. (2018) Understanding convolution for semantic segmentation. In: IEEE winter conference on applications of computer vision (WACV), pp 1451–1460
Metadaten
Titel
CB-FPN: object detection feature pyramid network based on context information and bidirectional efficient fusion
verfasst von
Zhibo Liu
Jian Cheng
Publikationsdatum
17.06.2023
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 3/2023
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-023-01173-9

Weitere Artikel der Ausgabe 3/2023

Pattern Analysis and Applications 3/2023 Zur Ausgabe

Premium Partner