Pattern Analysis and Applications

17 June 2023 | Theoretical Advances

CB-FPN: object detection feature pyramid network based on context information and bidirectional efficient fusion

Authors: Zhibo Liu, Jian Cheng

Published in: Pattern Analysis and Applications | Issue 3/2023

Abstract

The feature pyramid network (FPN) is a standard structure in object detection. It improves detection accuracy by fusing feature information at different resolutions, which strengthens the expressive power of features at every level. Two problems, however, hinder the full exchange of feature information: the resolution of the feature information is mismatched with the receptive field, and the ways in which features can be fused are limited. To address these problems, this paper proposes CB-FPN, an object detection feature pyramid network based on context information and bidirectional efficient fusion. (1) Before feature fusion, a context enhancement module built with a cross stage partial network (CSPNet) module (CEM-CSP) applies carefully designed dilated convolutions to high-level features, obtaining rich context information and receptive fields that match the appropriate feature information. (2) During feature fusion, a bidirectional efficient feature pyramid network (BE-FPN) module fuses features efficiently. Adding these two modules to Faster R-CNN with ResNet-50 improves the average precision (AP) from 37.5 to 39.2 on the COCO val-2017 data set. In addition, extensive experiments show the effectiveness of the methods on one-stage, two-stage, and anchor-free models.
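
Point (1) of the abstract relies on dilated convolutions to enlarge the receptive field of high-level features. The standard relationship between dilation rate and receptive field can be sketched as below. This is only an illustration of the general arithmetic: the 3x3 kernels, the dilation rates 1, 2, 3, and the function names are assumptions chosen for the example, not the configuration of CEM-CSP, which the abstract does not specify.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by one dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def stacked_receptive_field(layers) -> int:
    """Receptive field of convolution layers applied one after another.

    `layers` is an iterable of (kernel_size, dilation, stride) tuples.
    """
    rf = 1    # a single input pixel sees only itself
    jump = 1  # spacing, in input pixels, between neighbouring outputs
    for kernel_size, dilation, stride in layers:
        k_eff = effective_kernel_size(kernel_size, dilation)
        rf += (k_eff - 1) * jump
        jump *= stride
    return rf


if __name__ == "__main__":
    # Hypothetical dilation rates, for illustration only.
    for d in (1, 2, 3):
        print(f"3x3 kernel, dilation {d}: spans {effective_kernel_size(3, d)} pixels")

    plain = [(3, 1, 1)] * 3
    dilated = [(3, 1, 1), (3, 2, 1), (3, 3, 1)]
    print("three plain 3x3 layers:  ", stacked_receptive_field(plain))
    print("three dilated 3x3 layers:", stacked_receptive_field(dilated))
```

With the same number of weights, the dilated stack in this example covers a 13-pixel span instead of 7, which is why dilation is a common way to add context without reducing feature-map resolution.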

Title: CB-FPN: object detection feature pyramid network based on context information and bidirectional efficient fusion
Authors: Zhibo Liu, Jian Cheng
Publication date: 17 June 2023
Publisher: Springer London
Published in: Pattern Analysis and Applications, Issue 3/2023
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI: https://doi.org/10.1007/s10044-023-01173-9
