Skip to main content
Erschienen in: The Journal of Supercomputing 5/2020

30.08.2018

Concise feature pyramid region proposal network for multi-scale object detection

verfasst von: Baofu Fang, Lu Fang

Erschienen in: The Journal of Supercomputing | Ausgabe 5/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Object detection is a hot research issue in the field of computer vision. Many methods focus on detecting large objects. And features of small objects are easily weakened or even disappeared after multiple convolution layers. So the detection rate of multi-scale objects is unsatisfied. Aiming at this problem, a concise feature pyramid region proposal network (CFPRPN) is proposed to address the problem of small objects detection in this paper without missing the large objects. In the process of object detection, we propose a new method of adjustment for the object location. So the balanced detection of multi-scale objects is realized. CFPRPN combines image pyramids and feature pyramids. An image pyramid consists of scaled versions of an image and the feature pyramids produce multiple layers’ feature maps. They are both conducive to capturing the feature information of small objects in deep convolutional networks. At the same time, proposals of overlapping sizes from different layers are applied to improve the recall rate of multi-scale objects. These series operations are beneficial for CFPRPN to extract better proposals. We experimentally prove that after adding the fine-tuning location, the detection rate of multi-scale object is further improved. The inspiring thing is that refining location method is suitable for most algorithms of object detection.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhou Y, Han J, Yuan X, Wei Z, Hong R (2017) Inverse sparse group lasso model for robust object tracking. IEEE Trans Multimed 19(8):1798–1810CrossRef Zhou Y, Han J, Yuan X, Wei Z, Hong R (2017) Inverse sparse group lasso model for robust object tracking. IEEE Trans Multimed 19(8):1798–1810CrossRef
2.
Zurück zum Zitat Wang H, Fan Y, Fang B (2018) Generalized linear discriminant analysis based on Euclidean norm for gait recognition. Int J Mach Learn Cybernet 9(4):569–576CrossRef Wang H, Fan Y, Fang B (2018) Generalized linear discriminant analysis based on Euclidean norm for gait recognition. Int J Mach Learn Cybernet 9(4):569–576CrossRef
3.
Zurück zum Zitat Ommer B, Malik J (2009) Multi-scale object detection by clustering lines. In: IEEE International Conference on Computer Vision, pp 484–491 Ommer B, Malik J (2009) Multi-scale object detection by clustering lines. In: IEEE International Conference on Computer Vision, pp 484–491
4.
Zurück zum Zitat Uijlings JR, Sande KE, Gevers T, Smeulders AW (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef Uijlings JR, Sande KE, Gevers T, Smeulders AW (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef
5.
Zurück zum Zitat Zitnick CL, Doll´ar P (2014) Edge boxes: Locating object proposals from edges. In European Conference on Computer Vision, pp 391–405 Zitnick CL, Doll´ar P (2014) Edge boxes: Locating object proposals from edges. In European Conference on Computer Vision, pp 391–405
6.
Zurück zum Zitat Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp 580–587 Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp 580–587
7.
Zurück zum Zitat Girshick R (2015) Fast R-CNN. In: IEEE International Conference on Computer Vision, pp 1440–1448 Girshick R (2015) Fast R-CNN. In: IEEE International Conference on Computer Vision, pp 1440–1448
8.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In International Conference on Neural Information Processing Systems, pp 91–99 Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In International Conference on Neural Information Processing Systems, pp 91–99
9.
Zurück zum Zitat Shrivastava A, Gupta A, Girshick R (2016) Training region-based object detectors with online hard example mining. In IEEE International Conference on Computer Vision and Pattern Recognition, pp 761–769 Shrivastava A, Gupta A, Girshick R (2016) Training region-based object detectors with online hard example mining. In IEEE International Conference on Computer Vision and Pattern Recognition, pp 761–769
10.
Zurück zum Zitat Gkioxari G, Girshick R, Malik J (2015) Contextual action recognition with R-CNN. In: IEEE International Conference on Computer Vision, pp 1080–1088 Gkioxari G, Girshick R, Malik J (2015) Contextual action recognition with R-CNN. In: IEEE International Conference on Computer Vision, pp 1080–1088
11.
Zurück zum Zitat Yuan X, Xie L, Abouelenien M (2018) A regularized ensemble framework of deep learning for cancer detection from multi-class, imbalanced training data. Pattern Recognit 77:160–172CrossRef Yuan X, Xie L, Abouelenien M (2018) A regularized ensemble framework of deep learning for cancer detection from multi-class, imbalanced training data. Pattern Recognit 77:160–172CrossRef
12.
Zurück zum Zitat Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition, p 4 Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition, p 4
13.
Zurück zum Zitat Zeiler MD, Krishnan D, Taylor GW, Fergus R (2010) Deconvolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2528–2535 Zeiler MD, Krishnan D, Taylor GW, Fergus R (2010) Deconvolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2528–2535
14.
Zurück zum Zitat Kantorov V, Oquab M, Cho, Laptev I (2016) ContextLocNet: context-aware deep network models for weakly supervised localization. In: European Conference on Computer Vision, pp 350–365 Kantorov V, Oquab M, Cho, Laptev I (2016) ContextLocNet: context-aware deep network models for weakly supervised localization. In: European Conference on Computer Vision, pp 350–365
15.
Zurück zum Zitat Everingham M, Zisserman A, Williams CK et al The PASCAL visual object classes challenge 2007 (VOC2007) results. In: International Conference on Machine Learning Challenges: Evaluating Predictive Uncertainty Visual Object Classification and Recognizing Textual Entailment. Springer, Berlin, pp 117–176 Everingham M, Zisserman A, Williams CK et al The PASCAL visual object classes challenge 2007 (VOC2007) results. In: International Conference on Machine Learning Challenges: Evaluating Predictive Uncertainty Visual Object Classification and Recognizing Textual Entailment. Springer, Berlin, pp 117–176
16.
Zurück zum Zitat Deng J, Dong W, Socher R, Li LJ, Li K, Li FF (2009) ImageNet: a large-scale hierarchical image database. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp 248–255 Deng J, Dong W, Socher R, Li LJ, Li K, Li FF (2009) ImageNet: a large-scale hierarchical image database. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp 248–255
Metadaten
Titel
Concise feature pyramid region proposal network for multi-scale object detection
verfasst von
Baofu Fang
Lu Fang
Publikationsdatum
30.08.2018
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 5/2020
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2569-1

Weitere Artikel der Ausgabe 5/2020

The Journal of Supercomputing 5/2020 Zur Ausgabe

Premium Partner