Skip to main content
Erschienen in: International Journal of Computer Vision 9/2019

27.03.2019

Deep Learning Approach in Aerial Imagery for Supporting Land Search and Rescue Missions

verfasst von: Dunja Božić-Štulić, Željko Marušić, Sven Gotovac

Erschienen in: International Journal of Computer Vision | Ausgabe 9/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we propose a novel approach to person detection in UAV aerial images for search and rescue tasks in Mediterranean and Sub-Mediterranean landscapes. Person detection in very high spatial resolution images involves target objects that are relatively small and often camouflaged within the environment; thus, such detection is a challenging and demanding task. The proposed method starts by reducing the search space through a visual attention algorithm that detects the salient or most prominent segments in the image. To reduce the number of non-relevant salient regions, we selected those regions most likely to contain a person using pre-trained and fine-tuned convolutional neural networks (CNNs) for detection. We established a special database called HERIDAL to train and test our model. This database was compiled for training purposes, and it contains over 68,750 image patches of wilderness acquired from an aerial perspective as well as approximately 500 labelled full-size real-world images intended for testing purposes. The proposed method achieved a detection rate of 88.9% and a precision of 34.8%, which demonstrates better effectiveness than the system currently used by Croatian Mountain search and rescue (SAR) teams (IPSAR), which is based on mean-shift segmentation. We also used the HERIDAL database to train and test a state-of-the-art region proposal network, Faster R-CNN (Ren et al. in Faster R-CNN: towards real-time object detection with region proposal networks, 2015. CoRR arXiv:​1506.​01497), which achieved comparable but slightly worse results than those of our proposed method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The data set has been published on IPSAR website, http://​ipsar.​fesb.​unist.​hr under the page “HERIDAL” or direct link: http://​ipsar.​fesb.​unist.​hr/​HERIDAL%20​database.​html.
 
Literatur
Zurück zum Zitat Angelova, A., Krizhevsky, A., Vanhoucke, V., Ogale, A., & Ferguson, D. (2015). Real-time pedestrian detection with deep network cascades. In Proceedings of BMVC 2015. Angelova, A., Krizhevsky, A., Vanhoucke, V., Ogale, A., & Ferguson, D. (2015). Real-time pedestrian detection with deep network cascades. In Proceedings of BMVC 2015.
Zurück zum Zitat Borji, A., Cheng, M. M., Hou, Q., Jiang, H., & Li, J. (2014). Salient object detection: A survey. arXiv preprint arXiv:1411.5878. Borji, A., Cheng, M. M., Hou, Q., Jiang, H., & Li, J. (2014). Salient object detection: A survey. arXiv preprint arXiv:​1411.​5878.
Zurück zum Zitat Chen, C., Liu, M. Y., Tuzel, O., & Xiao, J. (2017). R-CNN for small object detection. In S. H. Lai, V. Lepetit, K. Nishino, & Y. Sato (Eds.), Computer Vision—ACCV 2016 (pp. 214–230). Cham: Springer.CrossRef Chen, C., Liu, M. Y., Tuzel, O., & Xiao, J. (2017). R-CNN for small object detection. In S. H. Lai, V. Lepetit, K. Nishino, & Y. Sato (Eds.), Computer Vision—ACCV 2016 (pp. 214–230). Cham: Springer.CrossRef
Zurück zum Zitat Daubechies, I. (1992). Ten lectures on wavelets. Philadelphia, PA: Society for Industrial and Applied Mathematics.CrossRefMATH Daubechies, I. (1992). Ten lectures on wavelets. Philadelphia, PA: Society for Industrial and Applied Mathematics.CrossRefMATH
Zurück zum Zitat Girshick, R. B., Donahue, J., Darrell, T., & Malik, J. (2013). Rich feature hierarchies for accurate object detection and semantic segmentation. CoRR arXiv:1311.2524. Girshick, R. B., Donahue, J., Darrell, T., & Malik, J. (2013). Rich feature hierarchies for accurate object detection and semantic segmentation. CoRR arXiv:​1311.​2524.
Zurück zum Zitat Hosang, J., Omran, M., Benenson, R., & Schiele, B. (2015). Taking a deeper look at pedestrians. In IEEE conference on computer vision and pattern recognition (CVPR). Hosang, J., Omran, M., Benenson, R., & Schiele, B. (2015). Taking a deeper look at pedestrians. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Proceedings of the 25th international conference on neural information processing systems—Volume 1, Curran Associates Inc., USA, NIPS’12 (pp. 1097–1105). http://dl.acm.org/citation.cfm?id=2999134.2999257. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Proceedings of the 25th international conference on neural information processing systems—Volume 1, Curran Associates Inc., USA, NIPS’12 (pp. 1097–1105). http://​dl.​acm.​org/​citation.​cfm?​id=​2999134.​2999257.
Zurück zum Zitat Lecun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. In Proceedings of the IEEE (pp. 2278–2324). Lecun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. In Proceedings of the IEEE (pp. 2278–2324).
Zurück zum Zitat Leroy, J., Riche, N., Mancas, M., Gosselin, B., & Dutoit, T. (2014). Superrare: an object-oriented saliency algorithm based on superpixels rarity. Leroy, J., Riche, N., Mancas, M., Gosselin, B., & Dutoit, T. (2014). Superrare: an object-oriented saliency algorithm based on superpixels rarity.
Zurück zum Zitat Li, J., Levine, M. D., An, X., Xu, X., & He, H. (2016). Visual saliency based on scale-space analysis in the frequency domain. CoRR arXiv:1605.01999. Li, J., Levine, M. D., An, X., Xu, X., & He, H. (2016). Visual saliency based on scale-space analysis in the frequency domain. CoRR arXiv:​1605.​01999.
Zurück zum Zitat Musić, J., Orović, I., Marasović, T., Papić, V., & Stanković, S. (2016). Gradient compressive sensing for image data reduction in UAV based search and rescue in the wild. In Mathematical problems in engineering, 2016. https://doi.org/10.1155/2016/6827414. Musić, J., Orović, I., Marasović, T., Papić, V., & Stanković, S. (2016). Gradient compressive sensing for image data reduction in UAV based search and rescue in the wild. In Mathematical problems in engineering, 2016. https://​doi.​org/​10.​1155/​2016/​6827414.
Zurück zum Zitat Ren, S., He, K., Girshick, R. B., & Sun, J. (2015). Faster R-CNN: Towards real-time object detection with region proposal networks. CoRR arXiv:1506.01497. Ren, S., He, K., Girshick, R. B., & Sun, J. (2015). Faster R-CNN: Towards real-time object detection with region proposal networks. CoRR arXiv:​1506.​01497.
Zurück zum Zitat Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2014). Imagenet large scale visual recognition challenge. CoRR arXiv:1409.0575. Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2014). Imagenet large scale visual recognition challenge. CoRR arXiv:​1409.​0575.
Zurück zum Zitat Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. CoRR arXiv:1409.1556. Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. CoRR arXiv:​1409.​1556.
Zurück zum Zitat Sokalski, J., Breckon, T. P., & Cowling, I. (2010). Automatic salient object detection in uav imagery. In Proceedings of the 25th international unmanned air vehicle systems (pp. 1–12). Sokalski, J., Breckon, T. P., & Cowling, I. (2010). Automatic salient object detection in uav imagery. In Proceedings of the 25th international unmanned air vehicle systems (pp. 1–12).
Zurück zum Zitat Tian, Y., Luo, P., Wang, X., & Tang, X. (2015). Deep learning strong parts for pedestrian detection. In: 2015 IEEE international conference on computer vision (ICCV) (pp. 1904–1912). Tian, Y., Luo, P., Wang, X., & Tang, X. (2015). Deep learning strong parts for pedestrian detection. In: 2015 IEEE international conference on computer vision (ICCV) (pp. 1904–1912).
Zurück zum Zitat Treisman, A. M., & Gelade, G. (1980). A feature-integration theory of attention. Cognit Psychol, 12(1), 97–136.CrossRef Treisman, A. M., & Gelade, G. (1980). A feature-integration theory of attention. Cognit Psychol, 12(1), 97–136.CrossRef
Zurück zum Zitat Turić, H., Dujmić, H., & Papić, V. (2010). Two-stage segmentation of aerial images for search and rescue. Information Technology and Control, 39, 138–145. Turić, H., Dujmić, H., & Papić, V. (2010). Two-stage segmentation of aerial images for search and rescue. Information Technology and Control, 39, 138–145.
Zurück zum Zitat Yuan, P., Zhong, Y., & Yuan, Y. (2017). Faster r-cnn with region proposal refinement. Yuan, P., Zhong, Y., & Yuan, Y. (2017). Faster r-cnn with region proposal refinement.
Zurück zum Zitat Zhang, L., Lin, L., Liang, X., & He, K. (2016). Is faster R-CNN doing well for pedestrian detection? In B. Leibe, J. Matas, N. Sebe, & M. Welling (Eds.), Computer Vision—ECCV 2016 (pp. 443–457). Cham: Springer.CrossRef Zhang, L., Lin, L., Liang, X., & He, K. (2016). Is faster R-CNN doing well for pedestrian detection? In B. Leibe, J. Matas, N. Sebe, & M. Welling (Eds.), Computer Vision—ECCV 2016 (pp. 443–457). Cham: Springer.CrossRef
Metadaten
Titel
Deep Learning Approach in Aerial Imagery for Supporting Land Search and Rescue Missions
verfasst von
Dunja Božić-Štulić
Željko Marušić
Sven Gotovac
Publikationsdatum
27.03.2019
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 9/2019
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-019-01177-1

Weitere Artikel der Ausgabe 9/2019

International Journal of Computer Vision 9/2019 Zur Ausgabe

Premium Partner