Skip to main content

2020 | OriginalPaper | Buchkapitel

Smart Solution to Detect Images in Limited Visibility Conditions Based Convolutional Neural Networks

verfasst von : Ha Huy Cuong Nguyen, Duc Hien Nguyen, Van Loi Nguyen, Thanh Thuy Nguyen

Erschienen in: Advances in Computational Collective Intelligence

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Decrease in visibility causes many difficulties in vision, tracking. Current classic object detection techniques do not give satisfying results in less visibility. It is essential to detect and recognize the objects under such conditions and devise a better object detection mechanism. The paper proposes a solution to this problem by using a multi step approach that uses Saliency techniques and modern object detection algorithms to obtain the desired results. The distorted image is enhanced via a deep neural network for visibility enhancement. The image frame of a better quality undergoes saliency techniques so that less visible objects are visible. Faster Region-based Convolutional Neural Network (R-CNN) then runs on the saliency output to yield bounding boxes for all the objects. The coordinates of the bounding boxes are then applied on the original image thus detecting all the objects in a distorted image with less visibility.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kopf, J., et al.: Deep photo: model-based photograph enhancement and viewing. In: ACM Transactions on Graphics, Proceedings of SIGGRAPH Asia, vol. 27, pp. 116:1–116:10 (2008) Kopf, J., et al.: Deep photo: model-based photograph enhancement and viewing. In: ACM Transactions on Graphics, Proceedings of SIGGRAPH Asia, vol. 27, pp. 116:1–116:10 (2008)
3.
Zurück zum Zitat Hussain, F., Jeong, J.: Visibility enhancement of scene images degraded by foggy weather conditions with deep neural networks. J. Sens. 2016, Article ID 3894832, 9 (2016) Hussain, F., Jeong, J.: Visibility enhancement of scene images degraded by foggy weather conditions with deep neural networks. J. Sens. 2016, Article ID 3894832, 9 (2016)
4.
Zurück zum Zitat Amirul Islam, M.D., Kalash, M., Bruce, N.D.B.: Revisiting salient object detection: simultaneous detection, ranking, and subitizing of multiple salient objects. In: Computer Vision and Pattern Recognition (2018). arXiv:1803.05082 Amirul Islam, M.D., Kalash, M., Bruce, N.D.B.: Revisiting salient object detection: simultaneous detection, ranking, and subitizing of multiple salient objects. In: Computer Vision and Pattern Recognition (2018). arXiv:​1803.​05082
6.
Zurück zum Zitat Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal net- works. arXiv preprint arXiv:1506.01497 (2015) Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal net- works. arXiv preprint arXiv:​1506.​01497 (2015)
7.
Zurück zum Zitat Guo, J., Ren, T., Bei, J., Zhu, Y.: Salient object detection in RGB-D image based on saliency fusion and propagation. In: Proceedings of the 7th International Conference on Internet Multi-media Computing and Service, p. 59. ACM (2015) Guo, J., Ren, T., Bei, J., Zhu, Y.: Salient object detection in RGB-D image based on saliency fusion and propagation. In: Proceedings of the 7th International Conference on Internet Multi-media Computing and Service, p. 59. ACM (2015)
8.
Zurück zum Zitat Song, H., Liu, Z., Du, H., Sun, G., Le Meur, O., Ren, T.: Depth-aware salient object detec-tion and segmentation via multiscale discriminative saliency fusion and bootstrap learning. IEEE Trans. Image Process. 26(9), 4204–4216 (2017)MathSciNetCrossRef Song, H., Liu, Z., Du, H., Sun, G., Le Meur, O., Ren, T.: Depth-aware salient object detec-tion and segmentation via multiscale discriminative saliency fusion and bootstrap learning. IEEE Trans. Image Process. 26(9), 4204–4216 (2017)MathSciNetCrossRef
9.
Zurück zum Zitat Guo, F., Tang, J., Cai, Z.: Fusion strategy for single image dehazing. Int. J. Digital Content Technol. Appl. 7(1), 19 (2013)CrossRef Guo, F., Tang, J., Cai, Z.: Fusion strategy for single image dehazing. Int. J. Digital Content Technol. Appl. 7(1), 19 (2013)CrossRef
10.
Zurück zum Zitat Ansia, S., Aswathy, A.L.: Single image haze removal using white balancing and saliency map. Procedia Comput. Sci. 46, 12–19 (2015)CrossRef Ansia, S., Aswathy, A.L.: Single image haze removal using white balancing and saliency map. Procedia Comput. Sci. 46, 12–19 (2015)CrossRef
11.
Zurück zum Zitat Erdem, E.: A region covariance-based visual attention model for RGB-D images. Int. J. Intell. Syst. Appl. Eng. 4(4), 128–134 (2016)CrossRef Erdem, E.: A region covariance-based visual attention model for RGB-D images. Int. J. Intell. Syst. Appl. Eng. 4(4), 128–134 (2016)CrossRef
12.
Zurück zum Zitat Qu, L., He, S., Zhang, J., Tian, J., Tang, Y., Yang, Q.: RGBD salient object detection via deep fusion. IEEE Trans. Image Process. 26(5), 2274–2285 (2017)MathSciNetCrossRef Qu, L., He, S., Zhang, J., Tian, J., Tang, Y., Yang, Q.: RGBD salient object detection via deep fusion. IEEE Trans. Image Process. 26(5), 2274–2285 (2017)MathSciNetCrossRef
13.
Zurück zum Zitat Du, J.: Understanding of object detection based on CNN family and YOLO. In: Journal of Physics: Conference Series, vol. 1004, p. 012029. IOP Publishing (2018) Du, J.: Understanding of object detection based on CNN family and YOLO. In: Journal of Physics: Conference Series, vol. 1004, p. 012029. IOP Publishing (2018)
14.
Zurück zum Zitat Erdem, E., Erdem, A.: Visual saliency estimation by nonlinearly integrating features using region covariances. J. Vis. 13(4), 11–11 (2013)CrossRef Erdem, E., Erdem, A.: Visual saliency estimation by nonlinearly integrating features using region covariances. J. Vis. 13(4), 11–11 (2013)CrossRef
15.
Zurück zum Zitat Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. TPAMI 40(4), 834–848 (2018)CrossRef Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. TPAMI 40(4), 834–848 (2018)CrossRef
16.
Zurück zum Zitat Lakshmanaprabu, S.K., Mohanty, S.N., Shankar, K., Arunkumar, N.: Optimal deep learning model for classification of lung cancer on CT images. Future Gener. Comput. Syst. 19(1), 374–382 (2019) Lakshmanaprabu, S.K., Mohanty, S.N., Shankar, K., Arunkumar, N.: Optimal deep learning model for classification of lung cancer on CT images. Future Gener. Comput. Syst. 19(1), 374–382 (2019)
17.
Zurück zum Zitat Rajeshwari, P., Abhishek, P., Srikanth, P., Vinod, T.: Object detection: an overview. Int. J. Trend Sci. Res. Dev. (IJTSRD) 3(1), 1663–1665 (2019) Rajeshwari, P., Abhishek, P., Srikanth, P., Vinod, T.: Object detection: an overview. Int. J. Trend Sci. Res. Dev. (IJTSRD) 3(1), 1663–1665 (2019)
18.
Zurück zum Zitat Tian, Y., Yang, G., Wang, Z., et al.: Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Comput. Electron. Agric. 157, 417–426 (2019)CrossRef Tian, Y., Yang, G., Wang, Z., et al.: Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Comput. Electron. Agric. 157, 417–426 (2019)CrossRef
22.
Zurück zum Zitat Kim, S., Ji, Y., Lee, K.: An effective sign language learning with object detection based ROI segmentation. In: Second IEEE International Conference on Robotic Computing (IRC), Laguna Hills, CA, pp. 330–333 (2018) Kim, S., Ji, Y., Lee, K.: An effective sign language learning with object detection based ROI segmentation. In: Second IEEE International Conference on Robotic Computing (IRC), Laguna Hills, CA, pp. 330–333 (2018)
23.
Zurück zum Zitat Nwankpa, C., Ijomah, W., Gachagan, A., Marshall, S.: Activation functions: comparison of trends in practice and research for deep learning. arXiv:1811.03378 [cs.LG], November 2018 Nwankpa, C., Ijomah, W., Gachagan, A., Marshall, S.: Activation functions: comparison of trends in practice and research for deep learning. arXiv:​1811.​03378 [cs.LG], November 2018
25.
Zurück zum Zitat Munera, S., Amigo, J.M., Blasco, J., Cubero, S., Talens, P., Alexios, N.: Ripeness monitoring of two cultivars of nectarine using VIS-NIR hyperspectral reflectance imaging. J. Food Eng. 214(3), 29–39 (2017)CrossRef Munera, S., Amigo, J.M., Blasco, J., Cubero, S., Talens, P., Alexios, N.: Ripeness monitoring of two cultivars of nectarine using VIS-NIR hyperspectral reflectance imaging. J. Food Eng. 214(3), 29–39 (2017)CrossRef
27.
Zurück zum Zitat Zhang, Y., Sohn, K., Villegas, R., Pan, G., Lee, H.: Improving object detection with deep convolutional networks via bayesian optimization and structured prediction. arXiv:1504.03293 [cs.CV], January 2016 Zhang, Y., Sohn, K., Villegas, R., Pan, G., Lee, H.: Improving object detection with deep convolutional networks via bayesian optimization and structured prediction. arXiv:​1504.​03293 [cs.CV], January 2016
28.
Zurück zum Zitat Lu, Y., Javidi, T., Lazebnik, S.: Adaptive object detection using adjacency and zoom prediction. arXiv:1512.07711 [cs.CV], April 2016 Lu, Y., Javidi, T., Lazebnik, S.: Adaptive object detection using adjacency and zoom prediction. arXiv:​1512.​07711 [cs.CV], April 2016
29.
Zurück zum Zitat Santagapita, P.R., Tylewicz, U., Panarese, V., Rocculi, P., Dalla Rosa, M.: Non-destructive assessment of kiwifruit physic-chemical parameters to optimize the osmotic dehydration process: a study on FT-NIR spectroscopy. J. Biosyst. Eng. 142(2), 101–129 (2016)CrossRef Santagapita, P.R., Tylewicz, U., Panarese, V., Rocculi, P., Dalla Rosa, M.: Non-destructive assessment of kiwifruit physic-chemical parameters to optimize the osmotic dehydration process: a study on FT-NIR spectroscopy. J. Biosyst. Eng. 142(2), 101–129 (2016)CrossRef
30.
Zurück zum Zitat Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. arXiv:1506.01497v3 [cs.CV], January 2016 Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. arXiv:​1506.​01497v3 [cs.CV], January 2016
31.
Zurück zum Zitat Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. arXiv:1506.02640 [cs.CV], May 2016 Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. arXiv:​1506.​02640 [cs.CV], May 2016
32.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2015) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2015)
35.
Zurück zum Zitat Erhan, D., Szegedy, C., Toshev, A., Anguelov, D.: Scalable object detection using deep neural networks. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2147–2154 (2014) Erhan, D., Szegedy, C., Toshev, A., Anguelov, D.: Scalable object detection using deep neural networks. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2147–2154 (2014)
36.
Zurück zum Zitat Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. arXiv:1311.2524 [cs.CV], October 2014 Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. arXiv:​1311.​2524 [cs.CV], October 2014
37.
Zurück zum Zitat Jia, K., Wang, X., Tang, X.: Image transformation based on learning dictionaries across image spaces. IEEE Trans. Pattern Anal. Mach. Intell. 35(2), 367–380 (2013)CrossRef Jia, K., Wang, X., Tang, X.: Image transformation based on learning dictionaries across image spaces. IEEE Trans. Pattern Anal. Mach. Intell. 35(2), 367–380 (2013)CrossRef
Metadaten
Titel
Smart Solution to Detect Images in Limited Visibility Conditions Based Convolutional Neural Networks
verfasst von
Ha Huy Cuong Nguyen
Duc Hien Nguyen
Van Loi Nguyen
Thanh Thuy Nguyen
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-63119-2_52