Skip to main content
Erschienen in: Wireless Personal Communications 4/2021

14.05.2021

Deep Learning Based Object Detection Combined with Internet of Things for Remote Surveillance

verfasst von: Aayushi Gautam, Sukhwinder Singh

Erschienen in: Wireless Personal Communications | Ausgabe 4/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Object detection is the key process in any video surveillance application. In case of remote surveillance, it is a necessity to accurately detect the target and transmit the detected data rapidly to main station so that further actions can be taken. This paper concentrates on a framework which uses deep neural network and Internet of Things for target detection and transferring detected information to the cloud at low transmission rates. The detection framework is based on combination of YOLO-Lite which is a simpler version of you only look once (YOLO) detector and spatial pyramid pooling (SPP). When trained on COCO dataset, YOLO-Lite + SPP model runs at a speed of 40 fps with mAP of 35.7% on non-GPU platform. Performance of the same has been analyzed on PASCAL VOC, COCO, TB-50 and TB-100 dataset. On GPU based platform, precision and recall values of 89.79% and 91.67% has been achieved with processing speed of 218 fps. ThingSpeak platform has been used for data reception on cloud. Results in real-time are also demonstrated which proves the efficiency of the anticipated framework and also confirms its suitability for remote video surveillance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhao, Z. Q., Zheng, P., Xu, S. T., & Wu, X. (2019). Object detection with deep learning: A review. IEEE Transactions on Neural Networks and Learning Systems, 30(11), 3212–3232.CrossRef Zhao, Z. Q., Zheng, P., Xu, S. T., & Wu, X. (2019). Object detection with deep learning: A review. IEEE Transactions on Neural Networks and Learning Systems, 30(11), 3212–3232.CrossRef
2.
Zurück zum Zitat He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(9), 1904–1916.CrossRef He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(9), 1904–1916.CrossRef
3.
Zurück zum Zitat Ren, Y., Huang, J., Hong, Z., Lu, W., Yin, J., Zou, L., & Shen, X. (2020). Image-based concrete crack detection in tunnels using deep fully convolutional networks. Construction and Building Materials, 234, 117367.CrossRef Ren, Y., Huang, J., Hong, Z., Lu, W., Yin, J., Zou, L., & Shen, X. (2020). Image-based concrete crack detection in tunnels using deep fully convolutional networks. Construction and Building Materials, 234, 117367.CrossRef
4.
Zurück zum Zitat Feng, W., Ji, D., Wang, Y., Chang, S., Ren, H. & Gan, W. (2018). Challenges on large scale surveillance video analysis. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops (pp. 69–76). Feng, W., Ji, D., Wang, Y., Chang, S., Ren, H. & Gan, W. (2018). Challenges on large scale surveillance video analysis. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops (pp. 69–76).
5.
Zurück zum Zitat Zhang, R., Liu, X., Hu, J., Chang, K., & Liu, K. (2017). A fast method for moving object detection in video surveillance image. Signal, Image and Video Processing, 11(5), 841–848.CrossRef Zhang, R., Liu, X., Hu, J., Chang, K., & Liu, K. (2017). A fast method for moving object detection in video surveillance image. Signal, Image and Video Processing, 11(5), 841–848.CrossRef
6.
Zurück zum Zitat Varga, D., & Szirányi, T. (2017). Robust real-time pedestrian detection in surveillance videos. Journal of Ambient Intelligence and Humanized Computing, 8(1), 79–85.CrossRef Varga, D., & Szirányi, T. (2017). Robust real-time pedestrian detection in surveillance videos. Journal of Ambient Intelligence and Humanized Computing, 8(1), 79–85.CrossRef
7.
Zurück zum Zitat Zhou, P., Ding, Q., Luo, H., & Hou, X. (2018). Violence detection in surveillance video using low-level features. PLoS ONE, 13(10), e0203668.CrossRef Zhou, P., Ding, Q., Luo, H., & Hou, X. (2018). Violence detection in surveillance video using low-level features. PLoS ONE, 13(10), e0203668.CrossRef
8.
Zurück zum Zitat Hu, L., & Ni, Q. (2017). IoT-driven automated object detection algorithm for urban surveillance systems in smart cities. IEEE Internet of Things Journal, 5(2), 747–754.CrossRef Hu, L., & Ni, Q. (2017). IoT-driven automated object detection algorithm for urban surveillance systems in smart cities. IEEE Internet of Things Journal, 5(2), 747–754.CrossRef
9.
Zurück zum Zitat Nikouei, S. Y., Chen, Y., Song, S., Xu, R., Choi, B. Y., & Faughnan, T. R. (2018). Real-time human detection as an edge service enabled by a lightweight cnn. In 2018 IEEE International Conference on Edge Computing (EDGE) (pp. 125–129). IEEE. Nikouei, S. Y., Chen, Y., Song, S., Xu, R., Choi, B. Y., & Faughnan, T. R. (2018). Real-time human detection as an edge service enabled by a lightweight cnn. In 2018 IEEE International Conference on Edge Computing (EDGE) (pp. 125–129). IEEE.
10.
Zurück zum Zitat Wang, H., Wang, P., & Qian, X. (2018). MPNET: An end-to-end deep neural network for object detection in surveillance video. IEEE Access, 6, 30296–30308.CrossRef Wang, H., Wang, P., & Qian, X. (2018). MPNET: An end-to-end deep neural network for object detection in surveillance video. IEEE Access, 6, 30296–30308.CrossRef
11.
Zurück zum Zitat Muhammad, K., Ahmad, J., Mehmood, I., Rho, S., & Baik, S. W. (2018). Convolutional neural networks based fire detection in surveillance videos. IEEE Access, 6, 18174–18183.CrossRef Muhammad, K., Ahmad, J., Mehmood, I., Rho, S., & Baik, S. W. (2018). Convolutional neural networks based fire detection in surveillance videos. IEEE Access, 6, 18174–18183.CrossRef
12.
Zurück zum Zitat Kim, K.H., Hong, S., Roh, B., Cheon, Y. & Park, M. (2016). Pvanet: Deep but lightweight neural networks for real-time object detection. arXiv preprint . arXiv:1608.08021. Kim, K.H., Hong, S., Roh, B., Cheon, Y. & Park, M. (2016). Pvanet: Deep but lightweight neural networks for real-time object detection. arXiv preprint . arXiv:​1608.​08021.
13.
Zurück zum Zitat Nguyen, T. B. & Chung, S. T. (2016). ConvNets and AGMM based real-time human detection under fisheye camera for embedded surveillance. In 2016 international conference on information and communication technology convergence (ICTC) (pp. 840–845). IEEE. Nguyen, T. B. & Chung, S. T. (2016). ConvNets and AGMM based real-time human detection under fisheye camera for embedded surveillance. In 2016 international conference on information and communication technology convergence (ICTC) (pp. 840–845). IEEE.
14.
Zurück zum Zitat Anisimov, D. and Khanova, T. (2017). Towards lightweight convolutional neural networks for object detection. In 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS) (pp. 1–8). IEEE. Anisimov, D. and Khanova, T. (2017). Towards lightweight convolutional neural networks for object detection. In 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS) (pp. 1–8). IEEE.
16.
Zurück zum Zitat He, Z., & He, H. (2018). Unsupervised multi-object detection for video surveillance using memory-based recurrent attention networks. Symmetry, 10(9), 375.CrossRef He, Z., & He, H. (2018). Unsupervised multi-object detection for video surveillance using memory-based recurrent attention networks. Symmetry, 10(9), 375.CrossRef
17.
Zurück zum Zitat Huang, R., Pedoeem, J., & Chen, C. (2018). YOLO-LITE: a real-time object detection algorithm optimized for non-GPU computers. In 2018 IEEE International Conference on Big Data (Big Data) (pp. 2503–2510). IEEE. Huang, R., Pedoeem, J., & Chen, C. (2018). YOLO-LITE: a real-time object detection algorithm optimized for non-GPU computers. In 2018 IEEE International Conference on Big Data (Big Data) (pp. 2503–2510). IEEE.
18.
Zurück zum Zitat Redmon, J. (2016). Darknet: Open source neural networks in c. Pjreddie. com. Redmon, J. (2016). Darknet: Open source neural networks in c. Pjreddie. com.
19.
Zurück zum Zitat Redmon, J., Divvala, S., Girshick, R. & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779–788). Redmon, J., Divvala, S., Girshick, R. & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779–788).
20.
Zurück zum Zitat Cai, Z. & Vasconcelos, N. (2018). Cascade r-cnn: Delving into high quality object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6154–6162). Cai, Z. & Vasconcelos, N. (2018). Cascade r-cnn: Delving into high quality object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6154–6162).
21.
Zurück zum Zitat He, K., Gkioxari, G., Dollár, P. & Ross, B. (2017). Girshick. Mask R-CNN. In ICCV He, K., Gkioxari, G., Dollár, P. & Ross, B. (2017). Girshick. Mask R-CNN. In ICCV
22.
Zurück zum Zitat Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H. and Wei, Y., 2017. Deformable convolutional networks. In Proceedings of the IEEE international conference on computer vision (pp. 764–773). Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H. and Wei, Y., 2017. Deformable convolutional networks. In Proceedings of the IEEE international conference on computer vision (pp. 764–773).
23.
Zurück zum Zitat Lin, T.Y., Goyal, P., Girshick, R., He, K. & Dollár, P. (2017). Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision (pp. 2980–2988). Lin, T.Y., Goyal, P., Girshick, R., He, K. & Dollár, P. (2017). Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision (pp. 2980–2988).
24.
Zurück zum Zitat Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y. & Berg, A. C. (2016). Ssd: Single shot multibox detector. In European conference on computer vision (pp. 21–37). Springer, Cham. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y. & Berg, A. C. (2016). Ssd: Single shot multibox detector. In European conference on computer vision (pp. 21–37). Springer, Cham.
25.
Zurück zum Zitat Fu, C.Y., Liu, W., Ranga, A., Tyagi, A. & Berg, A.C. (2017). Dssd: Deconvolutional single shot detector. arXiv preprint arXiv:1701.06659. Fu, C.Y., Liu, W., Ranga, A., Tyagi, A. & Berg, A.C. (2017). Dssd: Deconvolutional single shot detector. arXiv preprint arXiv:​1701.​06659.
26.
Zurück zum Zitat Zhou, P., Ni, B., Geng, C., Hu, J. & Xu, Y. (2018). Scale-transferrable object detection. In proceedings of the IEEE conference on computer vision and pattern recognition (pp. 528–537). Zhou, P., Ni, B., Geng, C., Hu, J. & Xu, Y. (2018). Scale-transferrable object detection. In proceedings of the IEEE conference on computer vision and pattern recognition (pp. 528–537).
27.
Zurück zum Zitat Kumar, S., Raja, R., & Gandham, A. (2020). Tracking an Object Using Traditional MS (Mean Shift) and CBWH MS (Mean Shift) Algorithm with Kalman Filter. In Applications of Machine Learning. Kumar, S., Raja, R., & Gandham, A. (2020). Tracking an Object Using Traditional MS (Mean Shift) and CBWH MS (Mean Shift) Algorithm with Kalman Filter. In Applications of Machine Learning.
28.
Zurück zum Zitat Kumar, S., Singh, S., & Kumar, J. (2018). Automatic live facial expression detection using genetic algorithm with haar wavelet features and SVM. Wireless Personal Communications, 103(3), 2423–2453.CrossRef Kumar, S., Singh, S., & Kumar, J. (2018). Automatic live facial expression detection using genetic algorithm with haar wavelet features and SVM. Wireless Personal Communications, 103(3), 2423–2453.CrossRef
29.
Zurück zum Zitat Kumar, S., Singh, S., & Kumar, J. (2018). Live detection of face using machine learning with multi-feature method. Wireless Personal Communications, 103(3), 2233–2375. Kumar, S., Singh, S., & Kumar, J. (2018). Live detection of face using machine learning with multi-feature method. Wireless Personal Communications, 103(3), 2233–2375.
Metadaten
Titel
Deep Learning Based Object Detection Combined with Internet of Things for Remote Surveillance
verfasst von
Aayushi Gautam
Sukhwinder Singh
Publikationsdatum
14.05.2021
Verlag
Springer US
Erschienen in
Wireless Personal Communications / Ausgabe 4/2021
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-021-08071-5

Weitere Artikel der Ausgabe 4/2021

Wireless Personal Communications 4/2021 Zur Ausgabe

Neuer Inhalt