Skip to main content

2024 | OriginalPaper | Buchkapitel

Object Detection and Depth Estimation Using Deep Learning

verfasst von : Rajani Katiyar, Uttara Kumari, Karthik Panagar, Kashinath Patil, B. M. Manjunath, Y. Jeevan Gowda

Erschienen in: Advanced Computing

Verlag: Springer Nature Switzerland

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Detection of an object and depth estimation is very crucial in the field of computer vision, facilitating tasks in the field of autonomous navigation, scene understanding and many more. There are lot challenges in the current existing technique such as occlusion and accuracy issues, impeding their real-world applicability. To surmount these limitations, the proposed work introduces an innovative approach that melds deep learning architectures with efficient computational methods. By fusing advanced object detection models with a sophisticated depth estimation network, the work proposed have achieved substantial enhancements in accuracy and precision. The proposed model pushes the envelope for real-time implementation, contributing to the advancement of object detection and depth estimation capabilities. This approach was augmented with a novel depth estimation technique, extracting diagonal pixel lengths and combining them with actual depths from the dataset. Subsequent analysis employed both linear and polynomial regression, revealing that the polynomial model (98% average accuracy) surpassed the linear model (80.96% accuracy). These findings highlighted the importance of capturing complex non-linear relationships between pixel length and object depth, showcasing YOLOv4’s robust object detection capabilities and emphasizing the significance of intricate depth estimation in visual cues.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Nagarajan, A., Gopinath, M.P.: Hybrid optimization-enabled deep learning for indoor object detection and distance estimation to assist visually impaired persons. Adv. Eng. Softw. 176, 103362 (2023). ISSN 0965–9978CrossRef Nagarajan, A., Gopinath, M.P.: Hybrid optimization-enabled deep learning for indoor object detection and distance estimation to assist visually impaired persons. Adv. Eng. Softw. 176, 103362 (2023). ISSN 0965–9978CrossRef
2.
Zurück zum Zitat Kumar, G.A., Lee, J.H., Hwang, J., Park, J., Youn, S.H., Kwon, S.: Lidar and camera fusion approach for object distance estimation in self-driving vehicles. Symmetry 12(2), 324 (2020)CrossRef Kumar, G.A., Lee, J.H., Hwang, J., Park, J., Youn, S.H., Kwon, S.: Lidar and camera fusion approach for object distance estimation in self-driving vehicles. Symmetry 12(2), 324 (2020)CrossRef
3.
Zurück zum Zitat Usmankhujaev, S., Baydadaev, S., Kwon, J.W.: Accurate 3D to 2D Object Distance Estimation from the Mapped Point Cloud Data. Sensors 23, 2103 (2023)CrossRef Usmankhujaev, S., Baydadaev, S., Kwon, J.W.: Accurate 3D to 2D Object Distance Estimation from the Mapped Point Cloud Data. Sensors 23, 2103 (2023)CrossRef
4.
Zurück zum Zitat Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28 (2015) Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
5.
Zurück zum Zitat Girshick, R.: Fast r-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015) Girshick, R.: Fast r-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
7.
Zurück zum Zitat Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016) Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
8.
Zurück zum Zitat Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017) Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
9.
Zurück zum Zitat Nugraha, B.T., Su, S.-F.: Towards self-driving car using convolutional neural network and road lane detector. In: 2nd International Conference on Automation, Cognitive Science, Optics, Micro Electro-mechanical System, and Information Technology (ICACOMIT), pp. 65–69. IEEE (2017) Nugraha, B.T., Su, S.-F.: Towards self-driving car using convolutional neural network and road lane detector. In: 2nd International Conference on Automation, Cognitive Science, Optics, Micro Electro-mechanical System, and Information Technology (ICACOMIT), pp. 65–69. IEEE (2017)
10.
Zurück zum Zitat Cai, Y., Luan, T., Gao, H., et al.: Yolov4-5d: an effective and efficient object detector for autonomous driving. IEEE Trans. Instrum. Meas. 70, 1–13 (2021) Cai, Y., Luan, T., Gao, H., et al.: Yolov4-5d: an effective and efficient object detector for autonomous driving. IEEE Trans. Instrum. Meas. 70, 1–13 (2021)
11.
Zurück zum Zitat Zaheer, A., Rashid, M., Riaz, M.A., Khan, S.: Single-view reconstruction using orthogonal line-pairs. Comput. Vis. Image Underst. 172, 107–123 (2018)CrossRef Zaheer, A., Rashid, M., Riaz, M.A., Khan, S.: Single-view reconstruction using orthogonal line-pairs. Comput. Vis. Image Underst. 172, 107–123 (2018)CrossRef
12.
Zurück zum Zitat Lee, D.C., Hebert, M., Kanade, T.: Geometric reasoning for single image structure recovery. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2136–2143. IEEE (2009) Lee, D.C., Hebert, M., Kanade, T.: Geometric reasoning for single image structure recovery. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2136–2143. IEEE (2009)
14.
Zurück zum Zitat Criminisi, A., Reid, I., Zisserman, A.: Single view metrology. Int. J. Comput. Vision 40, 123–148 (2000)CrossRef Criminisi, A., Reid, I., Zisserman, A.: Single view metrology. Int. J. Comput. Vision 40, 123–148 (2000)CrossRef
Metadaten
Titel
Object Detection and Depth Estimation Using Deep Learning
verfasst von
Rajani Katiyar
Uttara Kumari
Karthik Panagar
Kashinath Patil
B. M. Manjunath
Y. Jeevan Gowda
Copyright-Jahr
2024
DOI
https://doi.org/10.1007/978-3-031-56700-1_5

Premium Partner