
26.01.2024

Enhancing Autonomous Driving By Exploiting Thermal Object Detection Through Feature Fusion

Authors: Moataz Eltahan, Khaled Elsayed

Published in: International Journal of Intelligent Transportation Systems Research | Issue 1/2024

Abstract

Autonomous driving does not necessarily mean self-driving vehicles; self-driving vehicles are the final evolutionary stage of autonomous driving. Autonomous driving aims to add benefits and features to the vehicle, ranging from driver-assistance technologies to automated driving systems. Object detection is a fundamental pillar of the autonomous vehicle industry. Thermal imaging has recently been playing an increasingly essential role in autonomous driving, and including a thermal camera in a vehicle's sensor suite has become more and more popular as a way to increase reliability for robust and safe operation. Thermal imaging senses the infrared radiation (heat) emitted by objects, so a thermal camera can outperform other sensors, such as LiDAR, radar, and RGB cameras, in adverse lighting conditions. In this work, we study object detection in thermal imaging. We propose the Multi-Domain Feature Fusion Thermal Detector (MDFFTDet) framework, which fuses thermal-domain weights with RGB-domain weights using only thermal images from the FLIR ADAS dataset. The framework inherits the EfficientDet architecture and adds a technique for fusing features from two different modalities, thermal and RGB. We use the UNIT GAN for image-to-image translation to generate pseudo-RGB images from the input thermal images. The proposed framework tackles the lack of large datasets in the thermal domain and improves detection accuracy. Experimental results on the FLIR ADAS dataset show that the proposed framework outperforms other state-of-the-art thermal-imagery object detectors in detection accuracy while using far fewer parameters and FLOPs. Two variants, MDFFTDet-D0 and MDFFTDet-D2, are proposed, achieving mAP of 63.15% and 77.81%, respectively, compared with 59.4% and 69.12% for the EfficientDet-D0 and EfficientDet-D2 baselines.
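To make the two-branch idea described above concrete, the following is a minimal PyTorch sketch of one possible fusion step, assuming the thermal and pseudo-RGB backbones emit feature maps of equal shape at each pyramid level. The class name FusionBlock, the 1x1 projection convolution, and the dummy tensor shapes are illustrative assumptions for this sketch only and do not reflect the authors' actual MDFFTDet implementation.

import torch
import torch.nn as nn

class FusionBlock(nn.Module):
    """Fuse a thermal feature map with a pseudo-RGB feature map of the same shape."""
    def __init__(self, channels: int):
        super().__init__()
        # Concatenate the two modalities along the channel axis, then project
        # back to the original channel width with a 1x1 convolution.
        self.proj = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, thermal_feat: torch.Tensor, rgb_feat: torch.Tensor) -> torch.Tensor:
        fused = torch.cat([thermal_feat, rgb_feat], dim=1)
        return self.proj(fused)

if __name__ == "__main__":
    # Dummy feature maps standing in for one pyramid level of each backbone.
    thermal_feat = torch.randn(1, 64, 80, 80)      # thermal branch
    pseudo_rgb_feat = torch.randn(1, 64, 80, 80)   # UNIT-translated pseudo-RGB branch
    fused = FusionBlock(64)(thermal_feat, pseudo_rgb_feat)
    print(fused.shape)  # torch.Size([1, 64, 80, 80]), passed on to the detection head

The fused map keeps the shape expected by the downstream feature pyramid and detection head, which is why a concatenate-then-project block is a common, lightweight choice for this kind of two-modality fusion.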

Metadata
Title
Enhancing Autonomous Driving By Exploiting Thermal Object Detection Through Feature Fusion
Authors
Moataz Eltahan
Khaled Elsayed
Publication date
26.01.2024
Publisher
Springer US
Published in
International Journal of Intelligent Transportation Systems Research / Issue 1/2024
Print ISSN: 1348-8503
Electronic ISSN: 1868-8659
DOI
https://doi.org/10.1007/s13177-024-00385-5
