nach oben

International Journal of Machine Learning and Cybernetics

Erschienen in:

04.07.2020 | Original Article

Discrete convolutional CRF networks for depth estimation from monocular infrared images

verfasst von: Qianqian Wang, Haitao Zhao, Zhengwei Hu, Yuru Chen, Yuqi Li

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 1/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Predicting the depth of a scene from monocular infrared images, which plays a crucial role in understanding three-dimensional structures, is one of the challenging tasks in machine learning and computer vision. Considering the lack of texture and color information in infrared images, a novel discrete convolutional conditional random field network is proposed for depth estimation. The proposed method inherits several merits of conditional random fields and deep learning. First, the pairwise features are automatically extracted and optimized through deep architectures. Second, the monocular-images-based depth regression is converted into a multi-class classification, in which the order information of different levels of depths is considered in the loss function. Our experiments demonstrate that this conversion achieves much higher accuracy and faster conversion. Third, to obtain fine-grained level details, we have further proposed a multi-scale discrete convolutional conditional random field network that computes the pairwise features of the discrete conditional random field at different spatial levels. Extensive experiments on the infrared image dataset NUSTMS demonstrate that the proposed method outperforms other depth estimation methods. Specifically, for the proposed method, the mean relative error is 0.181, the mean log10 error is 0.072, and the accuracy with a threshold (t = 1.25³) is 95.3%.

Vorheriger Artikel A weighted exponential discriminant analysis through side-information for face and kinship verification using statistical binarized image features

Nächster Artikel A hybrid grey wolf optimizer for solving the product knapsack problem

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

ATZelectronics worldwide

ATZlectronics worldwide is up-to-speed on new trends and developments in automotive electronics on a scientific level with a high depth of information.

Order your 30-days-trial for free and without any commitment.

Jetzt informieren

ATZelektronik

Die Fachzeitschrift ATZelektronik bietet für Entwickler und Entscheider in der Automobil- und Zulieferindustrie qualitativ hochwertige und fundierte Informationen aus dem gesamten Spektrum der Pkw- und Nutzfahrzeug-Elektronik.

Lassen Sie sich jetzt unverbindlich 2 kostenlose Ausgabe zusenden.

Jetzt informieren

Eigen D, Puhrsch C, Fergus R et al (2014) Depth map prediction from a single image using a multi-scale deep network. Adv Neural Inf Process Syst 2014:2366–2374

Eigen D, Fergus R (2015) Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. Int Conf Comput Vision 2015:2650–2658

Martínez Torres J, Iglesias Comesaña C, García-Nieto PJ (2019) Review: machine learning techniques applied to cybersecurity. Int J Mach Learn Cybernet 10(10):2823–2836CrossRef

Saxena A, Chung SH, Ng AY (2005) Learning depth from single monocular images. Adv Neural Inf Process Syst 2005:1161–1168

Saxena A, Sun M, Ng AY (2009) Makethree-dimensional: learning three-dimensional scene structure from a single still image. IEEE Trans Pattern Anal Mach Intell 31(5):824–840CrossRef

Liu B, Gould S, Koller D (2010) Single image depth estimation from predicted semantic labels. Proc IEEE Conf Comput Vision Pattern Recognit 2010:1253–1260

Liu M, Salzmann M, He X (2014) Discrete-continuous depth estimation from a single image. Proc IEEE Conf Comput Vision Pattern Recognit 2014:716–723

Zheng S, Jayasumana S, Romera-Paredes B et al (2016) Conditional random fields as recurrent neural networks. IEEE Int Conf Comput Vision 2016:1529–1537

Zhao H, Shi J, Qi X et al (2017) Pyramid scene parsing network. Proc IEEE Conf Comput Vision Pattern Recognit 2017:2881–2890

10.

Farabet C, Couprie C, Najman L et al (2013) Learning hierarchical features for scene labeling. IEEE Trans Pattern Anal Mach Intell 35(8):1915–1929CrossRef

11.

Li NB, Shen NC, Dai NY et al (2015) Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs. Proc IEEE Conf Comput Vision Pattern Recognit 2015:1119–1127

12.

Liu F, Shen C, Lin G (2015) Deep convolutional neural fields for depth estimation from a single image. Proc IEEE Conf Comput Vision Pattern Recognit 2015:5162–5170

13.

Liu F, Shen C, Lin G et al (2015) Learning depth from single monocular images using deep convolutional neural fields. IEEE Trans Pattern Anal Mach Intell 38(10):2024–2039CrossRef

14.

Cao Y, Wu Z, Shen C (2018) Estimating depth from monocular images as classification using deep fully convolutional residual networks. IEEE Trans Circuits Syst Video Technol 2018:3174–3182CrossRef

15.

Ibarra-Castanedo C et al (2004) Infrared image processing and data analysis. Infrared Phys Technol 46(1):75–83CrossRef

16.

Krähenbühl P, Koltun V (2011) Efficient inference in fully connected CRFs with Gaussian edge potentials. Adv Neural Inf Process Syst 24:109–117

17.

Noh H, Hong S, Han B (2015) Learning deconvolution network for semantic segmentation. IEEE Int Conf Comput Vision 2015:1520–1528

18.

Xu D, Ricci E, Ouyang W et al (2017) Multi-scale continuous CRFs as sequential deep networks for monocular depth estimation. Proc IEEE Conf Comput Vision Pattern Recognit 2017:161–169

19.

Wu S, Zhao H, Sun S (2019) Depth estimation from infrared video using local-feature-flow neural network. Int J Mach Learn Cybernet 10(9):2563–2572CrossRef

20.

Xu D, Ouyang W, Wang X, Sebe N (2018) Pad-net: Multitasks guided prediciton-and-distillation network for simultaneous depth estimation and scene parsing. arXiv preprint arXiv:1805.04409

21.

Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

22.

Szegedy C, Liu W, Jia Y et al (2015) Going deeper with convolutions. Proc IEEE Conf Comput Vision Pattern Recognit 2015:1–9

23.

He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. Proc IEEE Conf Comput Vision Pattern Recognit 2016:770–778

24.

Szegedy C, Ioffe S, Vanhoucke V et al (2017) Inception-v4, inception-ResNet and the impact of residual connections on learning. 31st AAAI conference on artificial intelligence 2017:4278–4284

25.

Gu TT, Zhao HT, Sun SY (2018) Depth estimation of infrared image based on pyramid residual neural networks. Infrared Technol 40(5)

26.

Godard C, Mac Aodha O, Brostow GJ (2017) Unsupervised monocular depth estimation with left-right consistency. Proc IEEE Conf Comput Vision Pattern Recognit 2017:6602–6611

27.

Kundu JN, Uppala PK, Pahuja A et al (2018) AdaDepth: unsupervised content congruent adaptation for depth estimation. Proc IEEE Conf Comput Vision Pattern Recognit 2018:2656–2665

28.

Pilzer A, Xu D, Puscas MM et al (2018) Unsupervised adversarial depth estimation using cycled generative networks. Int Conf Three-dimens Vision 2018:587–595

29.

Kuznietsov Y, Stückler J, Leibe B (2017) Semi-supervised deep learning for monocular depth map prediction. Proc IEEE Conf Comput Vision Pattern Recognit 2017:2215–2223

30.

Fu H, Gong M, Wang C et al (2018) Deep ordinal regression network for monocular depth estimation. Proc IEEE Conf Comput Vision Pattern Recognit 2018:2002–2011

31.

Srivastava N, Hinton G, Krizhevsky A et al (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15:1929–1958MathSciNetMATH

32.

Viswanathan R (1993) A note on distributed estimation and sufficiency. IEEE Trans Inf Theory 39(5):1765–1767CrossRef

Titel: Discrete convolutional CRF networks for depth estimation from monocular infrared images
verfasst von: Qianqian Wang
Haitao Zhao
Zhengwei Hu
Yuru Chen
Yuqi Li
Publikationsdatum: 04.07.2020
Verlag: Springer Berlin Heidelberg
Erschienen in: International Journal of Machine Learning and Cybernetics / Ausgabe 1/2021
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI: https://doi.org/10.1007/s13042-020-01164-w

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Jonas Klose/© Pine Valley Capital GmbH, Carina Kießling von der Strategieberatung Roland Berger/© Monika Walther Fotografie | ATZ, Beijing Auto Show 2024: Deutsche Hersteller wollen angreifen./© EKH-Pictures / Generated with AI / Stock.adobe.com, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

ATZelectronics worldwide

ATZelektronik

Weitere Artikel der Ausgabe 1/2021

Clinical quantitative information recognition and entity-quantity association from Chinese electronic medical records

Causative label flip attack detection with data complexity measures

Multi-objective unit commitment optimization with ultra-low emissions under stochastic and fuzzy uncertainties

A further study on biologically inspired feature enhancement in zero-shot learning

A hybrid grey wolf optimizer for solving the product knapsack problem

Multi-agent reinforcement learning for redundant robot control in task-space

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.