Published in: International Journal of Machine Learning and Cybernetics 9/2019

19.11.2018 | Original Article

Depth estimation from infrared video using local-feature-flow neural network

Authors: Shouchuan Wu, Haitao Zhao, Shaoyuan Sun


Abstract

Depth estimation is essential for infrared video processing. In this paper, a novel depth estimation method, called the local-feature-flow neural network (LFFNN), is proposed for generating a depth map for each frame of an infrared video. LFFNN extracts the local features of a frame together with inter-frame features, which are extracted from the corresponding regions of previous frames in the video. LFFNN is designed to extract the local feature flow of the infrared video, learning better depth-related features by propagating inter-frame features through three control gates as the video progresses. After feature extraction, a pixel-level classifier estimates the depth level of each pixel in the infrared video. The proposed approach achieves state-of-the-art depth estimation performance on the test dataset.
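The abstract describes gated propagation of inter-frame features: each frame's local features are fused with features carried over from the previous frame, with three control gates regulating the flow. The paper's exact gate formulation is not given here, so the following is only a minimal GRU-style sketch of the idea; the class name, gate names, and dimensions are illustrative assumptions, not the authors' definitions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class FeatureFlowCell:
    """Hypothetical sketch of a cell that fuses a frame's local features
    with features propagated from the previous frame, regulated by three
    gates (reset, update, output) in the spirit of the paper's three
    control gates. Gate names and weights are assumptions."""

    def __init__(self, dim, seed=0):
        rng = np.random.default_rng(seed)
        # one weight matrix per gate, acting on [current; previous] features
        self.Wr = rng.standard_normal((dim, 2 * dim)) * 0.1
        self.Wz = rng.standard_normal((dim, 2 * dim)) * 0.1
        self.Wo = rng.standard_normal((dim, 2 * dim)) * 0.1
        self.Wh = rng.standard_normal((dim, 2 * dim)) * 0.1

    def step(self, x_t, h_prev):
        xh = np.concatenate([x_t, h_prev])
        r = sigmoid(self.Wr @ xh)   # reset: how much history to reuse
        z = sigmoid(self.Wz @ xh)   # update: blend old vs candidate features
        o = sigmoid(self.Wo @ xh)   # output: scale the emitted features
        cand = np.tanh(self.Wh @ np.concatenate([x_t, r * h_prev]))
        h_t = z * h_prev + (1.0 - z) * cand
        return o * h_t, h_t

# Propagate features across a short sequence of frames.
dim = 8
cell = FeatureFlowCell(dim)
h = np.zeros(dim)
for t in range(5):
    x_t = np.full(dim, 0.1 * t)  # stand-in for one region's local features
    y, h = cell.step(x_t, h)
print(y.shape)  # (8,)
```

In the paper's setting, `x_t` would be the local features extracted from a region of the current infrared frame, and the emitted features `y` would feed the pixel-level depth classifier; here both are placeholders.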

Metadata
Title
Depth estimation from infrared video using local-feature-flow neural network
Authors
Shouchuan Wu
Haitao Zhao
Shaoyuan Sun
Publication date
19.11.2018
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 9/2019
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-018-0891-9
