2021 | OriginalPaper | Book chapter

Kinect-Based Outdoor Navigation for the Visually Challenged Using Deep Learning

Written by: Anand Subramanian, N. Venkateswaran, W. Jino Hans

Published in: Advances in Machine Learning and Computational Intelligence

Publisher: Springer Singapore


Abstract

In this paper, we propose an outdoor navigation system for people with visual impairments. The system uses a Microsoft Kinect reconfigured for mobile use with a portable power supply. An object detection model was trained to detect obstacles commonly found on roads, namely cars, pedestrians, bicycles and motorcycles, from the Kinect's input frames. To select an object detection model suited to an embedded environment, we carried out extensive training, benchmarking and experimentation on three Single Shot Detector (SSD) models with different feature extractors and a RetinaNet model, and applied quantization techniques to obtain real-time operation with relatively minor losses in detection performance. The detections from the network are combined with the depth map from the Kinect to calculate the distance between the user and each detected object, and this information is relayed to the user by a text-to-speech system through Bluetooth earphones paired to the system. The entire setup is mounted on a white cane, where a Raspberry Pi 3B connected to the Kinect reads the input frames and performs onboard processing. Tests of the model on outdoor footage indicate its viability as a tool for outdoor navigation.
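
The abstract does not say how a detection and the depth map are combined into a single distance, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes the depth map is a 2D grid of millimetre readings in which 0 marks an invalid pixel, that a detection is an axis-aligned pixel box, and that the median of the valid readings inside the box is a reasonable distance estimate. The function names `estimate_distance_m` and `describe`, and the wording of the spoken message, are hypothetical.

```python
from statistics import median


def estimate_distance_m(depth_mm, box):
    """Return the median depth in metres inside a bounding box, or None.

    depth_mm: 2D list of depth readings in millimetres; 0 is treated as an
              invalid reading (assumption, not taken from the paper).
    box: (x_min, y_min, x_max, y_max) in pixels, max edges exclusive
         (assumption).
    """
    x_min, y_min, x_max, y_max = box
    height = len(depth_mm)
    width = len(depth_mm[0]) if height else 0

    # Clamp the box to the image so partially visible objects still work.
    x_min, x_max = max(0, x_min), min(width, x_max)
    y_min, y_max = max(0, y_min), min(height, y_max)

    # Collect only valid (non-zero) readings inside the box.
    valid = [
        depth_mm[y][x]
        for y in range(y_min, y_max)
        for x in range(x_min, x_max)
        if depth_mm[y][x] > 0
    ]
    if not valid:
        return None
    return median(valid) / 1000.0


def describe(label, distance_m):
    """Build the text that would be handed to a text-to-speech engine."""
    if distance_m is None:
        return f"{label} ahead, distance unknown"
    return f"{label} ahead, {distance_m:.1f} metres"


if __name__ == "__main__":
    # Toy 4x4 depth map in millimetres; zeros are invalid pixels.
    depth = [
        [0, 0, 0, 0],
        [0, 2000, 2100, 0],
        [0, 1900, 0, 0],
        [0, 0, 0, 5000],
    ]
    print(describe("car", estimate_distance_m(depth, (1, 1, 3, 3))))
```

The median is used here rather than the mean because background pixels that fall inside a bounding box would otherwise pull the estimate away from the object; other choices, such as a low percentile, would be equally defensible.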



Title: Kinect-Based Outdoor Navigation for the Visually Challenged Using Deep Learning
Written by: Anand Subramanian, N. Venkateswaran, W. Jino Hans
Publisher: Springer Singapore
Book: Advances in Machine Learning and Computational Intelligence
Print ISBN: 978-981-15-5242-7
Electronic ISBN: 978-981-15-5243-4
Copyright year: 2021
DOI: https://doi.org/10.1007/978-981-15-5243-4_32
