Skip to main content

2021 | OriginalPaper | Buchkapitel

Kinect-Based Outdoor Navigation for the Visually Challenged Using Deep Learning

verfasst von : Anand Subramanian, N. Venkateswaran, W. Jino Hans

Erschienen in: Advances in Machine Learning and Computational Intelligence

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we propose an outdoor navigation system, intended for people with visual impairments. Our system makes use of a Microsoft Kinect which is reconfigured for mobile use with a portable power supply. An object detection model was trained to detect commonly found obstacles on roads, namely cars, pedestrians, bicycles and motorcycles, based on the inputs from the Kinect. In the process, we select an optimal object detection model for an embedded environment by carrying out extensive training, benchmarking and experimentation on three single shot detection models (SSD) with different feature extractors and a RetinaNet model, while also applying quantization techniques to obtain real-time performance with relatively minor losses in performance. The detections from the network are leveraged to calculate the distance between the person and the object detected, using the depth map from the Kinect, and the information is relayed to the user using a text-to-speech system, through Bluetooth earphones paired to the system. The entire setup is constructed on a white cane, where a Raspberry Pi 3B is connected to the Kinect for reading the input frames and performing onboard processing. The results of testing the model in outdoor footage indicate its viability as a tool for outdoor navigation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat K. Kumar, B. Champaty, K, Uvanesh, R. Chachan, K. Pal, A. Anis, Development of an Ultrasonic Cane as a Navigation Aid for the Blind People, in 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT) (2014), pp. 475–479 K. Kumar, B. Champaty, K, Uvanesh, R. Chachan, K. Pal, A. Anis, Development of an Ultrasonic Cane as a Navigation Aid for the Blind People, in 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT) (2014), pp. 475–479
2.
Zurück zum Zitat M. Nassih, I., Cherradi, Y. Maghous, B. Ouriaghli, Y. Salih-Alj, Obstacles Recognition System for the Blind People Using RFID, IN 2012 Sixth International Conference on Next Generation Mobile Applications, Services and Technologies (2012), pp. 60–63 M. Nassih, I., Cherradi, Y. Maghous, B. Ouriaghli, Y. Salih-Alj, Obstacles Recognition System for the Blind People Using RFID, IN 2012 Sixth International Conference on Next Generation Mobile Applications, Services and Technologies (2012), pp. 60–63
4.
Zurück zum Zitat V. Filipe, F. Fernandes, H. Fernandes, A. Sousa, H. Paredes, J. Barroso, Blind navigation support system based on microsoft kinect. Procedia Comput. Sci. 14, 94–101 (2012)CrossRef V. Filipe, F. Fernandes, H. Fernandes, A. Sousa, H. Paredes, J. Barroso, Blind navigation support system based on microsoft kinect. Procedia Comput. Sci. 14, 94–101 (2012)CrossRef
5.
Zurück zum Zitat H. Takizawa, S. Yamaguchi, M. Aoyagi, N. Ezaki, S. Mizuno, Kinect Cane: Object Recognition Aids for the Visually Impaired, in 2013 6th International Conference on Human System Interactions (HSI) (2013), pp. 473–478 H. Takizawa, S. Yamaguchi, M. Aoyagi, N. Ezaki, S. Mizuno, Kinect Cane: Object Recognition Aids for the Visually Impaired, in 2013 6th International Conference on Human System Interactions (HSI) (2013), pp. 473–478
6.
Zurück zum Zitat A. Ali, M.A. Ali, Blind Navigation System for Visually Impaired Using Windowing-Based Mean on Microsoft Kinect Camera, in 2017 Fourth International Conference on Advances in Biomedical Engineering (ICABME) (2017), pp. 1–4 A. Ali, M.A. Ali, Blind Navigation System for Visually Impaired Using Windowing-Based Mean on Microsoft Kinect Camera, in 2017 Fourth International Conference on Advances in Biomedical Engineering (ICABME) (2017), pp. 1–4
8.
Zurück zum Zitat W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, SSD: Single Shot MultiBox detector, in Computer Vision (ECCV 2016). Springer International Publishing, Berlin (2016), pp. 21–37 W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, SSD: Single Shot MultiBox detector, in Computer Vision (ECCV 2016). Springer International Publishing, Berlin (2016), pp. 21–37
9.
Zurück zum Zitat A.G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, H. Adam, MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications (2017). arXiv preprint: arXiv:1704.04861 A.G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, H. Adam, MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications (2017). arXiv preprint: arXiv:1704.04861
10.
Zurück zum Zitat P. Jin, V. Rathod, X. Zhu, Pooling Pyramid Network for Object Detection (2018). arXiv preprint: arXiv: 1807.03284 P. Jin, V. Rathod, X. Zhu, Pooling Pyramid Network for Object Detection (2018). arXiv preprint: arXiv: 1807.03284
11.
Zurück zum Zitat M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, L.C. Chen, Mobilenetv2: Inverted Residuals and Linear Bottlenecks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018), pp. 4510–4520 M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, L.C. Chen, Mobilenetv2: Inverted Residuals and Linear Bottlenecks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018), pp. 4510–4520
12.
Zurück zum Zitat C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, Going Deeper with Convolutions, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015), pp. 1–9 C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, Going Deeper with Convolutions, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015), pp. 1–9
13.
Zurück zum Zitat L. Tsung-Yi, P. Goyal, R. Girshick, K. He, P. Dollár, Focal Loss for Dense Object Detection, in IEEE International Conference on Computer Vision (ICCV 2017) (2017), pp. 2999–3007 L. Tsung-Yi, P. Goyal, R. Girshick, K. He, P. Dollár, Focal Loss for Dense Object Detection, in IEEE International Conference on Computer Vision (ICCV 2017) (2017), pp. 2999–3007
15.
Zurück zum Zitat J. Huang, V. Rathod, C. Sun, M. Zhu, A. Korattikara, A. Fathi, I. Fischer, Z. Wojna, Y. Song, S. Guadarrama, Others: Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 7310–7311 J. Huang, V. Rathod, C. Sun, M. Zhu, A. Korattikara, A. Fathi, I. Fischer, Z. Wojna, Y. Song, S. Guadarrama, Others: Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 7310–7311
16.
Zurück zum Zitat F. Yu, W. Xian, Y. Chen, F. Liu, M. Liao, V. Madhavan, T. Darrell, BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling (2018). arXiv preprint: arXiv:1805.04687 F. Yu, W. Xian, Y. Chen, F. Liu, M. Liao, V. Madhavan, T. Darrell, BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling (2018). arXiv preprint: arXiv:1805.04687
Metadaten
Titel
Kinect-Based Outdoor Navigation for the Visually Challenged Using Deep Learning
verfasst von
Anand Subramanian
N. Venkateswaran
W. Jino Hans
Copyright-Jahr
2021
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-5243-4_32

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.