Skip to main content

2018 | OriginalPaper | Buchkapitel

Object Detection to Assist Visually Impaired People: A Deep Neural Network Adventure

verfasst von : Fereshteh S. Bashiri, Eric LaRose, Jonathan C. Badger, Roshan M. D’Souza, Zeyun Yu, Peggy Peissig

Erschienen in: Advances in Visual Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Blindness or vision impairment, one of the top ten disabilities among men and women, targets more than 7 million Americans of all ages. Accessible visual information is of paramount importance to improve independence and safety of blind and visually impaired people, and there is a pressing need to develop smart automated systems to assist their navigation, specifically in unfamiliar healthcare environments, such as clinics, hospitals, and urgent cares. This contribution focused on developing computer vision algorithms composed with a deep neural network to assist visually impaired individual’s mobility in clinical environments by accurately detecting doors, stairs, and signages, the most remarkable landmarks. Quantitative experiments demonstrate that with enough number of training samples, the network recognizes the objects of interest with an accuracy of over 98% within a fraction of a second.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ahmetovic, D., et al.: Achieving practical and accurate indoor navigation for people with visual impairments. In: Proceedings of the 14th Web for All Conference on The Future of Accessible Work, p. 31. ACM (2017) Ahmetovic, D., et al.: Achieving practical and accurate indoor navigation for people with visual impairments. In: Proceedings of the 14th Web for All Conference on The Future of Accessible Work, p. 31. ACM (2017)
2.
Zurück zum Zitat Bashiri, F.S., LaRose, E., Peissig, P., Tafti, A.P.: Mcindoor20000: a fully-labeled image dataset to advance indoor objects detection. Data Brief 17, 71–75 (2018)CrossRef Bashiri, F.S., LaRose, E., Peissig, P., Tafti, A.P.: Mcindoor20000: a fully-labeled image dataset to advance indoor objects detection. Data Brief 17, 71–75 (2018)CrossRef
4.
Zurück zum Zitat BIRCatMCRI: Mcindoor20000. GitHub repository (2017) BIRCatMCRI: Mcindoor20000. GitHub repository (2017)
5.
Zurück zum Zitat Bourne, R.R., et al.: Magnitude, temporal trends, and projections of the global prevalence of blindness and distance and near vision impairment: a systematic review and meta-analysis. Lancet Glob. Health 5(9), e888–e897 (2017)CrossRef Bourne, R.R., et al.: Magnitude, temporal trends, and projections of the global prevalence of blindness and distance and near vision impairment: a systematic review and meta-analysis. Lancet Glob. Health 5(9), e888–e897 (2017)CrossRef
6.
Zurück zum Zitat Erickson, W., Lee, C.G., von Schrader, S.: 2016 disability status reports: United states (2018) Erickson, W., Lee, C.G., von Schrader, S.: 2016 disability status reports: United states (2018)
7.
Zurück zum Zitat Gaudissart, V., Ferreira, S., Thillou, C., Gosselin, B.: Sypole: mobile reading assistant for blind people. In: 9th Conference Speech and Computer (2004) Gaudissart, V., Ferreira, S., Thillou, C., Gosselin, B.: Sypole: mobile reading assistant for blind people. In: 9th Conference Speech and Computer (2004)
8.
Zurück zum Zitat Gupta, D.S.: Architecture of convolutional neural networks (CNNs) demystified (2017) Gupta, D.S.: Architecture of convolutional neural networks (CNNs) demystified (2017)
10.
Zurück zum Zitat Huang, J.: Accelerating AI with GPUs: A New Computing Model (2016) Huang, J.: Accelerating AI with GPUs: A New Computing Model (2016)
11.
Zurück zum Zitat Jabnoun, H., Benzarti, F., Amiri, H.: A new method for text detection and recognition in indoor scene for assisting blind people. In: Ninth International Conference on Machine Vision (ICMV 2016), vol. 10341, p. 1034123. International Society for Optics and Photonics (2017) Jabnoun, H., Benzarti, F., Amiri, H.: A new method for text detection and recognition in indoor scene for assisting blind people. In: Ninth International Conference on Machine Vision (ICMV 2016), vol. 10341, p. 1034123. International Society for Optics and Photonics (2017)
12.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
13.
Zurück zum Zitat Kruthiventi, S.S., Ayush, K., Babu, R.V.: Deepfix: a fully convolutional neural network for predicting human eye fixations. arXiv preprint arXiv:1510.02927 (2015) Kruthiventi, S.S., Ayush, K., Babu, R.V.: Deepfix: a fully convolutional neural network for predicting human eye fixations. arXiv preprint arXiv:​1510.​02927 (2015)
14.
Zurück zum Zitat Lawrence, S., Giles, C.L., Tsoi, A.C., Back, A.D.: Face recognition: a convolutional neural-network approach. IEEE Trans. Neural Netw. 8(1), 98–113 (1997)CrossRef Lawrence, S., Giles, C.L., Tsoi, A.C., Back, A.D.: Face recognition: a convolutional neural-network approach. IEEE Trans. Neural Netw. 8(1), 98–113 (1997)CrossRef
15.
Zurück zum Zitat LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436 (2015)CrossRef LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436 (2015)CrossRef
16.
Zurück zum Zitat LeCun, Y., et al.: Handwritten digit recognition with a back-propagation network. In: Advances in Neural Information Processing Systems, pp. 396–404 (1990) LeCun, Y., et al.: Handwritten digit recognition with a back-propagation network. In: Advances in Neural Information Processing Systems, pp. 396–404 (1990)
17.
Zurück zum Zitat LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
18.
Zurück zum Zitat Manoj, B., Rohini, V.: A novel approach to object detection and distance measurement for visually impaired people. Int. J. Comput. Intell. Res. 13(4), 479–484 (2017) Manoj, B., Rohini, V.: A novel approach to object detection and distance measurement for visually impaired people. Int. J. Comput. Intell. Res. 13(4), 479–484 (2017)
19.
Zurück zum Zitat Mekhalfi, M.L., Melgani, F., Bazi, Y., Alajlan, N.: Fast indoor scene description for blind people with multiresolution random projections. J. Vis. Commun. Image Represent. 44, 95–105 (2017)CrossRef Mekhalfi, M.L., Melgani, F., Bazi, Y., Alajlan, N.: Fast indoor scene description for blind people with multiresolution random projections. J. Vis. Commun. Image Represent. 44, 95–105 (2017)CrossRef
20.
Zurück zum Zitat Srinivas, S., Sarvadevabhatla, R.K., Mopuri, K.R., Prabhu, N., Kruthiventi, S.S., Babu, R.V.: A taxonomy of deep convolutional neural nets for computer vision. Front. Robot. AI 2, 36 (2016)CrossRef Srinivas, S., Sarvadevabhatla, R.K., Mopuri, K.R., Prabhu, N., Kruthiventi, S.S., Babu, R.V.: A taxonomy of deep convolutional neural nets for computer vision. Front. Robot. AI 2, 36 (2016)CrossRef
21.
Zurück zum Zitat Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems, pp. 3104–3112 (2014) Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems, pp. 3104–3112 (2014)
22.
Zurück zum Zitat Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
23.
Zurück zum Zitat Tekin, E., Coughlan, J.M., Shen, H.: Real-time detection and reading of LED/LCD displays for visually impaired persons. In: Proceedings/IEEE Workshop on Applications of Computer Vision. IEEE Workshop on Applications of Computer Vision, p. 491. NIH Public Access (2011) Tekin, E., Coughlan, J.M., Shen, H.: Real-time detection and reading of LED/LCD displays for visually impaired persons. In: Proceedings/IEEE Workshop on Applications of Computer Vision. IEEE Workshop on Applications of Computer Vision, p. 491. NIH Public Access (2011)
24.
Zurück zum Zitat Tekin, E., Vásquez, D., Coughlan, J.M.: SK smartphone barcode reader for the blind. In: Journal on technology and persons with disabilities:... Annual International Technology and Persons with Disabilities Conference, vol. 28, p. 230. NIH Public Access (2013) Tekin, E., Vásquez, D., Coughlan, J.M.: SK smartphone barcode reader for the blind. In: Journal on technology and persons with disabilities:... Annual International Technology and Persons with Disabilities Conference, vol. 28, p. 230. NIH Public Access (2013)
Metadaten
Titel
Object Detection to Assist Visually Impaired People: A Deep Neural Network Adventure
verfasst von
Fereshteh S. Bashiri
Eric LaRose
Jonathan C. Badger
Roshan M. D’Souza
Zeyun Yu
Peggy Peissig
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-03801-4_44