Skip to main content
Top

2018 | OriginalPaper | Chapter

Scene Recognition for Indoor Localization of Mobile Robots Using Deep CNN

Authors : Piotr Wozniak, Hadha Afrisal, Rigel Galindo Esparza, Bogdan Kwolek

Published in: Computer Vision and Graphics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper we propose a deep neural network based algorithm for indoor place recognition. It uses transfer learning to retrain VGG-F, a pretrained convolutional neural network to classify places on images acquired by a humanoid robot. The network has been trained as well as evaluated on a dataset consisting of 8000 images, which were recorded in sixteen rooms. The dataset is freely accessed from our website. We demonstrated experimentally that the proposed algorithm considerably outperforms BoW algorithms, which are frequently used in loop-closure. It also outperforms an algorithm in which features extracted by FC-6 layer of the VGG-F are classified by a linear SVM.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Arroyo, R., Alcantarilla, P., Bergasa, L., Romera, E.: OpenABLE: an open-source toolbox for application in life-long visual localization of autonomous vehicles. In: IEEE International Conference on Intelligent Transportation Systems, pp. 965–970 (2016) Arroyo, R., Alcantarilla, P., Bergasa, L., Romera, E.: OpenABLE: an open-source toolbox for application in life-long visual localization of autonomous vehicles. In: IEEE International Conference on Intelligent Transportation Systems, pp. 965–970 (2016)
2.
go back to reference Bay, H., Tuytelaars, T., Van Gool, L.: SURF: Speeded up robust features. Eur. Conf. Comput. Vis. 3951, 404–417 (2006) Bay, H., Tuytelaars, T., Van Gool, L.: SURF: Speeded up robust features. Eur. Conf. Comput. Vis. 3951, 404–417 (2006)
3.
go back to reference Cadena, C., et al.: Past, present, and future of simultaneous localization and mapping: toward the robust-perception age. IEEE Trans. Robot. 32(6), 1309–1332 (2016)CrossRef Cadena, C., et al.: Past, present, and future of simultaneous localization and mapping: toward the robust-perception age. IEEE Trans. Robot. 32(6), 1309–1332 (2016)CrossRef
4.
go back to reference Chatfield, K., Lempitsky, V.S., Vedaldi, A., Zisserman, A.: The devil is in the details: an evaluation of recent feature encoding methods. In: British Machine Vision Conference (BMVC) (2011) Chatfield, K., Lempitsky, V.S., Vedaldi, A., Zisserman, A.: The devil is in the details: an evaluation of recent feature encoding methods. In: British Machine Vision Conference (BMVC) (2011)
6.
go back to reference Cummins, M., Newman, P.: FAB-MAP: probabilistic localization and mapping in the space of appearance. Int. J. Rob. Res. 27(6), 647–665 (2008)CrossRef Cummins, M., Newman, P.: FAB-MAP: probabilistic localization and mapping in the space of appearance. Int. J. Rob. Res. 27(6), 647–665 (2008)CrossRef
7.
go back to reference Galvez-Lopez, D., Tardos, T.: Bags of binary words for fast place recognition in image sequences. IEEE Trans. Robot. 28, 1188–1197 (2012)CrossRef Galvez-Lopez, D., Tardos, T.: Bags of binary words for fast place recognition in image sequences. IEEE Trans. Robot. 28, 1188–1197 (2012)CrossRef
8.
go back to reference Garcia-Fidalgo, E., Ortiz, A.: Vision-based topological mapping and localization by means of local invariant features and map refinement. Robotica 33, 1446–1470 (2014)CrossRef Garcia-Fidalgo, E., Ortiz, A.: Vision-based topological mapping and localization by means of local invariant features and map refinement. Robotica 33, 1446–1470 (2014)CrossRef
9.
go back to reference Harris, C., Stephens, M.: A combined corner and edge detector. Alvey Vis. Conf. 15, 10–5244 (1988) Harris, C., Stephens, M.: A combined corner and edge detector. Alvey Vis. Conf. 15, 10–5244 (1988)
10.
go back to reference Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Processing Systems, pp. 1097–1105 (2012)
11.
go back to reference Kuindersma, S., et al.: Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot. Adv. Neural Proc. Syst. 40, 429–455 (2016) Kuindersma, S., et al.: Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot. Adv. Neural Proc. Syst. 40, 429–455 (2016)
12.
go back to reference Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: International Conference on Computer Vision (ICCV) (2011) Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: International Conference on Computer Vision (ICCV) (2011)
13.
go back to reference Levitt, T., Lawton, D.: Qualitative navigation for mobile robots. Artif. Intell. 44(3), 305–360 (1990)CrossRef Levitt, T., Lawton, D.: Qualitative navigation for mobile robots. Artif. Intell. 44(3), 305–360 (1990)CrossRef
14.
go back to reference Li, Q., Li, K., You, X., Bu, S., Liu, Z.: Place recognition based on deep feature and adaptive weighting of similarity matrix. Neurocomputing 199, 114–127 (2016)CrossRef Li, Q., Li, K., You, X., Bu, S., Liu, Z.: Place recognition based on deep feature and adaptive weighting of similarity matrix. Neurocomputing 199, 114–127 (2016)CrossRef
15.
16.
go back to reference Lowry, S., et al.: Visual place recognition: a survey. IEEE Trans. Robot. 32, 1–19 (2016)CrossRef Lowry, S., et al.: Visual place recognition: a survey. IEEE Trans. Robot. 32, 1–19 (2016)CrossRef
17.
go back to reference Newman, P., Ho, K.: SLAM-loop closing with visually salient features. In: Proceedings of IEEE International Conference on Robotics and Automation, pp. 635–642 (2005) Newman, P., Ho, K.: SLAM-loop closing with visually salient features. In: Proceedings of IEEE International Conference on Robotics and Automation, pp. 635–642 (2005)
18.
go back to reference Oliva, A., Torralba, A.: Building the gist of a scene: the role of global image features in recognition. In: Visual Perception, Progress in Brain Research, vol. 155, pp. 23–36. Elsevier (2006) Oliva, A., Torralba, A.: Building the gist of a scene: the role of global image features in recognition. In: Visual Perception, Progress in Brain Research, vol. 155, pp. 23–36. Elsevier (2006)
19.
go back to reference Oriolo, G., Paolillo, A., Rosa, L., Vendittelli, M.: Humanoid odometric localization integrating kinematic, inertial and visual information. Auton. Robots 40, 867–879 (2016)CrossRef Oriolo, G., Paolillo, A., Rosa, L., Vendittelli, M.: Humanoid odometric localization integrating kinematic, inertial and visual information. Auton. Robots 40, 867–879 (2016)CrossRef
20.
go back to reference Radford, N., et al.: Valkryrie: NASA’s first bipedal humanoid robot. J. Field Robot. 32, 397–419 (2015)CrossRef Radford, N., et al.: Valkryrie: NASA’s first bipedal humanoid robot. J. Field Robot. 32, 397–419 (2015)CrossRef
21.
go back to reference Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision (ICCV), vol. 32 (2011) Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision (ICCV), vol. 32 (2011)
22.
go back to reference Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)MathSciNetCrossRef Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)MathSciNetCrossRef
23.
go back to reference Sahdev, R., Tsotsos, J.: Indoor place recognition system for localization of mobile robots. In: IEEE Conference on Computer and Robot Vision, pp. 53–60 (2016) Sahdev, R., Tsotsos, J.: Indoor place recognition system for localization of mobile robots. In: IEEE Conference on Computer and Robot Vision, pp. 53–60 (2016)
24.
go back to reference Schönberger, J., Hardmeier, H., Sattler, T., Pollefeys, M.: Comparative evaluation of hand-crafted and learned local features. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6959–6968 (2017) Schönberger, J., Hardmeier, H., Sattler, T., Pollefeys, M.: Comparative evaluation of hand-crafted and learned local features. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6959–6968 (2017)
25.
go back to reference Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. In: International Conference on Document Analysis and Recognition, pp. 958–963 (2003) Simard, P., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. In: International Conference on Document Analysis and Recognition, pp. 958–963 (2003)
26.
go back to reference Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014)
27.
go back to reference Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their location in images. In: IEEE International Conference on Computer Vision, vol. 1, pp. 370–377 (2005) Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their location in images. In: IEEE International Conference on Computer Vision, vol. 1, pp. 370–377 (2005)
28.
go back to reference Sünderhauf, N., Protzel, P.: BRIEF-Gist - closing the loop by simple means. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 1234–1241 (2011) Sünderhauf, N., Protzel, P.: BRIEF-Gist - closing the loop by simple means. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 1234–1241 (2011)
29.
go back to reference Sünderhauf, N., et al.: Place recognition with convNet landmarks: viewpoint-robust, condition-robust, training-free. In: Proceedings of Robotics: Science and Systems XII (2015) Sünderhauf, N., et al.: Place recognition with convNet landmarks: viewpoint-robust, condition-robust, training-free. In: Proceedings of Robotics: Science and Systems XII (2015)
30.
go back to reference Tai, L., Liu, M.: Deep-learning in mobile robotics - from perception to control systems: a survey on why and why not. arXiv (2016) Tai, L., Liu, M.: Deep-learning in mobile robotics - from perception to control systems: a survey on why and why not. arXiv (2016)
31.
go back to reference Torii, A., Sivic, J., Pajdla, T., Okutomi, M.: Visual place recognition with repetitive structures. In: Proceedings of the IEEE Conference on Computer, Vision and Pattern Recognition (2013) Torii, A., Sivic, J., Pajdla, T., Okutomi, M.: Visual place recognition with repetitive structures. In: Proceedings of the IEEE Conference on Computer, Vision and Pattern Recognition (2013)
32.
go back to reference Wang, Z., Wu, F., Hu, Z.: MSLD: a robust descriptor for line matching. Pattern Recogn. 42, 941–953 (2009)CrossRef Wang, Z., Wu, F., Hu, Z.: MSLD: a robust descriptor for line matching. Pattern Recogn. 42, 941–953 (2009)CrossRef
Metadata
Title
Scene Recognition for Indoor Localization of Mobile Robots Using Deep CNN
Authors
Piotr Wozniak
Hadha Afrisal
Rigel Galindo Esparza
Bogdan Kwolek
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-00692-1_13

Premium Partner