Skip to main content
Log in

Local optimized and scalable frame-to-model SLAM

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In recent years, dense visual SLAM (Simultaneous Localization And Mapping) has been proved that it has lots of advantages on the accuracy of pose estimation compared to sparse features due to exploiting the more available information in image data. In this paper, we propose a scalable dense frame-to-model SLAM system based on KinectFusion algorithm. We design an effective active volume shift strategy to extend the tracking range without increment on storage resource. Furthermore, we propose three local optimization methods including motion predicting, weighted ICP (Iterative Closest Point) and multi-modelframe combined estimation to improve the tracking stability and decrease the accumulative error. Note that, as the basis of on-line augmented reality application, our work focus on local optimization rather than global closing loop optimization which is a classical problem in SLAM. At last, we evaluate our optimized approaches on publicly available benchmark datasets, and compare it with the original KinectFusion, a similar method and another two frame-to-frame RGBD SLAM methods. The results indicate our approaches yield a certain degree improvement on the performance of tracking accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Besl PJ, McKay ND (1992) A method for registration of 3-D shapes. IEEE Trans Pattern Anal Mach Intell 14:239–256

    Article  Google Scholar 

  2. Chiuso A, Favaro P, Jin H, Soatto S (2000) 3-D motion and structurefrom 2-D motion causally integrated over time: implementation. In: European Conf. on Computer Vision (ECCV), pp. 734–750

  3. Comport AI, Malis E, Rives P (2007) Accurate quadrifocal tracking for robust 3D visual odometry. In: Robotics and Automation, 2007 I.E. International Conference on, Roma, pp. 40–45

  4. Davison AJ, Reid ID, Molton ND, Stasse O (2007) MonoSLAM: real-time single camera SLAM. IEEE Trans Pattern Anal Mach Intell 29:1052–1067

    Article  Google Scholar 

  5. Engelhard N, Endres F, Hess J, Sturm J, Burgard W (2011) Realtime 3D visual SLAM with a hand-held camera. In: RGB-D Workshop on 3D Perception in Robotics at the European Robotics Forum (ERF)

  6. Fiala M (2005) ARTag, a fiducial marker system using digital techniques. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, pp. 590–596 vol. 2

  7. Henry P, Krainin M, Herbst E, Ren X, Fox D (2010) RGB-D mapping: using depth cameras for dense 3D modeling of indoor environments. In: Intl. Symp. on Experimental Robotics (ISER), pp 477–491

  8. Henry P, Krainin M, Herbst E, Ren X, Fox D (2012) RGB-D mapping: using Kinect-style depth cameras for dense 3D modeling of indoor environments. Int J Robot Res 31(5):647–663

    Article  Google Scholar 

  9. Izadi S, Kim D, Hilliges O, Molyneaux D, Newcombe RA, Kohli P, Shotton J, Hodges S, Freeman D, Davison AJ, Fitzgibbon A (2011) KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera. In: Proceedings of the 24th annual ACM symposium on User interface software and technology, ACM

  10. Kato H, Billinghurst M (1999) Marker tracking and HMD calibration for a video-based augmented reality conferencing system. In: Augmented Reality, 1999. (IWAR ‘99) Proceedings. 2nd IEEE and ACM International Workshop on, San Francisco, CA, pp. 85–94

  11. Kerl C, Sturm J, Cremers D (2013) Dense visual SLAM for RGB-D cameras. In: Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on, Tokyo, pp. 2100–2106

  12. Kerl C, Sturm J, Cremers D (2013) Robust odometry estimation for RGB-D cameras. In: Robotics and Automation (ICRA), 2013 I.E. International Conference on, Karlsruhe, pp. 3748–3754

  13. Klein G, Murray D (2007) Parallel tracking and mapping for small AR workspaces. In: Mixed and Augmented Reality, 2007. ISMAR 2007. 6th IEEE and ACM International Symposium on, Nara, pp. 225–234

  14. Konolige K, Agrawal M, Bolles R, Cowan C, Fischler M, Gerkey B (2007) Outdoor mapping and navigation using stereo vision. In: Intl. Symp. on Experimental Robotics (ISER), pp 179–190

  15. Li J, Li Y, Zhang J, Li D, Wang L, Zhang M (2013) A touchable virtual screen interaction system with handheld Kinect camera. In: Wireless Communications & Signal Processing (WCSP), 2013 International Conference on, Hangzhou, pp. 1–4

  16. Low KL (2004) Linear least-squares optimization for point-to-plane ICP surface registration. Tech. Rep. TR04-004, Dep. of Comput. Sci., Univ. of N. C. at Chapel Hill, Chapel Hill

  17. Morency L, Darrell T (2002) Stereo tracking using ICP and normal flow constraint. In: Pattern Recognition, 2002. Proceedings. 16th International Conference on, 2002, pp. 367–372 vol. 4

  18. Newcombe RA, Izadi S, Hilliges O, Molyneaux D, Kim D, Davison AJ, Kohi P, Shotton J, Hodges S, Fitzgibbon A (2011) KinectFusion: real-time dense surface mapping and tracking. In: Mixed and Augmented Reality (ISMAR), 2011 10th IEEE International Symposium on, Basel, pp. 127–136

  19. Newcombe RA, Lovegrove SJ, Davison AJ (2011) DTAM: dense tracking and mapping in real-time. In: Computer Vision (ICCV), 2011 I.E. International Conference on, Barcelona, pp. 2320–2327

  20. Roth H, Vona M (2012) Moving volume KinectFusion. In: Proc. Brit. Mach. Vision Conf (BMVC), pp. 1 -11

  21. Rusu RB, Cousins S (2011) 3D is here: point cloud library (PCL). In: IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China, pp. 1–4

  22. Steinbrucker F, Sturm J, Cremers D (2011) Real-time visual odometry from dense RGB-D images. In: Computer Vision Workshops (ICCV Workshops), 2011 I.E. International Conference on, Barcelona, pp. 719–722

  23. Stuckler J, Behnke S (2012) Integrating depth and color cues for dense multi-resolution scene mapping using RGB-D cameras. In: Multisensor Fusion and Integration for Intelligent Systems (MFI), 2012 I.E. Conference on, Hamburg, pp. 162–167

  24. Sturm J, Engelhard N, Endres F, Burgard W, Cremers D (2012) A benchmark for the evaluation of RGB-D SLAM systems. In: Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on, Vilamoura, pp. 573–580

  25. Tykkala T, Comport AI, Kamarainen J, Hartikainen H (2014) Live RGB-D camera tracking for television production studios. J Vis Commun Image Represent 25:207–217

    Article  Google Scholar 

  26. Whelan T, McDonald J, Kaess M, Fallon M, Johannsson H, Leonard J (2012) Kintinuous: spatially extended KinectFusion. In: 3rd RSS Workshop on RGB-D: Advanced Reasoning with Depth Cameras, Sydney

  27. Zhang F (1997) Quaternions and matrices of quaternions. Linear Algebra Appl 251:21–57

    Article  MathSciNet  MATH  Google Scholar 

  28. Zhang J, Li Y, Li J, Wang L, Li D, Zhang M (2013) Dynamic scenes reconstruction based on foreground and background splitting. In: Wireless Communications & Signal Processing (WCSP), 2013 International Conference on, Hangzhou, pp. 1–4

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (Grant No. 61271338, 61401390), the Zhejiang Provincial Natural Science Foundation of China (Grant No. LQ14F010005), and the Open Projects Program of National Laboratory of Pattern Recognition of China (Grant No. 201306308).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Liang-Hao Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, JN., Wang, LH., Li, Y. et al. Local optimized and scalable frame-to-model SLAM. Multimed Tools Appl 75, 8675–8694 (2016). https://doi.org/10.1007/s11042-015-2780-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2780-5

Keywords

Navigation