nach oben

Erschienen in:

2016 | OriginalPaper | Buchkapitel

A Simple Hierarchical Pooling Data Structure for Loop Closure

verfasst von : Xiaohan Fei, Konstantine Tsotsos, Stefano Soatto

Erschienen in: Computer Vision – ECCV 2016

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

We propose a data structure obtained by hierarchically pooling Bag-of-Words (BoW) descriptors during a sequence of views that achieves average speedups in large-scale loop closure applications ranging from 2 to 20 times on benchmark datasets. Although simple, the method works as well as sophisticated agglomerative schemes at a fraction of the cost with minimal loss of performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

Nächstes Kapitel A Versatile Approach for Solving PnP, PnPf, and PnPfr Problems

Time-cost rate is defined as the increase of query time per thousand (1 k) images in the database. Average time-cost rate is the average of time-cost rates computed for all sequences in each dataset.

http://vis.uky.edu/~stewe/ukbench/.

https://lear.inrialpes.fr/~jegou/data.php.

Aizawa, A.: An information-theoretic perspective of tf-idf measures. Inf. Process. Manage. 39(1), 45–65 (2003)MathSciNetCrossRefMATH

Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH

Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15561-1_56 CrossRef

Chetverikov, D., Svirko, D., Stepanov, D., Krsek, P.: The trimmed iterative closest point algorithm. In: 2002 IEEE International Conference on Pattern Recognition (ICPR), vol. 3, pp. 545–548. IEEE (2002)

Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE International Conference on Computer Vision (ICCV), pp. 1–8. IEEE (2007)

Cummins, M., Newman, P.: Highly scalable appearance-only slam-fab-map. 2.0. In: Robotics: Science and Systems, Seattle, USA, vol. 5 (2009)

Dong, J., Soatto, S.: Domain size pooling in local descriptors: Dsp-sift. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)

Galvez-Lopez, D., Tardos, J.D.: Real-time loop detection with bags of binary words. In: 2011 IEEE/RSJ 2015 IEEE Conference on Intelligent Robots and Systems (IROS), pp. 51–58. IEEE (2011)

Geiger, A., Lenz, P., Stiller, C., Urtasun, R.: Vision meets robotics: the kitti dataset. Intl. J. of Robotics Res., 0278364913491297 (2013)

10.

Geman, D., Jedynak, B.: An active testing model for tracking roads in satellite images. IEEE Trans. Pattern Anal. Mach. Intell. 18(1), 1–14 (1996)CrossRef

11.

Girshick, R., Iandola, F., Darrell, T., Malik, J.: Deformable part models are convolutional neural networks (2014). arXiv preprint arXiv:1409.5403

12.

Grauman, K., Darrell, T.: The pyramid match kernel: Discriminative classification with sets of image features. In: 2015 IEEE International Conference on Computer Vision (ICCV), vol. 2, pp. 1458–1465. IEEE (2005)

13.

Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5302, pp. 304–317. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88682-2_24 CrossRef

14.

Jones, E., Soatto, S.: Visual-inertial navigation, localization and mapping: a scalable real-time large-scale approach. Intl. J. of Robotics Res. (2011)

15.

Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, pp. 1–10. IEEE Computer Society (2007)

16.

Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 2169–2178. IEEE (2006)

17.

Lim, H., Lim, J., Kim, H.J.: Real-time 6-d of monocular visual slam in a large-scale environment. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 1532–1539. IEEE (2014)

18.

Mahendran, A., Vedaldi, A.: Understanding deep image representations by inverting them. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5188–5196. IEEE (2015)

19.

Mur-Artal, R., Montiel, J., Tardos, J.D.: Orb-slam: a versatile and accurate monocular slam system. IEEE Trans. Rob. 31(5), 1147–1163 (2015)CrossRef

20.

Newcombe, R.A., Davison, A.J.: Live dense reconstruction with a single moving camera. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1498–1505. IEEE (2010)

21.

Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: 2006 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 2161–2168. IEEE (2006)

22.

Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006). doi:10.1007/11744023_34 CrossRef

23.

Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: Orb: an efficient alternative to sift or surf. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011)

24.

Samet, H.: The design and analysis of spatial data structures, vol. 85. Addison-Wesley, Reading (1990)

25.

Sattler, T., Leibe, B., Kobbelt, L.: Fast image-based localization using direct 2d-to-3d matching. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 667–674. IEEE (2011)

26.

Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: Workshop at International Conference on Learning Representations (2014)

27.

Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: 2003 IEEE International Conference on Computer Vision (ICCV), pp. 1470–1477. IEEE (2003)

28.

Smale, S., Zhou, D.X.: Shannon sampling ii: connections to learning theory. Appl. Comput. Harmonic Anal. 19(3), 285–302 (2005)MathSciNetCrossRefMATH

29.

Sturm, J., Engelhard, N., Endres, F., Burgard, W., Cremers, D.: A benchmark for the evaluation of rgb-d slam systems. In: 2012 IEEE/RSJ International Conference on Intelligent Robot Systems (IROS) (2012)

30.

Swain, M.J., Ballard, D.H.: Color indexing. Intl. J. Comput. Vis. 7(1), 11–32 (1991)CrossRef

31.

Tishby, N., Pereira, F.C., Bialek, W.: The information bottleneck method. In: Proceedings of the Allerton Conference (2000)

32.

Torii, A., Sivic, J., Pajdla, T.: Visual localization by linear combination of image descriptors. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 102–109. IEEE (2011)

33.

Torralba, A., Murphy, K.P., Freeman, W.T., Rubin, M.A.: Context-based vision system for place and object recognition. In: 2003 IEEE International Conference on Computer Vision (ICCV), pp. 273–280. IEEE (2003)

34.

Turcot, P., Lowe, D.G.: Better matching with fewer features: The selection of useful features in large database recognition problems. In: 2009 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 2109–2116. IEEE (2009)

35.

Ulrich, I., Nourbakhsh, I.: Appearance-based place recognition for topological localization. In: 2000 IEEE International Conference on Robotics and Automation (ICRA), vol. 2, pp. 1023–1029. IEEE (2000)

36.

Vasconcelos, N.: On the efficient evaluation of probabilistic similarity functions for image retrieval. IEEE Trans. Inf. Theory 50(7), 1482–1496 (2004)MathSciNetCrossRefMATH

37.

Williams, B., Cummins, M., Neira, J., Newman, P., Reid, I., Tardós, J.: A comparison of loop closing techniques in monocular slam. Rob. Auton. Syst. 57(12), 1188–1197 (2009)CrossRef

Titel: A Simple Hierarchical Pooling Data Structure for Loop Closure
verfasst von: Xiaohan Fei
Konstantine Tsotsos
Stefano Soatto
Verlag: Springer International Publishing
Buch: Computer Vision – ECCV 2016
Print ISBN: 978-3-319-46486-2

Electronic ISBN: 978-3-319-46487-9

Copyright-Jahr: 2016
DOI: https://doi.org/10.1007/978-3-319-46487-9_20

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"