Skip to main content

2016 | OriginalPaper | Buchkapitel

A Simple Hierarchical Pooling Data Structure for Loop Closure

verfasst von : Xiaohan Fei, Konstantine Tsotsos, Stefano Soatto

Erschienen in: Computer Vision – ECCV 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We propose a data structure obtained by hierarchically pooling Bag-of-Words (BoW) descriptors during a sequence of views that achieves average speedups in large-scale loop closure applications ranging from 2 to 20 times on benchmark datasets. Although simple, the method works as well as sophisticated agglomerative schemes at a fraction of the cost with minimal loss of performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Time-cost rate is defined as the increase of query time per thousand (1 k) images in the database. Average time-cost rate is the average of time-cost rates computed for all sequences in each dataset.
 
Literatur
2.
Zurück zum Zitat Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
3.
Zurück zum Zitat Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15561-1_56 CrossRef Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-15561-1_​56 CrossRef
4.
Zurück zum Zitat Chetverikov, D., Svirko, D., Stepanov, D., Krsek, P.: The trimmed iterative closest point algorithm. In: 2002 IEEE International Conference on Pattern Recognition (ICPR), vol. 3, pp. 545–548. IEEE (2002) Chetverikov, D., Svirko, D., Stepanov, D., Krsek, P.: The trimmed iterative closest point algorithm. In: 2002 IEEE International Conference on Pattern Recognition (ICPR), vol. 3, pp. 545–548. IEEE (2002)
5.
Zurück zum Zitat Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE International Conference on Computer Vision (ICCV), pp. 1–8. IEEE (2007) Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE International Conference on Computer Vision (ICCV), pp. 1–8. IEEE (2007)
6.
Zurück zum Zitat Cummins, M., Newman, P.: Highly scalable appearance-only slam-fab-map. 2.0. In: Robotics: Science and Systems, Seattle, USA, vol. 5 (2009) Cummins, M., Newman, P.: Highly scalable appearance-only slam-fab-map. 2.0. In: Robotics: Science and Systems, Seattle, USA, vol. 5 (2009)
7.
Zurück zum Zitat Dong, J., Soatto, S.: Domain size pooling in local descriptors: Dsp-sift. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015) Dong, J., Soatto, S.: Domain size pooling in local descriptors: Dsp-sift. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
8.
Zurück zum Zitat Galvez-Lopez, D., Tardos, J.D.: Real-time loop detection with bags of binary words. In: 2011 IEEE/RSJ 2015 IEEE Conference on Intelligent Robots and Systems (IROS), pp. 51–58. IEEE (2011) Galvez-Lopez, D., Tardos, J.D.: Real-time loop detection with bags of binary words. In: 2011 IEEE/RSJ 2015 IEEE Conference on Intelligent Robots and Systems (IROS), pp. 51–58. IEEE (2011)
9.
Zurück zum Zitat Geiger, A., Lenz, P., Stiller, C., Urtasun, R.: Vision meets robotics: the kitti dataset. Intl. J. of Robotics Res., 0278364913491297 (2013) Geiger, A., Lenz, P., Stiller, C., Urtasun, R.: Vision meets robotics: the kitti dataset. Intl. J. of Robotics Res., 0278364913491297 (2013)
10.
Zurück zum Zitat Geman, D., Jedynak, B.: An active testing model for tracking roads in satellite images. IEEE Trans. Pattern Anal. Mach. Intell. 18(1), 1–14 (1996)CrossRef Geman, D., Jedynak, B.: An active testing model for tracking roads in satellite images. IEEE Trans. Pattern Anal. Mach. Intell. 18(1), 1–14 (1996)CrossRef
11.
Zurück zum Zitat Girshick, R., Iandola, F., Darrell, T., Malik, J.: Deformable part models are convolutional neural networks (2014). arXiv preprint arXiv:1409.5403 Girshick, R., Iandola, F., Darrell, T., Malik, J.: Deformable part models are convolutional neural networks (2014). arXiv preprint arXiv:​1409.​5403
12.
Zurück zum Zitat Grauman, K., Darrell, T.: The pyramid match kernel: Discriminative classification with sets of image features. In: 2015 IEEE International Conference on Computer Vision (ICCV), vol. 2, pp. 1458–1465. IEEE (2005) Grauman, K., Darrell, T.: The pyramid match kernel: Discriminative classification with sets of image features. In: 2015 IEEE International Conference on Computer Vision (ICCV), vol. 2, pp. 1458–1465. IEEE (2005)
13.
Zurück zum Zitat Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5302, pp. 304–317. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88682-2_24 CrossRef Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5302, pp. 304–317. Springer, Heidelberg (2008). doi:10.​1007/​978-3-540-88682-2_​24 CrossRef
14.
Zurück zum Zitat Jones, E., Soatto, S.: Visual-inertial navigation, localization and mapping: a scalable real-time large-scale approach. Intl. J. of Robotics Res. (2011) Jones, E., Soatto, S.: Visual-inertial navigation, localization and mapping: a scalable real-time large-scale approach. Intl. J. of Robotics Res. (2011)
15.
Zurück zum Zitat Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, pp. 1–10. IEEE Computer Society (2007) Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, pp. 1–10. IEEE Computer Society (2007)
16.
Zurück zum Zitat Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 2169–2178. IEEE (2006) Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 2169–2178. IEEE (2006)
17.
Zurück zum Zitat Lim, H., Lim, J., Kim, H.J.: Real-time 6-d of monocular visual slam in a large-scale environment. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 1532–1539. IEEE (2014) Lim, H., Lim, J., Kim, H.J.: Real-time 6-d of monocular visual slam in a large-scale environment. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 1532–1539. IEEE (2014)
18.
Zurück zum Zitat Mahendran, A., Vedaldi, A.: Understanding deep image representations by inverting them. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5188–5196. IEEE (2015) Mahendran, A., Vedaldi, A.: Understanding deep image representations by inverting them. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5188–5196. IEEE (2015)
19.
Zurück zum Zitat Mur-Artal, R., Montiel, J., Tardos, J.D.: Orb-slam: a versatile and accurate monocular slam system. IEEE Trans. Rob. 31(5), 1147–1163 (2015)CrossRef Mur-Artal, R., Montiel, J., Tardos, J.D.: Orb-slam: a versatile and accurate monocular slam system. IEEE Trans. Rob. 31(5), 1147–1163 (2015)CrossRef
20.
Zurück zum Zitat Newcombe, R.A., Davison, A.J.: Live dense reconstruction with a single moving camera. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1498–1505. IEEE (2010) Newcombe, R.A., Davison, A.J.: Live dense reconstruction with a single moving camera. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1498–1505. IEEE (2010)
21.
Zurück zum Zitat Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: 2006 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 2161–2168. IEEE (2006) Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: 2006 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 2161–2168. IEEE (2006)
22.
Zurück zum Zitat Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006). doi:10.1007/11744023_34 CrossRef Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006). doi:10.​1007/​11744023_​34 CrossRef
23.
Zurück zum Zitat Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: Orb: an efficient alternative to sift or surf. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011) Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: Orb: an efficient alternative to sift or surf. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011)
24.
Zurück zum Zitat Samet, H.: The design and analysis of spatial data structures, vol. 85. Addison-Wesley, Reading (1990) Samet, H.: The design and analysis of spatial data structures, vol. 85. Addison-Wesley, Reading (1990)
25.
Zurück zum Zitat Sattler, T., Leibe, B., Kobbelt, L.: Fast image-based localization using direct 2d-to-3d matching. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 667–674. IEEE (2011) Sattler, T., Leibe, B., Kobbelt, L.: Fast image-based localization using direct 2d-to-3d matching. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 667–674. IEEE (2011)
26.
Zurück zum Zitat Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: Workshop at International Conference on Learning Representations (2014) Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: Workshop at International Conference on Learning Representations (2014)
27.
Zurück zum Zitat Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: 2003 IEEE International Conference on Computer Vision (ICCV), pp. 1470–1477. IEEE (2003) Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: 2003 IEEE International Conference on Computer Vision (ICCV), pp. 1470–1477. IEEE (2003)
28.
Zurück zum Zitat Smale, S., Zhou, D.X.: Shannon sampling ii: connections to learning theory. Appl. Comput. Harmonic Anal. 19(3), 285–302 (2005)MathSciNetCrossRefMATH Smale, S., Zhou, D.X.: Shannon sampling ii: connections to learning theory. Appl. Comput. Harmonic Anal. 19(3), 285–302 (2005)MathSciNetCrossRefMATH
29.
Zurück zum Zitat Sturm, J., Engelhard, N., Endres, F., Burgard, W., Cremers, D.: A benchmark for the evaluation of rgb-d slam systems. In: 2012 IEEE/RSJ International Conference on Intelligent Robot Systems (IROS) (2012) Sturm, J., Engelhard, N., Endres, F., Burgard, W., Cremers, D.: A benchmark for the evaluation of rgb-d slam systems. In: 2012 IEEE/RSJ International Conference on Intelligent Robot Systems (IROS) (2012)
30.
Zurück zum Zitat Swain, M.J., Ballard, D.H.: Color indexing. Intl. J. Comput. Vis. 7(1), 11–32 (1991)CrossRef Swain, M.J., Ballard, D.H.: Color indexing. Intl. J. Comput. Vis. 7(1), 11–32 (1991)CrossRef
31.
Zurück zum Zitat Tishby, N., Pereira, F.C., Bialek, W.: The information bottleneck method. In: Proceedings of the Allerton Conference (2000) Tishby, N., Pereira, F.C., Bialek, W.: The information bottleneck method. In: Proceedings of the Allerton Conference (2000)
32.
Zurück zum Zitat Torii, A., Sivic, J., Pajdla, T.: Visual localization by linear combination of image descriptors. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 102–109. IEEE (2011) Torii, A., Sivic, J., Pajdla, T.: Visual localization by linear combination of image descriptors. In: 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 102–109. IEEE (2011)
33.
Zurück zum Zitat Torralba, A., Murphy, K.P., Freeman, W.T., Rubin, M.A.: Context-based vision system for place and object recognition. In: 2003 IEEE International Conference on Computer Vision (ICCV), pp. 273–280. IEEE (2003) Torralba, A., Murphy, K.P., Freeman, W.T., Rubin, M.A.: Context-based vision system for place and object recognition. In: 2003 IEEE International Conference on Computer Vision (ICCV), pp. 273–280. IEEE (2003)
34.
Zurück zum Zitat Turcot, P., Lowe, D.G.: Better matching with fewer features: The selection of useful features in large database recognition problems. In: 2009 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 2109–2116. IEEE (2009) Turcot, P., Lowe, D.G.: Better matching with fewer features: The selection of useful features in large database recognition problems. In: 2009 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 2109–2116. IEEE (2009)
35.
Zurück zum Zitat Ulrich, I., Nourbakhsh, I.: Appearance-based place recognition for topological localization. In: 2000 IEEE International Conference on Robotics and Automation (ICRA), vol. 2, pp. 1023–1029. IEEE (2000) Ulrich, I., Nourbakhsh, I.: Appearance-based place recognition for topological localization. In: 2000 IEEE International Conference on Robotics and Automation (ICRA), vol. 2, pp. 1023–1029. IEEE (2000)
36.
Zurück zum Zitat Vasconcelos, N.: On the efficient evaluation of probabilistic similarity functions for image retrieval. IEEE Trans. Inf. Theory 50(7), 1482–1496 (2004)MathSciNetCrossRefMATH Vasconcelos, N.: On the efficient evaluation of probabilistic similarity functions for image retrieval. IEEE Trans. Inf. Theory 50(7), 1482–1496 (2004)MathSciNetCrossRefMATH
37.
Zurück zum Zitat Williams, B., Cummins, M., Neira, J., Newman, P., Reid, I., Tardós, J.: A comparison of loop closing techniques in monocular slam. Rob. Auton. Syst. 57(12), 1188–1197 (2009)CrossRef Williams, B., Cummins, M., Neira, J., Newman, P., Reid, I., Tardós, J.: A comparison of loop closing techniques in monocular slam. Rob. Auton. Syst. 57(12), 1188–1197 (2009)CrossRef
Metadaten
Titel
A Simple Hierarchical Pooling Data Structure for Loop Closure
verfasst von
Xiaohan Fei
Konstantine Tsotsos
Stefano Soatto
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46487-9_20