nach oben

International Journal of Computer Vision

Erschienen in:

01.02.2014

Generative Methods for Long-Term Place Recognition in Dynamic Scenes

verfasst von: Edward Johns, Guang-Zhong Yang

Erschienen in: International Journal of Computer Vision | Ausgabe 3/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

This paper proposes a new framework for visual place recognition that incrementally learns models of each place and offers adaptability to dynamic elements in the scene. Traditional Bag-Of-Words (BOW) image-retrieval approaches to place recognition typically treat images in a holistic manner and are not capable of dealing with sub-scene dynamics, such as structural changes to a building façade or seasonal effects on foliage. However, by treating local features as observations of real-world landmarks in a scene that is observed repeatedly over a period of time, such dynamics can be modelled at a local level, and the spatio-temporal properties of each landmark can be independently updated incrementally. The method proposed models each place as a set of such landmarks and their geometric relationships. A new BOW filtering stage and geometric verification scheme are introduced to compute a similarity score between a query image and each scene model. As further training images are acquired for each place, the landmark properties are updated over time and in the long term, the model can adapt to dynamic behaviour in the scene. Results on an outdoor dataset of images captured along a 7 km path, over a period of 5 months, show an improvement in recognition performance when compared to state-of-the-art image retrieval approaches to place recognition.

Vorheriger Artikel Detecting People Looking at Each Other in Videos

Nächster Artikel Active Rare Class Discovery and Classification Using Dirichlet Processes

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Agarwal, S., Snavely, N., Simon, I., Seitz, S. M., & Szeliski, R. (2009). Building rome in a day. In Proceedings of ICCV.

Arandjelovic, R., & Zisserman, A. (2012). Three things everyone should know to improve object retrieval. In Proceedings of CVPR.

Arnaud, E., Odone F., Delponte, E., & Verri, A. (2006). Trains of keypoints for 3d object recognition. In Proceedings of ICPR.

Bowman, K. O., & Shenton, L. R. (2007). The beta distribution, moment method. Far East Journal of Theoretical Statistics, 23, 133–165.MATHMathSciNet

Cao, Y., Wang, C., Li, Z., & Zhang, L. (2010). Spatial bag-of-features. In Proceedings of CVPR.

Chatfield, K., Lempitsky, V., Vedaldi, A., & Zisserman, A. (2011). The devil is in the details: An evaluation of recent feature encoding methods. In Proceedings of BMVC.

Chum, O., Philbin, J., & Zisserman, A. (2008). Near duplicate image detection: Min-hash and tf-idf weighting. In Proceedings of BMVC.

Chum, O., Mikulík, A., Perdoch, M., & Matas, J. (2011). Total recall ii: Query expansion revisited. In Proceedings of CVPR (pp. 889–896).

Csurka, G., Dance, C. R., Fan, L., Willamowski, J., & Bray, C. (2004). Visual categorization with bags of keypoints. In Proceedings of ECCV International Workshop on Statistical Learning in Computer Vision.

Cummins, M., & Newman, P. (2008). Fab-map: Probabilistic localization and mapping in the space of appearance. IJRR, 27, 647–665.

Cummins, M., & Newman, P. (2009). Highly scalable appearance-only slam-fab-map 2.0. In Proceedings of Robotics: Science and Systems.

Hartley, R. I., & Zisserman, A. (2004). Multiple view geometry in computer vision. Cambridge: Cambridge University Press.MATHCrossRef

Jegou, H., & Chum, O. (2012). Negative evidences and co-occurrences in image retrieval: The benefit of pca and whitening. In Proceedings of ECCV.

Jégou, H., Douze, M., & Schmid, C. (2010). Improving bag-of-features for large scale image search. IJCV, 87(3), 316–336.CrossRef

Johns, E., & Yang, G. Z. (2011a). From images to scenes: Compressing an image cluster into a single scene model for place recognition. In Proceedings of ICCV (pp. 874–881).

Johns, E., & Yang, G. Z. (2011b). Global localization in a dense continuous topological map. In Proceedings of ICAR

Johns, E., & Yang, G. Z. (2011c). Place recognition and online learning in dynamic scenes with spatio-temporal landmarks. In Proceedings of BMVC, (pp. 10.1–10.12).

Johns, E., & Yang, G. Z. (2013a). Dynamic scene models for incremental, long-term, appearance-based localisation. In Procedings of ICRA.

Johns, E., & Yang, G. Z. (2013b). Feature co-occurrence maps: Appearance-based localisation throughout the day. In Proceedings of ICRA.

Leordeanu, M., & Hebert, M. (2005). A spectral technique for correspondence problems using pairwise constraints. In Proceedings of ICCV (pp. 1482–1489).

Li, Y., Snavely, N., & Huttenlocher, D. P. (2010). Location recognition using prioritized feature matching. In Proceedings of ECCV (pp. 791–804).

Lik, F., & Kosecka, J. (2006). Probabilistic location recognition using reduced feature set. In Proceedings of ICRA.

Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. Trans IJCV, 60, 91–110.CrossRef

Luo, J., Pronobis, A., Caputo, B., & Jensfelt, P. (2007). Incremental learning for place recognition in dynamic environments. In Proceedings of IROS.

Marszalek, M., & Schmid, C. (2006). Spatial weighting for bag-of-features. In Proceedings of CVPR.

Mikullk, A., & Perdoch, M. (2010). Learning a fine vocabulary. In Proceedings of ECCV.

Ni, K., Kannan, A., Criminis, A., & Winn, J. (2009). Epitomic location recognition. In IEEE Trans PAMI.

Nister, D., & Stewenius, H. (2006). Scalable recognition with a vocabulary tree. In Proceedings of CVPR (pp. 1222–1229).

Orabona, F., Jie L., & Caputo, B. (2010). Online-batch strongly convex multi kernel learning. In Proceedings of CVPR.

Philbin, J., Chum, O., Isard, M., Sivic, J., & Zisserman, A. (2007). Object retrieval with large vocabularies and fast spatial matching. In Proceedings of CVPR (pp. 1–8).

Philbin, J., Chum, O., Isard, M., Sivic, J., & Zisserman, A. (2008). Lost in quantization: Improving particular object retrieval in large scale image databases. In Proceedings of CVPR.

Pronobis, A., & Caputo, B. (2007). Confidence-based cue integration for visual place recognition. In Proceedings of IROS.

Raguram, R., Wu, C., Frahm, J. M., & Lazebnik, S. (2011). Modeling and recognition of landmark image collections using iconic scene graphs. Trans IJCV, 95(3), 213–239.CrossRef

Schindler, G., Brown, M., & Szeliski, R. (2007). City-scale location recognition. In Proceedings of CVPR.

Se, S., Lowe, D., & Little, J. (2001). Vision-based mobile robot localization and mapping using scale-invariant features. In Proceedings of ICRA.

Sivic, J., & Zisserman, A. (2003). Video google: A text retrieval approach to object matching in videos. In Proceedings of ICCV (pp. 1470–1477).

Tolias, G., & Avrithis, Y. (2011). Speeded-up, relaxed spatial matching. In Proceedings of ICCV.

Winn, J., Criminisi, A., & Minka, T. (2005). Object categorization by learned universal visual dictionary. In Proceedings of ICCV (pp. 1800–1807).

Zhai, C., & Lafferty, J. (2001). A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of ACM SIGIR (pp. 334–342).

Zhang, Y., Jia, Z., & Chen, T. (2011). Image retrieval with geometry-preserving visual phrases. In Proceedings of CVPR (pp. 809–816).

Zheng, Y. T., Zhao, M., Song, Y., Adam, H., Buddemeier, U., Bissacco, A., et al. (2009). Tour the world: building a web-scale landmark recognition engine. In Proceedings of CVPR.

Titel: Generative Methods for Long-Term Place Recognition in Dynamic Scenes
verfasst von: Edward Johns
Guang-Zhong Yang
Publikationsdatum: 01.02.2014
Verlag: Springer US
Erschienen in: International Journal of Computer Vision / Ausgabe 3/2014
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-013-0648-6

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 3/2014

Detecting People Looking at Each Other in Videos

Objects, Actions, Places

Active Rare Class Discovery and Classification Using Dirichlet Processes

Demisting the Hough Transform for 3D Shape Recognition and Registration

Rotation-Invariant HOG Descriptors Using Fourier Analysis in Polar and Spherical Coordinates

Object and Action Classification with Latent Window Parameters

Premium Partner