Skip to main content
Erschienen in: International Journal of Computer Vision 3/2016

01.02.2016

Image Based Geo-localization in the Alps

verfasst von: Olivier Saurer, Georges Baatz, Kevin Köser, L’ubor Ladický, Marc Pollefeys

Erschienen in: International Journal of Computer Vision | Ausgabe 3/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Given a picture taken somewhere in the world, automatic geo-localization of such an image is an extremely useful task especially for historical and forensic sciences, documentation purposes, organization of the world’s photographs and intelligence applications. While tremendous progress has been made over the last years in visual location recognition within a single city, localization in natural environments is much more difficult, since vegetation, illumination, seasonal changes make appearance-only approaches impractical. In this work, we target mountainous terrain and use digital elevation models to extract representations for fast visual database lookup. We propose an automated approach for very large scale visual localization that can efficiently exploit visual information (contours) and geometric constraints (consistent orientation) at the same time. We validate the system at the scale of Switzerland (40,000 \(\hbox {km}^2\)) using over 1000 landscape query images with ground truth GPS position.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
3
Synthetic experiments verified that taking the photo from ten or fifty meters above the ground does not degrade recognition besides very special cases like standing very close to a small wall.
 
Literatur
Zurück zum Zitat Baatz, G., Köser, K., Chen, D., Grzeszczuk, R., & Pollefeys, M. (2012). Leveraging 3d city models for rotation invariant place-of-interest recognition. International Journal of Computer Vision, 96, 315–334. Special Issue on Mobile Vision.CrossRef Baatz, G., Köser, K., Chen, D., Grzeszczuk, R., & Pollefeys, M. (2012). Leveraging 3d city models for rotation invariant place-of-interest recognition. International Journal of Computer Vision, 96, 315–334. Special Issue on Mobile Vision.CrossRef
Zurück zum Zitat Baatz, G., Saurer, O., Köser, K., & Pollefeys, M. (2012). Large scale visual geo-localization of images in mountainous terrain. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 517–530). Baatz, G., Saurer, O., Köser, K., & Pollefeys, M. (2012). Large scale visual geo-localization of images in mountainous terrain. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 517–530).
Zurück zum Zitat Baboud, L., Cadík, M., Eisemann, E., & Seidel, H.-P. (2011). Automatic photo-to-terrain alignment for the annotation of mountain pictures. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 41–48). Baboud, L., Cadík, M., Eisemann, E., & Seidel, H.-P. (2011). Automatic photo-to-terrain alignment for the annotation of mountain pictures. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 41–48).
Zurück zum Zitat Bansal, M., & Daniilidis, K. (2014). Geometric urban geo-localization. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 3978–3985). Bansal, M., & Daniilidis, K. (2014). Geometric urban geo-localization. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 3978–3985).
Zurück zum Zitat Bazin, J.-C., Kweon, I., Demonceaux, C., & Vasseur, P. (2009). Dynamic programming and skyline extraction in catadioptric infrared images. In Proceedings of International Conference on Robotics and Automation (ICRA) (pp. 409–416). Bazin, J.-C., Kweon, I., Demonceaux, C., & Vasseur, P. (2009). Dynamic programming and skyline extraction in catadioptric infrared images. In Proceedings of International Conference on Robotics and Automation (ICRA) (pp. 409–416).
Zurück zum Zitat Blake, A., Rother, C., Brown, M., Perez, P., & Torr, P. (2004). Interactive image segmentation using an adaptive gmmrf model. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 428–441). Blake, A., Rother, C., Brown, M., Perez, P., & Torr, P. (2004). Interactive image segmentation using an adaptive gmmrf model. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 428–441).
Zurück zum Zitat Brown, M., & Lowe, D. G. (2007). Automatic panoramic image stitching using invariant features. International Journal of Computer Vision, 74, 59–73.CrossRef Brown, M., & Lowe, D. G. (2007). Automatic panoramic image stitching using invariant features. International Journal of Computer Vision, 74, 59–73.CrossRef
Zurück zum Zitat Chen, D., Baatz, G., Köser, K., Tsai, S., Vedantham, R., Pylvanainen, T., Roimela, K., Chen, X., Bach, J., Pollefeys, M., Girod, B., & Grzeszczuk, R. (2011). City-scale landmark identification on mobile devices. In Proceedings of Computer Vision and Pattern Recognition (CVPR). Chen, D., Baatz, G., Köser, K., Tsai, S., Vedantham, R., Pylvanainen, T., Roimela, K., Chen, X., Bach, J., Pollefeys, M., Girod, B., & Grzeszczuk, R. (2011). City-scale landmark identification on mobile devices. In Proceedings of Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Comaniciu, D., Meer, P., & Member, S. (2002). Mean shift: A robust approach toward feature space analysis. Transactions on Pattern Analysis and Machine Intelligence, 24, 603–619.CrossRef Comaniciu, D., Meer, P., & Member, S. (2002). Mean shift: A robust approach toward feature space analysis. Transactions on Pattern Analysis and Machine Intelligence, 24, 603–619.CrossRef
Zurück zum Zitat Cozman, F. (1997). Decision Making Based on Convex Sets of Probability Distributions: Quasi-Bayesian Networks and Outdoor Visual Position Estimation. PhD thesis, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA. Cozman, F. (1997). Decision Making Based on Convex Sets of Probability Distributions: Quasi-Bayesian Networks and Outdoor Visual Position Estimation. PhD thesis, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
Zurück zum Zitat Cozman, F., & Krotkov, E. (1996). Position estimation from outdoor visual landmarks for teleoperation of lunar rovers. In WACV ’96 (pp. 156–161). Cozman, F., & Krotkov, E. (1996). Position estimation from outdoor visual landmarks for teleoperation of lunar rovers. In WACV ’96 (pp. 156–161).
Zurück zum Zitat Friedman, J., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: A statistical view of boosting. The Annals of Statistics, 28, 337–407.MathSciNetCrossRefMATH Friedman, J., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: A statistical view of boosting. The Annals of Statistics, 28, 337–407.MathSciNetCrossRefMATH
Zurück zum Zitat Hays, J., & Efros, A. A. (2008). im2gps: estimating geographic information from a single image. In Proceedings of Computer Vision and Pattern Recognition (CVPR). Hays, J., & Efros, A. A. (2008). im2gps: estimating geographic information from a single image. In Proceedings of Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Hussain, S. ul., & Triggs, B. (2012). Visual recognition using local quantized patterns. In Proceedings of European Conference on Computer Vision (ECCV). Hussain, S. ul., & Triggs, B. (2012). Visual recognition using local quantized patterns. In Proceedings of European Conference on Computer Vision (ECCV).
Zurück zum Zitat Kolmogorov, V., & Boykov, Y. (2005). What metrics can be approximated by geo-cuts, or global optimization of length/area and flux. In Proceedings of International Conference on Computer Vision (ICCV) (pp. 564–571). Washington: DC, USA. Kolmogorov, V., & Boykov, Y. (2005). What metrics can be approximated by geo-cuts, or global optimization of length/area and flux. In Proceedings of International Conference on Computer Vision (ICCV) (pp. 564–571). Washington: DC, USA.
Zurück zum Zitat Ladicky, L., Russell, C., Kohli, P., & Torr, P. (2014). Associative hierarchical random fields. Transactions on Pattern Analysis and Machine Intelligence, 36(6), 1056–1077.CrossRef Ladicky, L., Russell, C., Kohli, P., & Torr, P. (2014). Associative hierarchical random fields. Transactions on Pattern Analysis and Machine Intelligence, 36(6), 1056–1077.CrossRef
Zurück zum Zitat Ladicky, L., Zeisl, B., & Pollefeys, M. (2014) Discriminatively trained dense surface normal estimation. In Proceedings of European Conference on Computer Vision (ECCV). Ladicky, L., Zeisl, B., & Pollefeys, M. (2014) Discriminatively trained dense surface normal estimation. In Proceedings of European Conference on Computer Vision (ECCV).
Zurück zum Zitat Lalonde, J.-F., Narasimhan, S. G., & Efros, A. A. (2010). What do the sun and the sky tell us about the camera? International Journal on Computer Vision, 88(1), 24–51.CrossRef Lalonde, J.-F., Narasimhan, S. G., & Efros, A. A. (2010). What do the sun and the sky tell us about the camera? International Journal on Computer Vision, 88(1), 24–51.CrossRef
Zurück zum Zitat Li, Y., Snavely, N., & Huttenlocher, D. P. (2010). Location recognition using prioritized feature matching. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 791–804). Li, Y., Snavely, N., & Huttenlocher, D. P. (2010). Location recognition using prioritized feature matching. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 791–804).
Zurück zum Zitat Lie, W.-N., Lin, T. C.-I., Lin, T.-C., & Hung, K.-S. (2005). A robust dynamic programming algorithm to extract skyline in images for navigation. Pattern Recognition Letters, 26(2), 221–230.CrossRef Lie, W.-N., Lin, T. C.-I., Lin, T.-C., & Hung, K.-S. (2005). A robust dynamic programming algorithm to extract skyline in images for navigation. Pattern Recognition Letters, 26(2), 221–230.CrossRef
Zurück zum Zitat Lowe, D. G. (2004). Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60(2), 91–110.CrossRef Lowe, D. G. (2004). Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60(2), 91–110.CrossRef
Zurück zum Zitat Malik, J., Belongie, S., Leung, T., & Shi, J. (2001). Contour and texture analysis for image segmentation. International Journal of Computer Vision, 43(1), 7–27. Malik, J., Belongie, S., Leung, T., & Shi, J. (2001). Contour and texture analysis for image segmentation. International Journal of Computer Vision, 43(1), 7–27.
Zurück zum Zitat Manay, S., Cremers, D., Hong, B.-W., Yezzi, A., & Soatto, S. (2006). Integral invariants for shape matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), 1602–1618. Manay, S., Cremers, D., Hong, B.-W., Yezzi, A., & Soatto, S. (2006). Integral invariants for shape matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), 1602–1618.
Zurück zum Zitat Naval, P. C., Mukunoki, M., Minoh, M., & Ikeda, K. (1997). Estimating camera position and orientation from geographical map and mountain image. In 38th Pattern Sensing Group Research Meeting, Society of Instrument and Control Engineers (pp. 9–16). Naval, P. C., Mukunoki, M., Minoh, M., & Ikeda, K. (1997). Estimating camera position and orientation from geographical map and mountain image. In 38th Pattern Sensing Group Research Meeting, Society of Instrument and Control Engineers (pp. 9–16).
Zurück zum Zitat Nistér, D., & Stewénius, H. (2006). Scalable recognition with a vocabulary tree. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 2161–2168). Nistér, D., & Stewénius, H. (2006). Scalable recognition with a vocabulary tree. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 2161–2168).
Zurück zum Zitat Ramalingam, S., Bouaziz, S., & Sturm, P. (2011). Pose estimation using both points and lines for geo-localization. In Proceedings of International Conference on Robotics and Automation (ICRA) (pp. 4716–4723). Ramalingam, S., Bouaziz, S., & Sturm, P. (2011). Pose estimation using both points and lines for geo-localization. In Proceedings of International Conference on Robotics and Automation (ICRA) (pp. 4716–4723).
Zurück zum Zitat Ramalingam, S., Bouaziz, S., & Sturm, P., & Brand, M. (2010). Skyline2gps: Localization in urban canyons using omni-skylines. In IROS 2010 (pp. 3816–3823). Ramalingam, S., Bouaziz, S., & Sturm, P., & Brand, M. (2010). Skyline2gps: Localization in urban canyons using omni-skylines. In IROS 2010 (pp. 3816–3823).
Zurück zum Zitat Schindler, G., Brown, M., & Szeliski, R. (2007). City-scale location recognition. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 1–7). Schindler, G., Brown, M., & Szeliski, R. (2007). City-scale location recognition. In Proceedings of Computer Vision and Pattern Recognition (CVPR) (pp. 1–7).
Zurück zum Zitat Shechtman, E., & Irani, M. (2007). Matching local self-similarities across images and videos. In Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR). Shechtman, E., & Irani, M. (2007). Matching local self-similarities across images and videos. In Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Shotton, J., Winn, J., Rother, C., & Criminisi, A. (2006). Textonboost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 1–15). Shotton, J., Winn, J., Rother, C., & Criminisi, A. (2006). Textonboost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In Proceedings of European Conference on Computer Vision (ECCV) (pp. 1–15).
Zurück zum Zitat Sivic, J., & Zisserman, A. (2003) Video Google: A text retrieval approach to object matching in videos. In Proceedings of International Conference on Computer Vision (ICCV) (pp. 1470–1477). Sivic, J., & Zisserman, A. (2003) Video Google: A text retrieval approach to object matching in videos. In Proceedings of International Conference on Computer Vision (ICCV) (pp. 1470–1477).
Zurück zum Zitat Stein, F., & Medioni, G. (1995). Map-based localization using the panoramic horizon. Transaction on Robotics and Automation, 11(6), 892–896.CrossRef Stein, F., & Medioni, G. (1995). Map-based localization using the panoramic horizon. Transaction on Robotics and Automation, 11(6), 892–896.CrossRef
Zurück zum Zitat Talluri, R., & Aggarwal, J. (1992). Position estimation for an autonomous mobile robot in an outdoor environment. Transaction on Robotics and Automation, 8(5), 573–584.CrossRef Talluri, R., & Aggarwal, J. (1992). Position estimation for an autonomous mobile robot in an outdoor environment. Transaction on Robotics and Automation, 8(5), 573–584.CrossRef
Zurück zum Zitat Taneja, A., Ballan, L., & Pollefeys, M. (2012). Registration of spherical panoramic images with cadastral 3d models. In 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT) (pp. 479–486). Taneja, A., Ballan, L., & Pollefeys, M. (2012). Registration of spherical panoramic images with cadastral 3d models. In 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT) (pp. 479–486).
Zurück zum Zitat Thompson, W. B., Henderson, T. C., Colvin, T. L., Dick, L. B., & Valiquette, C. M. (1993). Vision-based localization. In Image Understanding Workshop (pp. 491–498). Thompson, W. B., Henderson, T. C., Colvin, T. L., Dick, L. B., & Valiquette, C. M. (1993). Vision-based localization. In Image Understanding Workshop (pp. 491–498).
Zurück zum Zitat Vasilevskiy, A., & Siddiqi, K. (2002). Flux maximizing geometric flows. In Transactions on Pattern Analysis and Machine Intelligence (PAMI) (pp. 1565–1578). Vasilevskiy, A., & Siddiqi, K. (2002). Flux maximizing geometric flows. In Transactions on Pattern Analysis and Machine Intelligence (PAMI) (pp. 1565–1578).
Zurück zum Zitat Woo, J., Son, K., Li, T., Kim, G. S., & Kweon, I.-S. (2007). Vision-based uav navigation in mountain area. In MVA (pp. 236–239). Woo, J., Son, K., Li, T., Kim, G. S., & Kweon, I.-S. (2007). Vision-based uav navigation in mountain area. In MVA (pp. 236–239).
Zurück zum Zitat Yang, M., Kpalma, K., & Ronsin, J. (2008). A survey of shape feature extraction techniques. In P.-Y. Yin (Ed.), Pattern recognition (pp. 43–90). Yang, M., Kpalma, K., & Ronsin, J. (2008). A survey of shape feature extraction techniques. In P.-Y. Yin (Ed.), Pattern recognition (pp. 43–90).
Metadaten
Titel
Image Based Geo-localization in the Alps
verfasst von
Olivier Saurer
Georges Baatz
Kevin Köser
L’ubor Ladický
Marc Pollefeys
Publikationsdatum
01.02.2016
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 3/2016
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-015-0830-0

Weitere Artikel der Ausgabe 3/2016

International Journal of Computer Vision 3/2016 Zur Ausgabe

Premium Partner