skip to main content
research-article
Free Access

Building Rome in a day

Published:01 October 2011Publication History
Skip Abstract Section

Abstract

We present a system that can reconstruct 3D geometry from large, unorganized collections of photographs such as those found by searching for a given city (e.g., Rome) on Internet photo-sharing sites. Our system is built on a set of new, distributed computer vision algorithms for image matching and 3D reconstruction, designed to maximize parallelism at each stage of the pipeline and to scale gracefully with both the size of the problem and the amount of available computation. Our experimental results demonstrate that it is now possible to reconstruct city-scale image collections with more than a hundred thousand images in less than a day.

References

  1. Agarwal, S., Snavely, N., Seitz, S.M., Szeliski, R. Bundle adjustment in the large. In ECCV (2), volume 6312 of Lecture Notes in Computer Science (2010). K. Daniilidis, P. Maragos, and N. Paragios, eds. Springer, Berlin, Germany, 29--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Antone, M.E., Teller, S.J. Scalable extrinsic calibration of omnidirectional image networks. Int. J. Comput. Vis. 49, 2--3 (2002), 143--174. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Arya, S., Mount, D.M., Netanyahu, N.S., Silverman, R., Wu, A.Y. An optimal algorithm for approximate nearest neighbor searching fixed dimensions. J. ACM 45, 6 (1998), 891--923. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chen, Y., Davis, T.A., Hager, W.W., Rajamanickam, S. Algorithm 887: CHOLMOD, supernodal sparse Cholesky factorization and update/ downdate. ACM Trans. Math. Softw. 35, 3 (2008), 1--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A. Total recall: Automatic query expansion with a generative feature model for object retrieval. In ICCV (2007), IEEE, 1--8.Google ScholarGoogle ScholarCross RefCross Ref
  6. Fischler, M.A., Bolles, R.C. Random sample consensus: A paradigm for model fitting with application to image analysis and automated cartography. Commun. Assoc. Comp. Mach. 24 (1981), 381--395. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Frahm, J.-M., Georgel, P.F., Gallup, D., Johnson, T., Raguram, R., Wu, C., Jen, Y.-H., Dunn, E., Clipp, B., Lazebnik, S. Building Rome on a cloudless day. In ECCV (4), volume 6314 of Lecture Notes in Computer Science (2010). K. Daniilidis, P. Maragos, and N. Paragios, eds. Springer, Berlin, Germany, 368--381. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Früh, C., Zakhor, A. An automated method for large-scale, ground-based city model acquisition. Int. J. Comput. Vis. 60, 1 (2004), 5--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Furukawa, Y., Curless, B., Seitz, S.M., Szeliski, R. Towards internet-scale multi-view stereo. In CVPR (2010), IEEE, 1434--1441.Google ScholarGoogle ScholarCross RefCross Ref
  10. Hartley, R.I., Zisserman, A. Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge, U.K., 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jones, K. A statistical interpretation of term specificity and its application in retrieval. J. Doc. 60, 5 (2004), 493--502.Google ScholarGoogle Scholar
  12. Karypis, G., Kumar, V. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Sci. Comput. 20, 1 (1998), 359--392. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Lowe, D. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 2 (2004), 91--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Nistér, D., Stewénius, H. Scalable recognition with a vocabulary tree. In CVPR (2) (2006), IEEE Computer Society, 2161--2168. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Pollefeys, M., Nister, D., Frahm, J., Akbarzadeh, A., Mordohai, P., Clipp, B., Engels, C., Gallup, D., Kim, S., Merrell, P. et al. Detailed real-time urban 3d reconstruction from video. IJCV 78, 2 (2008), 143--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Schindler, G., Brown, M., Szeliski, R. City-scale location recognition. In CVPR (2007), IEEE Computer Society.Google ScholarGoogle ScholarCross RefCross Ref
  17. Sivic, J., Zisserman, A. Video Google: A text retrieval approach to object matching in videos. In ICCV (2003), 1470--1477. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Snavely, N., Seitz, S.M., Szeliski, R. Photo Tourism: Exploring photo collections in 3d. ACM Trans. Graph. 25, 3 (2006), 835--846. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Snavely, N., Seitz, S.M., Szeliski, R. Skeletal graphs for efficient structure from motion. In CVPR (2008), IEEE Computer Society.Google ScholarGoogle ScholarCross RefCross Ref
  20. Triggs, B., McLauchlan, P., Hartley, R.I., Fitzgibbon, A. Bundle adjustment---A modern synthesis. In Vision Algorithms '99 (1999), 298--372. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Zebedin, L., Bauer, J., Karner, K.F., Bischof, H. Fusion of feature-and area-based information for urban buildings modelling from aerial imagery. In ECCV (4), volume 5305 of Lecture Notes in Computer Science (2008). D.A. Forsyth, P.H.S. Torr, and A. Zisserman, eds. Springer, Berlin, Germany, 873--886. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Building Rome in a day

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image Communications of the ACM
            Communications of the ACM  Volume 54, Issue 10
            October 2011
            126 pages
            ISSN:0001-0782
            EISSN:1557-7317
            DOI:10.1145/2001269
            Issue’s Table of Contents

            Copyright © 2011 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 1 October 2011

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Popular
            • Refereed

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format