Abstract
We present a system that can reconstruct 3D geometry from large, unorganized collections of photographs such as those found by searching for a given city (e.g., Rome) on Internet photo-sharing sites. Our system is built on a set of new, distributed computer vision algorithms for image matching and 3D reconstruction, designed to maximize parallelism at each stage of the pipeline and to scale gracefully with both the size of the problem and the amount of available computation. Our experimental results demonstrate that it is now possible to reconstruct city-scale image collections with more than a hundred thousand images in less than a day.
- Agarwal, S., Snavely, N., Seitz, S.M., Szeliski, R. Bundle adjustment in the large. In ECCV (2), volume 6312 of Lecture Notes in Computer Science (2010). K. Daniilidis, P. Maragos, and N. Paragios, eds. Springer, Berlin, Germany, 29--42. Google ScholarDigital Library
- Antone, M.E., Teller, S.J. Scalable extrinsic calibration of omnidirectional image networks. Int. J. Comput. Vis. 49, 2--3 (2002), 143--174. Google ScholarDigital Library
- Arya, S., Mount, D.M., Netanyahu, N.S., Silverman, R., Wu, A.Y. An optimal algorithm for approximate nearest neighbor searching fixed dimensions. J. ACM 45, 6 (1998), 891--923. Google ScholarDigital Library
- Chen, Y., Davis, T.A., Hager, W.W., Rajamanickam, S. Algorithm 887: CHOLMOD, supernodal sparse Cholesky factorization and update/ downdate. ACM Trans. Math. Softw. 35, 3 (2008), 1--14. Google ScholarDigital Library
- Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A. Total recall: Automatic query expansion with a generative feature model for object retrieval. In ICCV (2007), IEEE, 1--8.Google ScholarCross Ref
- Fischler, M.A., Bolles, R.C. Random sample consensus: A paradigm for model fitting with application to image analysis and automated cartography. Commun. Assoc. Comp. Mach. 24 (1981), 381--395. Google ScholarDigital Library
- Frahm, J.-M., Georgel, P.F., Gallup, D., Johnson, T., Raguram, R., Wu, C., Jen, Y.-H., Dunn, E., Clipp, B., Lazebnik, S. Building Rome on a cloudless day. In ECCV (4), volume 6314 of Lecture Notes in Computer Science (2010). K. Daniilidis, P. Maragos, and N. Paragios, eds. Springer, Berlin, Germany, 368--381. Google ScholarDigital Library
- Früh, C., Zakhor, A. An automated method for large-scale, ground-based city model acquisition. Int. J. Comput. Vis. 60, 1 (2004), 5--24. Google ScholarDigital Library
- Furukawa, Y., Curless, B., Seitz, S.M., Szeliski, R. Towards internet-scale multi-view stereo. In CVPR (2010), IEEE, 1434--1441.Google ScholarCross Ref
- Hartley, R.I., Zisserman, A. Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge, U.K., 2003. Google ScholarDigital Library
- Jones, K. A statistical interpretation of term specificity and its application in retrieval. J. Doc. 60, 5 (2004), 493--502.Google Scholar
- Karypis, G., Kumar, V. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Sci. Comput. 20, 1 (1998), 359--392. Google ScholarDigital Library
- Lowe, D. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 2 (2004), 91--110. Google ScholarDigital Library
- Nistér, D., Stewénius, H. Scalable recognition with a vocabulary tree. In CVPR (2) (2006), IEEE Computer Society, 2161--2168. Google ScholarDigital Library
- Pollefeys, M., Nister, D., Frahm, J., Akbarzadeh, A., Mordohai, P., Clipp, B., Engels, C., Gallup, D., Kim, S., Merrell, P. et al. Detailed real-time urban 3d reconstruction from video. IJCV 78, 2 (2008), 143--167. Google ScholarDigital Library
- Schindler, G., Brown, M., Szeliski, R. City-scale location recognition. In CVPR (2007), IEEE Computer Society.Google ScholarCross Ref
- Sivic, J., Zisserman, A. Video Google: A text retrieval approach to object matching in videos. In ICCV (2003), 1470--1477. Google ScholarDigital Library
- Snavely, N., Seitz, S.M., Szeliski, R. Photo Tourism: Exploring photo collections in 3d. ACM Trans. Graph. 25, 3 (2006), 835--846. Google ScholarDigital Library
- Snavely, N., Seitz, S.M., Szeliski, R. Skeletal graphs for efficient structure from motion. In CVPR (2008), IEEE Computer Society.Google ScholarCross Ref
- Triggs, B., McLauchlan, P., Hartley, R.I., Fitzgibbon, A. Bundle adjustment---A modern synthesis. In Vision Algorithms '99 (1999), 298--372. Google ScholarDigital Library
- Zebedin, L., Bauer, J., Karner, K.F., Bischof, H. Fusion of feature-and area-based information for urban buildings modelling from aerial imagery. In ECCV (4), volume 5305 of Lecture Notes in Computer Science (2008). D.A. Forsyth, P.H.S. Torr, and A. Zisserman, eds. Springer, Berlin, Germany, 873--886. Google ScholarDigital Library
Index Terms
- Building Rome in a day
Recommendations
Processing 6 billion CDRs/day: from research to production (experience report)
DEBS '12: Proceedings of the 6th ACM International Conference on Distributed Event-Based SystemsA call detail record (CDR), is a data record produced by a telephone exchange or other telecommunications equipment documenting the details of a phone call that passed through the exchange or equipment. Telecommunications companies (or "telcos") use ...
Building Rome on a cloudless day
ECCV'10: Proceedings of the 11th European conference on Computer vision: Part IVThis paper introduces an approach for dense 3D reconstruction from unregistered Internet-scale photo collections with about 3 million images within the span of a day on a single PC ("cloudless"). Our method advances image clustering, stereo, stereo ...
Next day load forecasting using SVM
ISNN'05: Proceedings of the Second international conference on Advances in Neural Networks - Volume Part IIIBased on similar day method and SVM, this paper proposes a new method for next day load forecasting. The new method uses the parameters of several similar days, instead of only selecting one similar day as in similar day method. The parameters of ...
Comments