nach oben

International Journal of Computer Vision

Erschienen in:

01.12.2014

Reconstructing the World’s Museums

verfasst von: Jianxiong Xiao, Yasutaka Furukawa

Erschienen in: International Journal of Computer Vision | Ausgabe 3/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Virtual exploration tools for large indoor environments (e.g. museums) have so far been limited to either blueprint-style 2D maps that lack photo-realistic views of scenes, or ground-level image-to-image transitions, which are immersive but ill-suited for navigation. On the other hand, photorealistic aerial maps would be a useful navigational guide for large indoor environments, but it is impossible to directly acquire photographs covering a large indoor environment from aerial viewpoints. This paper presents a 3D reconstruction and visualization system for automatically producing clean and well-regularized texture-mapped 3D models for large indoor scenes, from ground-level photographs and 3D laser points. The key component is a new algorithm called “inverse constructive solid geometry (CSG)” for reconstructing a scene with a CSG representation consisting of volumetric primitives, which imposes powerful regularization constraints. We also propose several novel techniques to adjust the 3D model to make it suitable for rendering the 3D maps from aerial viewpoints. The visualization system enables users to easily browse a large-scale indoor environment from a bird’s-eye view, locate specific room interiors, fly into a place of interest, view immersive ground-level panorama views, and zoom out again, all with seamless 3D transitions. We demonstrate our system on various museums, including the Metropolitan Museum of Art in New York City—one of the largest art galleries in the world.

Vorheriger Artikel Special Issue on Large-Scale Computer Vision: Geometry, Inference, and Learning

Nächster Artikel People Watching: Human Actions as a Cue for Single View Geometry

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Different from standard volumetric reconstruction algorithms (Furukawa et al. 2009b; Hernández Esteban et al. 2007), voxel resolution is not critical for accuracy in our approach, as precise surface positions are determined by primitives.

Runs are processed independently for computational and memory efficiency. If needed, a long run can be split into shorter ones, where resulting CSG models can be merged in constant time.

We did consider a large-scale graph-cut algorithm Delong and Boykov (2008), but it is still expensive. Our scheme exploits our compact 3D model, and allows easy implementation that works well in practice.

Agarwal, S., Furukawa, Y., Snavely, N., Simon, I., Curless, B., Seitz, S. M., et al. (2011). Building Rome in a day. Communications of the ACM, 54(10), 105–112.CrossRef

Agarwal, S., Snavely, N., Simon, I., Seitz, S. M., & Szeliski, R. (2009). Building rome in a day. In Proceedings of the International Conference on Computer Vision (ICCV).

Autodesk. (2012). Revit architecture.

CGAL (Computational Geometry Algorithms Library). (2012). http://www.cgal.org.

Curless, B., & Levoy, M. (1996). A volumetric method for building complex models from range images. In SIGGRAPH (pp. 303–312). doi:10.1145/237170.237269.

Delong, A., & Boykov, Y. (2008). A scalable graph-cut algorithm for n-d grids. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Felzenszwalb, P., & Huttenlocher, D. (2004). Distance transforms of sampled functions. Cornell Computing and Information Science Technical report TR2004-1963.

Furukawa, Y., Curless, B., Seitz, S. M., & Szeliski, R. (2009). Manhattan-world stereo. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Furukawa, Y., Curless, B., Seitz, S. M., & Szeliski, R. (2009). Reconstructing building interiors from images. In Proceedings of the International Conference on Computer Vision (ICCV). doi:10.1109/ICCV.2009.5459145.

Garland, M. (1998). Qslim: Quadric-based simplification algorithm.

Gupta, A., Efros, A. A., & Hebert, M. (2010). Blocks world revisited: Image understanding using qualitative geometry and mechanics. In European Conference on Computer Vision (ECCV).

Hedau, V., Hoiem, D., & Forsyth, D. (2009). Recovering the spatial layout of cluttered rooms. In Proceedings of the International Conference on Computer Vision (ICCV).

Henry, P., Krainin, M., Herbst, E., Ren, X., & Fox, D. (2010). RGB-D mapping: Using depth cameras for dense 3d modeling of indoor environments. In International Symposium on Experimental Robotics (ISER).

Hernández Esteban, C., Vogiatzis, G., & Cipolla, R. (2007). Probabilistic visibility for multi-view stereo. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Hough, P. V. (1959). Machine analysis of bubble chamber pictures. In Proceedings of International Conference on High Energy Accelerators and Instrumentation.

Huang, Q. X., & Anguelov, D. (2010). High quality pose estimation by aligning multiple scans to a latent map. In IEEE International Conference on Robotics and Automation (ICRA).

Jiang, H., & Xiao, J. (2013). A linear approach to matching cuboids in RGBD images. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Li, Y., Wu, X., Chrysathou, Y., Sharf, A., Cohen-Or, D., & Mitra, N. J. (2011). Globfit: Consistently fitting primitives by discovering global relations. In SIGGRAPH.

Liu, T., Carlberg, M., Chen, G., Chen, J., Kua, J., & Zakhor, A. (2010). Indoor localization and visualization using a human-operated backpack system. In International Conference on Indoor Positioning and Indoor Navigation (IPIN).

Pauly, M., Mitra, N. J., Wallner, J., Pottmann, H., & Guibas, L. J. (2011). Discovering structural regularity in 3d geometry. In SIGGRAPH.

Project website: http://vision.princeton.edu/projects/2012/museum/.

Rodriguez, E. V., Oliver, A. A., Huber, D., & Cerrada, C. (2011). Detection, modeling, and classification of moldings for automated reverse engineering of buildings from 3d data. In International Symposium on Automation and Robotics in Construction (ISARC).

Russell, B. C., Martin-Brualla, R., Butler, D. J., Seitz, S. M., & Zettlemoyer, L. (2013). 3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry. In SIGGRAPH, Asia.

Sanchez, V., & Zakhor, A. (2012). Planar 3d modeling of building interiors from point cloud data. In IEEE International Conference on Image Processing (ICIP).

Sinha, S. N., Steedly, D., Szeliski, R., Agrawala, M., & Pollefeys, M. (2008). Interactive 3D architectural modeling from unordered photo collections. In SIGGRAPH, Asia.

Suveg, I., & Vosselman, G. (2004). Reconstruction of 3d building models from aerial images and maps. Journal of Photogrammetry and Remote Sensing, 58, 3–4.

Uyttendaele, M., Criminisi, A., Kang, S. B., Winder, S., Szeliski, R., & Hartley, R. (2004). Image-based interactive exploration of real-world environments. In IEEE Computer Graphics and Applications (CGA).

Xiao, J., Fang, T., Tan, P., Zhao, P., Ofek, E., & Quan, L. (2008). Image-based façade modeling. In SIGGRAPH, Asia.

Xiao, J., Fang, T., Zhao, P., Lhuillier, M., & Quan, L. (2009). Image-based street-side city modeling. In SIGGRAPH, Asia.

Xiao, J., Hays, J., Russell, B. C., Patterson, G., Ehinger, K., Torralba, A., et al. (2013). Basic level scene understanding: Categories, attributes and structures. Frontiers in Psychology, 4, 506.

Xiao, J., Owens, A., & Torralba, A. (2013). SUN3D: A database of big spaces reconstructed using sfm and object labels. In IEEE International Conference on Computer Vision (ICCV).

Xiao, J., Russell, B., & Torralba, A. (2012). Localizing 3D cuboids in single-view images. Advances in Neural Information Processing Systems, 25, 620–628.

Titel: Reconstructing the World’s Museums
verfasst von: Jianxiong Xiao
Yasutaka Furukawa
Publikationsdatum: 01.12.2014
Verlag: Springer US
Erschienen in: International Journal of Computer Vision / Ausgabe 3/2014
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-014-0711-y

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 3/2014

Photo Sequencing

Low-Rank Bilinear Classification: Efficient Convex Optimization and Extensions

Filter-Based Mean-Field Inference for Random Fields with Higher-Order Terms and Product Label-Spaces

ImageNet Auto-Annotation with Segmentation Propagation

Special Issue on Large-Scale Computer Vision: Geometry, Inference, and Learning

People Watching: Human Actions as a Cue for Single View Geometry

Premium Partner