Skip to main content
Erschienen in: International Journal of Computer Vision 3/2014

01.12.2014

Reconstructing the World’s Museums

verfasst von: Jianxiong Xiao, Yasutaka Furukawa

Erschienen in: International Journal of Computer Vision | Ausgabe 3/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Virtual exploration tools for large indoor environments (e.g. museums) have so far been limited to either blueprint-style 2D maps that lack photo-realistic views of scenes, or ground-level image-to-image transitions, which are immersive but ill-suited for navigation. On the other hand, photorealistic aerial maps would be a useful navigational guide for large indoor environments, but it is impossible to directly acquire photographs covering a large indoor environment from aerial viewpoints. This paper presents a 3D reconstruction and visualization system for automatically producing clean and well-regularized texture-mapped 3D models for large indoor scenes, from ground-level photographs and 3D laser points. The key component is a new algorithm called “inverse constructive solid geometry (CSG)” for reconstructing a scene with a CSG representation consisting of volumetric primitives, which imposes powerful regularization constraints. We also propose several novel techniques to adjust the 3D model to make it suitable for rendering the 3D maps from aerial viewpoints. The visualization system enables users to easily browse a large-scale indoor environment from a bird’s-eye view, locate specific room interiors, fly into a place of interest, view immersive ground-level panorama views, and zoom out again, all with seamless 3D transitions. We demonstrate our system on various museums, including the Metropolitan Museum of Art in New York City—one of the largest art galleries in the world.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
Different from standard volumetric reconstruction algorithms (Furukawa et al. 2009b; Hernández Esteban et al. 2007), voxel resolution is not critical for accuracy in our approach, as precise surface positions are determined by primitives.
 
2
Runs are processed independently for computational and memory efficiency. If needed, a long run can be split into shorter ones, where resulting CSG models can be merged in constant time.
 
3
We did consider a large-scale graph-cut algorithm Delong and Boykov (2008), but it is still expensive. Our scheme exploits our compact 3D model, and allows easy implementation that works well in practice.
 
Literatur
Zurück zum Zitat Agarwal, S., Furukawa, Y., Snavely, N., Simon, I., Curless, B., Seitz, S. M., et al. (2011). Building Rome in a day. Communications of the ACM, 54(10), 105–112.CrossRef Agarwal, S., Furukawa, Y., Snavely, N., Simon, I., Curless, B., Seitz, S. M., et al. (2011). Building Rome in a day. Communications of the ACM, 54(10), 105–112.CrossRef
Zurück zum Zitat Agarwal, S., Snavely, N., Simon, I., Seitz, S. M., & Szeliski, R. (2009). Building rome in a day. In Proceedings of the International Conference on Computer Vision (ICCV). Agarwal, S., Snavely, N., Simon, I., Seitz, S. M., & Szeliski, R. (2009). Building rome in a day. In Proceedings of the International Conference on Computer Vision (ICCV).
Zurück zum Zitat Autodesk. (2012). Revit architecture. Autodesk. (2012). Revit architecture.
Zurück zum Zitat Delong, A., & Boykov, Y. (2008). A scalable graph-cut algorithm for n-d grids. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Delong, A., & Boykov, Y. (2008). A scalable graph-cut algorithm for n-d grids. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Felzenszwalb, P., & Huttenlocher, D. (2004). Distance transforms of sampled functions. Cornell Computing and Information Science Technical report TR2004-1963. Felzenszwalb, P., & Huttenlocher, D. (2004). Distance transforms of sampled functions. Cornell Computing and Information Science Technical report TR2004-1963.
Zurück zum Zitat Furukawa, Y., Curless, B., Seitz, S. M., & Szeliski, R. (2009). Manhattan-world stereo. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Furukawa, Y., Curless, B., Seitz, S. M., & Szeliski, R. (2009). Manhattan-world stereo. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Furukawa, Y., Curless, B., Seitz, S. M., & Szeliski, R. (2009). Reconstructing building interiors from images. In Proceedings of the International Conference on Computer Vision (ICCV). doi:10.1109/ICCV.2009.5459145. Furukawa, Y., Curless, B., Seitz, S. M., & Szeliski, R. (2009). Reconstructing building interiors from images. In Proceedings of the International Conference on Computer Vision (ICCV). doi:10.​1109/​ICCV.​2009.​5459145.
Zurück zum Zitat Garland, M. (1998). Qslim: Quadric-based simplification algorithm. Garland, M. (1998). Qslim: Quadric-based simplification algorithm.
Zurück zum Zitat Gupta, A., Efros, A. A., & Hebert, M. (2010). Blocks world revisited: Image understanding using qualitative geometry and mechanics. In European Conference on Computer Vision (ECCV). Gupta, A., Efros, A. A., & Hebert, M. (2010). Blocks world revisited: Image understanding using qualitative geometry and mechanics. In European Conference on Computer Vision (ECCV).
Zurück zum Zitat Hedau, V., Hoiem, D., & Forsyth, D. (2009). Recovering the spatial layout of cluttered rooms. In Proceedings of the International Conference on Computer Vision (ICCV). Hedau, V., Hoiem, D., & Forsyth, D. (2009). Recovering the spatial layout of cluttered rooms. In Proceedings of the International Conference on Computer Vision (ICCV).
Zurück zum Zitat Henry, P., Krainin, M., Herbst, E., Ren, X., & Fox, D. (2010). RGB-D mapping: Using depth cameras for dense 3d modeling of indoor environments. In International Symposium on Experimental Robotics (ISER). Henry, P., Krainin, M., Herbst, E., Ren, X., & Fox, D. (2010). RGB-D mapping: Using depth cameras for dense 3d modeling of indoor environments. In International Symposium on Experimental Robotics (ISER).
Zurück zum Zitat Hernández Esteban, C., Vogiatzis, G., & Cipolla, R. (2007). Probabilistic visibility for multi-view stereo. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Hernández Esteban, C., Vogiatzis, G., & Cipolla, R. (2007). Probabilistic visibility for multi-view stereo. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Hough, P. V. (1959). Machine analysis of bubble chamber pictures. In Proceedings of International Conference on High Energy Accelerators and Instrumentation. Hough, P. V. (1959). Machine analysis of bubble chamber pictures. In Proceedings of International Conference on High Energy Accelerators and Instrumentation.
Zurück zum Zitat Huang, Q. X., & Anguelov, D. (2010). High quality pose estimation by aligning multiple scans to a latent map. In IEEE International Conference on Robotics and Automation (ICRA). Huang, Q. X., & Anguelov, D. (2010). High quality pose estimation by aligning multiple scans to a latent map. In IEEE International Conference on Robotics and Automation (ICRA).
Zurück zum Zitat Jiang, H., & Xiao, J. (2013). A linear approach to matching cuboids in RGBD images. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Jiang, H., & Xiao, J. (2013). A linear approach to matching cuboids in RGBD images. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Li, Y., Wu, X., Chrysathou, Y., Sharf, A., Cohen-Or, D., & Mitra, N. J. (2011). Globfit: Consistently fitting primitives by discovering global relations. In SIGGRAPH. Li, Y., Wu, X., Chrysathou, Y., Sharf, A., Cohen-Or, D., & Mitra, N. J. (2011). Globfit: Consistently fitting primitives by discovering global relations. In SIGGRAPH.
Zurück zum Zitat Liu, T., Carlberg, M., Chen, G., Chen, J., Kua, J., & Zakhor, A. (2010). Indoor localization and visualization using a human-operated backpack system. In International Conference on Indoor Positioning and Indoor Navigation (IPIN). Liu, T., Carlberg, M., Chen, G., Chen, J., Kua, J., & Zakhor, A. (2010). Indoor localization and visualization using a human-operated backpack system. In International Conference on Indoor Positioning and Indoor Navigation (IPIN).
Zurück zum Zitat Pauly, M., Mitra, N. J., Wallner, J., Pottmann, H., & Guibas, L. J. (2011). Discovering structural regularity in 3d geometry. In SIGGRAPH. Pauly, M., Mitra, N. J., Wallner, J., Pottmann, H., & Guibas, L. J. (2011). Discovering structural regularity in 3d geometry. In SIGGRAPH.
Zurück zum Zitat Rodriguez, E. V., Oliver, A. A., Huber, D., & Cerrada, C. (2011). Detection, modeling, and classification of moldings for automated reverse engineering of buildings from 3d data. In International Symposium on Automation and Robotics in Construction (ISARC). Rodriguez, E. V., Oliver, A. A., Huber, D., & Cerrada, C. (2011). Detection, modeling, and classification of moldings for automated reverse engineering of buildings from 3d data. In International Symposium on Automation and Robotics in Construction (ISARC).
Zurück zum Zitat Russell, B. C., Martin-Brualla, R., Butler, D. J., Seitz, S. M., & Zettlemoyer, L. (2013). 3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry. In SIGGRAPH, Asia. Russell, B. C., Martin-Brualla, R., Butler, D. J., Seitz, S. M., & Zettlemoyer, L. (2013). 3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry. In SIGGRAPH, Asia.
Zurück zum Zitat Sanchez, V., & Zakhor, A. (2012). Planar 3d modeling of building interiors from point cloud data. In IEEE International Conference on Image Processing (ICIP). Sanchez, V., & Zakhor, A. (2012). Planar 3d modeling of building interiors from point cloud data. In IEEE International Conference on Image Processing (ICIP).
Zurück zum Zitat Sinha, S. N., Steedly, D., Szeliski, R., Agrawala, M., & Pollefeys, M. (2008). Interactive 3D architectural modeling from unordered photo collections. In SIGGRAPH, Asia. Sinha, S. N., Steedly, D., Szeliski, R., Agrawala, M., & Pollefeys, M. (2008). Interactive 3D architectural modeling from unordered photo collections. In SIGGRAPH, Asia.
Zurück zum Zitat Suveg, I., & Vosselman, G. (2004). Reconstruction of 3d building models from aerial images and maps. Journal of Photogrammetry and Remote Sensing, 58, 3–4. Suveg, I., & Vosselman, G. (2004). Reconstruction of 3d building models from aerial images and maps. Journal of Photogrammetry and Remote Sensing, 58, 3–4.
Zurück zum Zitat Uyttendaele, M., Criminisi, A., Kang, S. B., Winder, S., Szeliski, R., & Hartley, R. (2004). Image-based interactive exploration of real-world environments. In IEEE Computer Graphics and Applications (CGA). Uyttendaele, M., Criminisi, A., Kang, S. B., Winder, S., Szeliski, R., & Hartley, R. (2004). Image-based interactive exploration of real-world environments. In IEEE Computer Graphics and Applications (CGA).
Zurück zum Zitat Xiao, J., Fang, T., Tan, P., Zhao, P., Ofek, E., & Quan, L. (2008). Image-based façade modeling. In SIGGRAPH, Asia. Xiao, J., Fang, T., Tan, P., Zhao, P., Ofek, E., & Quan, L. (2008). Image-based façade modeling. In SIGGRAPH, Asia.
Zurück zum Zitat Xiao, J., Fang, T., Zhao, P., Lhuillier, M., & Quan, L. (2009). Image-based street-side city modeling. In SIGGRAPH, Asia. Xiao, J., Fang, T., Zhao, P., Lhuillier, M., & Quan, L. (2009). Image-based street-side city modeling. In SIGGRAPH, Asia.
Zurück zum Zitat Xiao, J., Hays, J., Russell, B. C., Patterson, G., Ehinger, K., Torralba, A., et al. (2013). Basic level scene understanding: Categories, attributes and structures. Frontiers in Psychology, 4, 506. Xiao, J., Hays, J., Russell, B. C., Patterson, G., Ehinger, K., Torralba, A., et al. (2013). Basic level scene understanding: Categories, attributes and structures. Frontiers in Psychology, 4, 506.
Zurück zum Zitat Xiao, J., Owens, A., & Torralba, A. (2013). SUN3D: A database of big spaces reconstructed using sfm and object labels. In IEEE International Conference on Computer Vision (ICCV). Xiao, J., Owens, A., & Torralba, A. (2013). SUN3D: A database of big spaces reconstructed using sfm and object labels. In IEEE International Conference on Computer Vision (ICCV).
Zurück zum Zitat Xiao, J., Russell, B., & Torralba, A. (2012). Localizing 3D cuboids in single-view images. Advances in Neural Information Processing Systems, 25, 620–628. Xiao, J., Russell, B., & Torralba, A. (2012). Localizing 3D cuboids in single-view images. Advances in Neural Information Processing Systems, 25, 620–628.
Metadaten
Titel
Reconstructing the World’s Museums
verfasst von
Jianxiong Xiao
Yasutaka Furukawa
Publikationsdatum
01.12.2014
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 3/2014
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-014-0711-y

Weitere Artikel der Ausgabe 3/2014

International Journal of Computer Vision 3/2014 Zur Ausgabe

OriginalPaper

Photo Sequencing

Premium Partner