skip to main content
research-article

Column-oriented database systems

Published:01 August 2009Publication History
Skip Abstract Section

Abstract

Column-oriented database systems (column-stores) have attracted a lot of attention in the past few years. Column-stores, in a nutshell, store each database table column separately, with attribute values belonging to the same column stored contiguously, compressed, and densely packed, as opposed to traditional database systems that store entire records (rows) one after the other. Reading a subset of a table's columns becomes faster, at the potential expense of excessive disk-head seeking from column to column for scattered reads or updates. After several dozens of research papers and at least a dozen of new column-store start-ups, several questions remain. Are these a new breed of systems or simply old wine in new bottles? How easily can a major row-based system achieve column-store performance? Are column-stores the answer to effortlessly support large-scale data-intensive applications? What are the new, exciting system research problems to tackle? What are the new applications that can be potentially enabled by column-stores? In this tutorial, we present an overview of column-oriented database system technology and address these and other related questions.

References

  1. Abadi, D. J., Madden, S. R., and Ferreira, M.: Integrating compression and execution in column-oriented database systems. In Proc. SIGMOD, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Abadi, D. J., Myers, D. S., DeWitt, D. J., and Madden, S. R.: Materialization strategies in a column-oriented DBMS. In Proc. ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  3. Abadi, D. J., Madden, S. R., and Hachem, N.: Column-stores vs. row-stores: how different are they really? In Proc. SIGMOD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Boncz, P. A. Monet: A Next-Generation DBMS Kernel For Query-Intensive Applications. Ph.D. Thesis, Universiteit van Amsterdam, Amsterdam, The Netherlands, May 2002.Google ScholarGoogle Scholar
  5. Copeland, G. P., Khoshafian, S. N.: A Decomposition Storage Model. In Proc. SIGMOD, 1985. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Harizopoulos, S., Liang, V., Abadi, D. J., and Madden, S.: Performance tradeoffs in read-optimized databases. In Proc. VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Stonebraker, M. et al.: C-Store: A Column-oriented DBMS. In Proc. VLDB, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Tsirogiannis, D., Harizopoulos, S., Shah, M. A., Wiener, J. L., and Graefe, G.: Query processing techniques for solid state drives. In Proc. SIGMOD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Zukowski, M., Heman, S., Nes, N., and Boncz, P. A.: Superscalar ram-cpu cache compression. In Proc. ICDE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Zukowski, M., Heman, S., Nes, N., and Boncz, P. A. Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS. In Proc. VLDB, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Column-oriented database systems

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Proceedings of the VLDB Endowment
        Proceedings of the VLDB Endowment  Volume 2, Issue 2
        August 2009
        367 pages

        Publisher

        VLDB Endowment

        Publication History

        • Published: 1 August 2009
        Published in pvldb Volume 2, Issue 2

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader