Abstract
This paper describes the system architecture of the Vertica Analytic Database (Vertica), a commercialization of the design of the C-Store research prototype. Vertica demonstrates a modern commercial RDBMS system that presents a classical relational interface while at the same time achieving the high performance expected from modern "web scale" analytic systems by making appropriate architectural choices. Vertica is also an instructive lesson in how academic systems research can be directly commercialized into a successful product.
- Actian Vectorwise. http://www.actian.com/products/vectorwise.Google Scholar
- HP Completes Acquisition of Vertica Systems, Inc. http://www.hp.com/hpinfo/newsroom/press/2011/110322c.html.Google Scholar
- Infobright. http://www.infobright.com/.Google Scholar
- Oracle Hybrid Columnar Compression on Exadata. http://www.oracle.com/technetwork/middleware/bi-foundation/ehcc-twp-131254.pdf.Google Scholar
- PostgreSQL. http://www.postgresql.org/.Google Scholar
- Why Verticas Compression is Better. http://www.vertica.com/2010/05/26/why-verticas-compression-is-better.Google Scholar
- A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff and R. Murthy. Hive - A Warehousing Solution Over a MapReduce Framework. PVLDB, 2(2):1626--1629, 2009. Google Scholar
- D. J. Abadi, D. S. Myers, D. J. Dewitt, and S. R. Madden. Materialization Strategies in a Column-Oriented DBMS. In ICDE, pages 466--475, 2007.Google Scholar
- B. Chattopadhyay, L. Lin, W. Liu, S. Mittal, P. Aragonda, V. Lychagina, Y. Kwon and M. Wong. Tenzing: A SQL Implementation On The MapReduce framework. PVLDB, 4(12):1318--1327, 2011.Google Scholar
- P. A. Boncz, M. Zukowski, and N. Nes. MonetDB/X100: Hyper-Pipelining Query Execution. In CIDR, pages 225--237, 2005.Google Scholar
- S. Ceri and J. Widom. Deriving Production Rules for Incremental View Maintenance. In VLDB, pages 577--589, 1991. Google Scholar
- J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, pages 137--150, 2004. Google Scholar
- G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon's Highly Available Key-value Store. In SOSP, pages 205--220, 2007. Google Scholar
- F. Färber, S. K. Cha, J. Primsch, C. Bornhövd, S. Sigg, and W. Lehner. SAP HANA Database: Data Management for Modern Business Applications. ACM SIGMOD Record, 40(4):45--51, 2012. Google Scholar
- J. Gray and A. Reuter. Transaction Processing: Concepts and Techniques. Morgan Kaufmann Publishers Inc., 1992. Google Scholar
- P. J. Haas, J. F. Naughton, S. Seshadri, and L. Stokes. Sampling-Based Estimation of the Number of Distinct Values of an Attribute. In VLDB, pages 311--322, 1995. Google Scholar
- W. Kim. On Optimizing a SQL-like Nested Query. ACM TODS, 7(3):443--469, 1982. Google Scholar
- R. Kimball and M. Ross. The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. Wiley, John & Sons, Inc., 2002. Google Scholar
- A. Lakshman and P. Malik. Cassandra: A Decentralized Structured Storage System. SIGOPS Operating Systems Review, 44(2):35--40, 2010. Google Scholar
- P.-Å. Larson, E. N. Hanson, and S. L. Price. Columnar Storage in SQL Server 2012. IEEE Data Engineering Bulletin, 35(1):15--20, 2012.Google Scholar
- M. Stonebraker, D. J. Abadi, A. Batkin, X. Chen, M. Cherniack, M. Ferreira, E. Lau, A. Lin, S. Madden and E. J. O'Neil et.al. C-Store: A Column-oriented DBMS. In VLDB, pages 553--564, 2005. Google Scholar
- G. Moerkotte. Small Materialized Aggregates: A Light Weight Index Structure for data warehousing. In VLDB, pages 476--487, 1998. Google Scholar
- R. Barber, P. Bendel, M. Czech, O. Draese, F. Ho, N. Hrle, S. Idreos, M. S. Kim, O. Koeth and J. G. Lee et.al. Business Analytics in (a) Blink. IEEE Data Engineering Bulletin, 35(1):9--14, 2012.Google Scholar
- D. Slezak, J. Wroblewski, V. Eastwood, and P. Synak. Brighthouse: An Analytic Data Warehouse for Ad-hoc Queries. PVLDB, 1(2):1337--1345, 2008. Google Scholar
- M. Staudt and M. Jarke. Incremental Maintenance of Externally Materialized Views. In VLDB, pages 75--86, 1996. Google Scholar
- M. Stonebraker. One Size Fits All: An Idea Whose Time has Come and Gone. In ICDE, pages 2--11, 2005. Google Scholar
- J. D. Ullman. Principles of Database and Knowledge-Base Systems, Volume II. Computer Science Press, 1989.Google Scholar
Index Terms
- The vertica analytic database: C-store 7 years later
Recommendations
The vertica database: SQL RDBMS for managing big data
MBDS '12: Proceedings of the 2012 workshop on Management of big data systemsIn this presentation, we describe the architecture of the Vertica Analytic Database (Vertica), with an emphasis on the management features. Vertica combines a scale-out design, commodity hardware, and the RDBMS data management paradigm to keep SQL ...
Graph analytics using vertica relational database
BIG DATA '15: Proceedings of the 2015 IEEE International Conference on Big Data (Big Data)Graph analytics is becoming increasingly popular, with a number of new applications and systems developed in the past few years. In this paper, we study Vertica relational database as a platform for graph analytics. We show that vertex-centric graph ...
Vertica-ML: Distributed Machine Learning in Vertica Database
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataA growing number of companies rely on machine learning as a key element for gaining a competitive edge from their collected Big Data. An in-database machine learning system can provide many advantages in this scenario, e.g., eliminating the overhead of ...
Comments