Skip to main content
Top

2019 | OriginalPaper | Chapter

Towards Dynamic Data Placement for Polystore Ingestion

Authors : Jiang Du, John Meehan, Nesime Tatbul, Stan Zdonik

Published in: Real-Time Business Intelligence and Analytics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Integrating low-latency data streaming into data warehouse architectures has become an important enhancement to support modern data warehousing applications. In these architectures, heterogeneous workloads with data ingestion and analytical queries must be executed with strict performance guarantees. Furthermore, the data warehouse may consists of multiple different types of storage engines (a.k.a., polystores or multi-stores). A paramount problem is data placement; different workload scenarios call for different data placement designs. Moreover, workload conditions change frequently. In this paper, we provide evidence that a dynamic, workload-driven approach is needed for data placement in polystores with low-latency data ingestion support. We study the problem based on the characteristics of the TPC-DI benchmark in the context of an abbreviated polystore that consists of S-Store and Postgres.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
4.
go back to reference Altinel, M., Bornhovd, C., Krishnamurthy, S., Mohan, C., Pirahesh, H., Reinwald, B.: Cache tables: paving the way for an adaptive database cache. In: VLDB, pp. 718–729 (2003) Altinel, M., Bornhovd, C., Krishnamurthy, S., Mohan, C., Pirahesh, H., Reinwald, B.: Cache tables: paving the way for an adaptive database cache. In: VLDB, pp. 718–729 (2003)
5.
go back to reference Barber, R., et al.: Wildfire: concurrent blazing data ingest and analytics. In: SIGMOD, pp. 2077–2080 (2016) Barber, R., et al.: Wildfire: concurrent blazing data ingest and analytics. In: SIGMOD, pp. 2077–2080 (2016)
6.
go back to reference Bruno, N., Chaudhuri, S.: An online approach to physical design tuning. In: ICDE, pp. 826–835 (2007) Bruno, N., Chaudhuri, S.: An online approach to physical design tuning. In: ICDE, pp. 826–835 (2007)
7.
go back to reference Cetintemel, U., et al.: S-Store: a streaming NewSQL system for big velocity applications. PVLDB 7(13), 1633–1636 (2014) Cetintemel, U., et al.: S-Store: a streaming NewSQL system for big velocity applications. PVLDB 7(13), 1633–1636 (2014)
8.
go back to reference Cudre-Mauroux, P., et al.: A demonstration of SciDB: a science-oriented DBMS. PVLDB 2(2), 1534–1537 (2009) Cudre-Mauroux, P., et al.: A demonstration of SciDB: a science-oriented DBMS. PVLDB 2(2), 1534–1537 (2009)
9.
go back to reference DeBrabant, J., Pavlo, A., Tu, S., Stonebraker, M., Zdonik, S.: Anti-caching: a new approach to database management system architecture. PVLDB 6(14), 1942–1953 (2013) DeBrabant, J., Pavlo, A., Tu, S., Stonebraker, M., Zdonik, S.: Anti-caching: a new approach to database management system architecture. PVLDB 6(14), 1942–1953 (2013)
10.
go back to reference Du, J., Glavic, B., Tan, W., Miller, R.J.: DeepSea: progressive workload-aware partitioning of materialized views in scalable data analytics. In: EDBT, pp. 198–209 (2017) Du, J., Glavic, B., Tan, W., Miller, R.J.: DeepSea: progressive workload-aware partitioning of materialized views in scalable data analytics. In: EDBT, pp. 198–209 (2017)
11.
go back to reference Elmore, A., et al.: A demonstration of the BigDAWG polystore system. PVLDB 8(12), 1908–1911 (2015) Elmore, A., et al.: A demonstration of the BigDAWG polystore system. PVLDB 8(12), 1908–1911 (2015)
12.
go back to reference Fernandez, R.C., et al.: Liquid: unifying nearline and offline big data integration. In: CIDR (2015) Fernandez, R.C., et al.: Liquid: unifying nearline and offline big data integration. In: CIDR (2015)
13.
go back to reference Fitzpatrick, B.: Distributed caching with memcached. Linux J. 124, 5–5 (2004) Fitzpatrick, B.: Distributed caching with memcached. Linux J. 124, 5–5 (2004)
14.
go back to reference Golab, L., Johnson, T., Seidel, J.S., Shkapenyuk, V.: Stream warehousing with DataDepot. In: SIGMOD, pp. 847–854 (2009) Golab, L., Johnson, T., Seidel, J.S., Shkapenyuk, V.: Stream warehousing with DataDepot. In: SIGMOD, pp. 847–854 (2009)
15.
go back to reference Josifovski, V., Schwarz, P., Haas, L., Lin, E.: Garlic: a new flavor of federated query processing for DB2. In: SIGMOD, pp. 524–532 (2002) Josifovski, V., Schwarz, P., Haas, L., Lin, E.: Garlic: a new flavor of federated query processing for DB2. In: SIGMOD, pp. 524–532 (2002)
16.
go back to reference Kallman, R., et al.: H-Store: a high-performance, distributed main memory transaction processing system. PVLDB 1(2), 1496–1499 (2008) Kallman, R., et al.: H-Store: a high-performance, distributed main memory transaction processing system. PVLDB 1(2), 1496–1499 (2008)
17.
go back to reference Kreps, J., Narkhede, N., Rao, J.: Kafka: a distributed messaging system for log processing. In: NetDB Workshop (2011) Kreps, J., Narkhede, N., Rao, J.: Kafka: a distributed messaging system for log processing. In: NetDB Workshop (2011)
18.
go back to reference LeFevre, J., Sankaranarayanan, J., Hacigumus, H., Tatemura, J., Polyzotis, N., Carey, M.J.: MISO: souping up big data query processing with a multistore system. In: SIGMOD, pp. 1591–1602 (2014) LeFevre, J., Sankaranarayanan, J., Hacigumus, H., Tatemura, J., Polyzotis, N., Carey, M.J.: MISO: souping up big data query processing with a multistore system. In: SIGMOD, pp. 1591–1602 (2014)
19.
go back to reference Meehan, J., Aslantas, C., Zdonik, S., Tatbul, N., Du, J.: Data ingestion for the connected world. In: CIDR (2017) Meehan, J., Aslantas, C., Zdonik, S., Tatbul, N., Du, J.: Data ingestion for the connected world. In: CIDR (2017)
20.
go back to reference Meehan, J., et al.: S-Store: streaming meets transaction processing. PVDLB 8(13), 2134–2145 (2015)MathSciNet Meehan, J., et al.: S-Store: streaming meets transaction processing. PVDLB 8(13), 2134–2145 (2015)MathSciNet
21.
go back to reference Meehan, J., et al.: Integrating real-time and batch processing in a polystore. In: IEEE HPEC (2016) Meehan, J., et al.: Integrating real-time and batch processing in a polystore. In: IEEE HPEC (2016)
22.
go back to reference Özsu, M.T., Valduriez, P.: Distributed database systems: where are we now? IEEE Comput. 24(8), 68–78 (1991)CrossRef Özsu, M.T., Valduriez, P.: Distributed database systems: where are we now? IEEE Comput. 24(8), 68–78 (1991)CrossRef
23.
go back to reference Poess, M., Rabl, T., Jacobsen, H., Caufield, B.: TPC-DI: the first industry benchmark for data integration. PVLDB 7(13), 1367–1378 (2014) Poess, M., Rabl, T., Jacobsen, H., Caufield, B.: TPC-DI: the first industry benchmark for data integration. PVLDB 7(13), 1367–1378 (2014)
24.
go back to reference Stonebraker, M., et al.: C-store: a column-oriented DBMS. In: VLDB, pp. 553–564 (2005) Stonebraker, M., et al.: C-store: a column-oriented DBMS. In: VLDB, pp. 553–564 (2005)
25.
go back to reference Stonebraker, M., Cetintemel, U.: “One size fits all”: an idea whose time has come and gone. In: ICDE, pp. 2–11 (2005) Stonebraker, M., Cetintemel, U.: “One size fits all”: an idea whose time has come and gone. In: ICDE, pp. 2–11 (2005)
26.
go back to reference Tatbul, N., et al.: Handling shared, mutable state in stream processing with correctness guarantees. IEEE Data Eng. Bull. Special Issue Next-Gener. Stream Process. 38(4), 94–104 (2015) Tatbul, N., et al.: Handling shared, mutable state in stream processing with correctness guarantees. IEEE Data Eng. Bull. Special Issue Next-Gener. Stream Process. 38(4), 94–104 (2015)
Metadata
Title
Towards Dynamic Data Placement for Polystore Ingestion
Authors
Jiang Du
John Meehan
Nesime Tatbul
Stan Zdonik
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-24124-7_13