skip to main content
10.1145/3328905.3332511acmconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
poster

Generating Reproducible Out-of-Order Data Streams

Published:24 June 2019Publication History

ABSTRACT

Evaluating modern stream processing systems in a reproducible manner requires data streams with different data distributions, data rates, and real-world characteristics such as delayed and out-of-order tuples. In this paper, we present an open source stream generator which generates reproducible and deterministic out-of-order streams based on real data files, simulating arbitrary fractions of out-of-order tuples and their respective delays.

References

  1. Tyler Akidau, Robert Bradshaw, et al. 2015. The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing. In VLDB.Google ScholarGoogle Scholar
  2. Davide Anguita, Alessandro Ghio, Luca Oneto, et al. 2013. A public domain dataset for human activity recognition using smartphones.. In Esann.Google ScholarGoogle Scholar
  3. Savong BOU, Hiroyuki KITAGAWA, and Toshiyuki AMAGASA. 2018. CBiX: Incremental Sliding-Window Aggregation For Real-Time Analytics Over Out-of-Order Data Streams. In DEIM.Google ScholarGoogle Scholar
  4. Zbigniew Jerzak, Thomas Heinze, Matthias Fehr, et al. 2012. The DEBS 2012 Grand Challenge. In DEBS.Google ScholarGoogle Scholar
  5. Jeyhun Karimov, Tilmann Rabl, Asterios Katsifodimos, Roman Samarev, Henri Heiskanen, and Volker Markl. 2018. Benchmarking distributed stream data processing systems. In ICDE.Google ScholarGoogle Scholar
  6. Jin Li, David Maier, Kristin Tufte, et al. 2005. Semantics and evaluation techniques for window aggregates in data streams. In SIGMOD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Christopher Mutschler, Holger Ziekow, and Zbigniew Jerzak. 2013. The DEBS 2013 Grand Challenge. In DEBS.Google ScholarGoogle Scholar
  8. Kanat Tangwongsan, Martin Hirzel, and Scott Schneider. 2018. Sub-O (log n) Out-of-Order Sliding-Window Aggregation. arXiv preprint (2018).Google ScholarGoogle Scholar
  9. New York City Taxi and Limousine Commission. {n.d.}. Tlc trip record data. https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page.Google ScholarGoogle Scholar
  10. Jonas Traub, Philipp M. Grulich, Alejandro R. Cuéllar, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, and Volker Markl. 2018. Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing. In ICDE.Google ScholarGoogle Scholar
  11. Jonas Traub, Philipp M. Grulich, Alejandro R. Cuéllar, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, and Volker Markl. 2019. Efficient Window Aggregation with General Stream Slicing. In EDBT.Google ScholarGoogle Scholar
  12. Jonas Traub, Nikolaas Steenbergen, Philipp Grulich, Tilmann Rabl, and Volker Markl. 2017. I2: Interactive Real-Time Visualization for Streaming Data.. In EDBT.Google ScholarGoogle Scholar

Index Terms

  1. Generating Reproducible Out-of-Order Data Streams

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      DEBS '19: Proceedings of the 13th ACM International Conference on Distributed and Event-based Systems
      June 2019
      291 pages
      ISBN:9781450367943
      DOI:10.1145/3328905

      Copyright © 2019 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 June 2019

      Check for updates

      Qualifiers

      • poster
      • Research
      • Refereed limited

      Acceptance Rates

      DEBS '19 Paper Acceptance Rate13of47submissions,28%Overall Acceptance Rate130of553submissions,24%

      Upcoming Conference

      DEBS '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader