skip to main content
10.1145/2933267.2933304acmconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
research-article

Frames: data-driven windows

Published:13 June 2016Publication History

ABSTRACT

Traditional Data Stream Management Systems (DSMS) segment data streams using windows that are defined either by a time interval or a number of tuples. Such windows are fixed---the definition unvarying over the course of a stream---and are defined based on external properties unrelated to the data content of the stream. However, streams and their content do vary over time---the rate of a data stream may vary or the data distribution of the content may vary. The mismatch between a fixed stream segmentation and a variable stream motivates the need for a more flexible, expressive and physically independent stream segmentation. We introduce a new stream segmentation technique, called frames. Frames segment streams based on data content. We present a theory and implementation of frames and show the utility of frames for a variety of applications.

References

  1. A. Adi and O. Etzion. Amit -- The Situation Manager. VLDB J., 13(2):177--203, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Agrawal et al. Efficient Pattern Matching over Event Streams. In SIGMOD, pages 147--160, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Akidau et al. The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-scale, Unbounded, Out-of-order Data Processing. Proc. VLDB Endow., 8(12):1792--1803, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Artikis et al. Run-time Composite Event Recognition. In DEBS, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. I. Botan et al. Extending XQuery with Window Functions. In VLDB, pages 75--86, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L. Brenna et al. Cayuga: A High-Performance Event Processing Engine. In SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. B. Chandramouli et al. High-Performance Dynamic Pattern Matching over Disordered Streams. PVLDB, 3(1):220--231, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. K. Chui et al. S-OLAP: An OLAP System for Analyzing Sequence Data. In SIGMOD, pages 1131--1134, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. O. Etzion et al. Context-Based Event Processing Systems. In S. Helmer, A. Poulovassilis, and F. Xhafa, editors, Reasoning in Event-Based Distributed Systems, Studies in Computational Intelligence, pages 257--278. Springer, 2011.Google ScholarGoogle Scholar
  10. P. M. Fischer et al. Extending XQuery with a Pattern Matching Facility. In Proc. Intl. XML Database Symposium (XSym), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. M. Ghanem et al. Supporting Views in Data Stream Management Systems. ACM Trans. Database Syst., 35(1), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L. Golab et al. A Sequence-Oriented Stream Warehouse Paradigm for Network Monitoring Applications. In Proc. Intl. Conf. on Passive and Active Measurement (PAM), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Himberg et al. Time Series Segmentation for Context Recognition in Mobile Devices. In ICDM, pages 203--210, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Hirzel. Partition and Compose: Parallel Complex Event Processing. In DEBS, pages 191--200, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. F. Hueske. Introducing Stream Windows in Apache Flink. https://flink.apache.org/news/2015/12/04/Introducing-windows.html, 2015.Google ScholarGoogle Scholar
  16. IBM Streams Processing Language Standard Toolkit Reference, IBM Infosphere Streams V 2.0.0.4, 2012.Google ScholarGoogle Scholar
  17. Y. E. Ioannidis. The History of Histograms (abridged). In VLDB, pages 19--30, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. E. Ioannidis and V. Poosala. Balancing Histogram Optimality and Practicality for Query Result Size Estimation. In SIGMOD, pages 233--244, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Kalinin et al. Interactive Data Exploration Using Semantic Windows. In SIGMOD, pages 505--516, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. E. J. Keogh et al. An Online Algorithm for Segmenting Time Series. In ICDM, pages 289--296, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Li et al. No Pane, No Gain: Efficient Evaluation of Sliding-Window Aggregates over Data Streams. SIGMOD Record, 34(1):39--44, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Li et al. Semantics and Evaluation Techniques for Window Aggregates in Data Streams. In SIGMOD, pages 311--322, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Li et al. AdaptWID: An Adaptive, Memory-Efficient Window Aggregation Implementation. IEEE Internet Computing, 12(6):22--29, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Li et al. Out-of-Order Processing: A New Architecture for High-Performance Stream Systems. PVLDB, 1(1):274--288, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Maier et al. Capturing Episodes: May the Frame Be with You. In DEBS, pages 1--11, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. S. McReynolds. Complex Event Processing in the Real World. Oracle White Paper, Sept. 2007.Google ScholarGoogle Scholar
  27. MSDN Library. Snapshot Windows. In Developers Guide (StreamInsight): Writing Query Templates in LINQ.Google ScholarGoogle Scholar
  28. J. F. Naughton et al. The Niagara Internet Query System. IEEE Data Eng. Bull., 24(2):27--33, 2001.Google ScholarGoogle Scholar
  29. R. Sadri et al. Expressing and Optimizing Sequence Queries in Database Systems. ACM Trans. Database Syst., 29(2):282--318, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Shatkay and S. B. Zdonik. Approximate Queries and Representations for Large Data Sequences. In ICDE, pages 536--545, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. J. Whiteneck et al. Framing the Question: Detecting and Filling Spatial-temporal Windows. In Proc. Intl. Workshop on GeoStreaming (IWGS), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. F. Zemke et al. Pattern Matching in Sequences of Rows. Draft SQL Change Proposal, Mar. 2007.Google ScholarGoogle Scholar

Index Terms

  1. Frames: data-driven windows

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      DEBS '16: Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems
      June 2016
      456 pages
      ISBN:9781450340212
      DOI:10.1145/2933267

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 June 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate130of553submissions,24%

      Upcoming Conference

      DEBS '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader