skip to main content
research-article

Recognizing patterns in streams with imprecise timestamps

Published:01 September 2010Publication History
Skip Abstract Section

Abstract

Large-scale event systems are becoming increasingly popular in a variety of domains. Event pattern evaluation plays a key role in monitoring applications in these domains. Existing work on pattern evaluation, however, assumes that the occurrence time of each event is known precisely and the events from various sources can be merged into a single stream with a total or partial order. We observe that in real-world applications event occurrence times are often unknown or imprecise. Therefore, we propose a temporal model that assigns a time interval to each event to represent all of its possible occurrence times and revisit pattern evaluation under this model. In particular, we propose the formal semantics of such pattern evaluation, two evaluation frameworks, and algorithms and optimizations in these frameworks. Our evaluation results using both real traces and synthetic systems show that the event-based framework always outperforms the point-based framework and with optimizations, it achieves high efficiency for a wide range of workloads tested.

References

  1. J. Agrawal, Y. Diao, et al. Efficient pattern matching over event streams. In SIGMOD, 147--160, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Akdere, U. Çetintemel, et al. Plan-based complex event detection across distributed sources. PVLDB, 1(1):66--77, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. H. Ali, C. Gerea, et al. Microsoft cep server and online behavioral targeting. PVLDB, 2(2):1558--1561, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. S. Barga, J. Goldstein, et al. Consistent streaming through time: A vision for event stream processing. In CIDR, 363--374, 2007.Google ScholarGoogle Scholar
  5. P. Barham, A. Donnelly, et al. Using Magpie for request extraction and workload modelling. In OSDI, 259--272, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. H. Bhlen and C. S. Jensen. Temporal data model and query language concepts. Encyclopedia of Information Systems, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  7. N. N. Dalvi and D. Suciu. Efficient query evaluation on probabilistic databases. VLDB J., 16(4):523--544, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. J. Demers, J. Gehrke, et al. Cayuga: A general purpose event monitoring system. In CIDR, 412--422, 2007.Google ScholarGoogle Scholar
  9. L. Ding, S. Chen, et al. Runtime semantic query optimization for event stream processing. In ICDE, 676--685, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. E. Dyreson and R. T. Snodgrass. Supporting valid-time indeterminacy. ACM Trans. Database Syst., 23(1):1--57, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ganglia monitoring tool. http://ganglia.sourceforge.net/.Google ScholarGoogle Scholar
  12. E. Koskinen and J. Jannotti. Borderpatrol: isolating events for black-box tracing. In EuroSys, 191--203, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Liu, M. Li, et al. Sequence pattern query processing over out-of-order event streams. In ICDE, 784--795, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Lo, B. Kao, et al. OLAP on sequence data. In SIGMOD, 649--660, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. Mei and S. Madden. ZStream: a cost-based query processor for adaptively detecting composite events. In SIGMOD, 193--206, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Ré, J. Letchner, et al. Event queries on correlated probabilistic streams. In SIGMOD, 715--728, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Sadri, et al. Expressing and optimizing sequence queries in database systems. ACM Trans. Database Syst., 29(2):282--318, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. U. Srivastava and J. Widom. Flexible time management in data stream systems. In PODS, 263--274, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. T. Tran, C. Sutton, et al. Probabilistic inference over RFID streams in mobile environments. In ICDE, 1096--1107, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. P. A. Tucker, D. Maier, et al. Using punctuation schemes to characterize strategies for querying over data streams. IEEE Trans. Knowl. Data Eng., 19(9):1227--1240, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. W. M. White, M. Riedewald, et al. What is "next" in event processing? In PODS, 263--272, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. E. Wu, Y. Diao, et al. High-performance complex event processing over streams. In SIGMOD, 407--418, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Zhang, Y. Diao, et al. Recognizing Patterns in Streams with Imprecise Timestamps UMass Tech Report UM-CS-2010-025, 2010.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 3, Issue 1-2
    September 2010
    1658 pages

    Publisher

    VLDB Endowment

    Publication History

    • Published: 1 September 2010
    Published in pvldb Volume 3, Issue 1-2

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader