skip to main content
10.1145/3022227.3022255acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

Efficient query processing on distributed stream processing engine

Published:05 January 2017Publication History

ABSTRACT

Distributed stream processing engines, such as Storm and Samza, have been developed to process large scale stream data. The engines are scale out horizontally with shared nothing architecture, but they do not provide high-level query language like SQL. Supporting query language for flexible analysis has become an important issue. In this paper, we provide efficient continuous relational query processing on distributed stream processing engine. We propose a methodology to transform queries executable in the engine and optimization technique for query processing. Our experimental results show that our methodology is efficient on processing queries for data streams.

References

  1. Amazon Kinesis Analytics. https://aws.amazon.com/kinesis/analytics/.Google ScholarGoogle Scholar
  2. Apache Kafka. http://kafka.apache.org.Google ScholarGoogle Scholar
  3. Apache samza. http://samza.apache.org.Google ScholarGoogle Scholar
  4. Apache storm: Trident. http://storm.apache.org/releases/1.0.2/Trident-API-Overview.html.Google ScholarGoogle Scholar
  5. Oracle CEP. https://docs.oracle.com/cd/E16764_01/doc.1111/e12048/intro.htm.Google ScholarGoogle Scholar
  6. TPC-H. http://www.tpc.org/tpch/.Google ScholarGoogle Scholar
  7. D. J. . Abadi, Y. Ahmad, M. Balazinska, U. Çetintemel, M. Cherniack, J.-H. Hwang, W. Lindner, A. Maskey, A. Rasin, E. Ryvkina, N. Tatbul, Y. Xing, and S. B. . Zdonik. The Design of the Borealis Stream Processing Engine. In Proceedings of Conference on Innovative Data Systems Research (CIDR), pages 277--289, 2005.Google ScholarGoogle Scholar
  8. D. J. Abadi, D. Carney, U. Çetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, and S. Zdonik. Aurora: A new model and architecture for data stream management. The VLDB Journal, 12(2):120--139, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Arasu, S. Babu, and J. Widom. The cql continuous query language: Semantic foundations and query execution. The VLDB Journal, 15(2):121--142, June 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Chandrasekaran, O. Cooper, A. Deshpande, M. J. Franklin, J. M. Hellerstein, W. Hong, S. Krishnamurthy, S. Madden, V. Raman, F. Reiss, and M. A. Shah. Telegraphcq: Continuous dataflow processing for an uncertain world. In Proceedings of Conference on Innovative Data Systems Research (CIDR), 2003.Google ScholarGoogle Scholar
  11. C. Cranor, T. Johnson, O. Spataschek, and V. Shkapenyuk. Gigascope: A stream database for network applications. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pages 647--651. ACM, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. Gedik, H. Andrade, K.-L. Wu, P. S. Yu, and M. Doo. Spade: The system s declarative stream processing engine. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pages 1123--1134. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. M. Ghanem, A. K. Elmagarmid, P.-A. Larson, and W. G. Aref. Supporting views in data stream management systems. ACM Trans. Database Syst., 35(1):1:1--1:47, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Krishnamurthy, M. J. Franklin, J. Davis, D. Farina, P. Golovko, A. Li, and N. Thombre. Continuous analytics over discontinuous streams. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, SIGMOD '10, pages 1081--1092. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Kulkarni, N. Bhagat, M. Fu, V. Kedigehalli, C. Kellogg, S. Mittal, J. M. Patel, K. Ramasamy, and S. Taneja. Twitter heron: Stream processing at scale. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15, pages 239--250. ACM, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y.-N. Law, H. Wang, and C. Zaniolo. Query languages and data models for database sequences and data streams. In Proceedings of the Thirtieth International Conference on Very Large Data Bases - Volume 30, pages 492--503. VLDB Endowment, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. L. Neumeyer, B. Robbins, A. Nair, and A. Kesari. S4: Distributed stream computing platform. In Proceedings of the 2010 IEEE International Conference on Data Mining Workshops, ICDMW '10, pages 170--177. IEEE Computer Society, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Toshniwal, S. Taneja, A. Shukla, K. Ramasamy, J. M. Patel, S. Kulkarni, J. Jackson, K. Gade, M. Fu, J. Donham, N. Bhagat, S. Mittal, and D. Ryaboy. Storm@twitter. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD '14, pages 147--156. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Efficient query processing on distributed stream processing engine

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication
        January 2017
        746 pages
        ISBN:9781450348881
        DOI:10.1145/3022227

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 5 January 2017

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        IMCOM '17 Paper Acceptance Rate113of366submissions,31%Overall Acceptance Rate213of621submissions,34%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader