DOI: 10.1145/1247480.1247568
Article

Optimization of multi-version expensive predicates

Published: 11 June 2007

ABSTRACT

Modern query optimizers need to take into account the performance of expensive user-defined predicates. Existing research has shown how to incorporate such predicates in a traditional cost-based query optimizer. In this paper we deal with the optimization of the expensive predicates themselves, showing how their cost can be reduced by utilizing cheaper, but less accurate, versions of the predicates to pre-filter tuples. We discuss the generalized tuple handling mechanism, which processes tuples along a fixed sequence of versions, as well as adaptive approaches that either split tuple streams into groups, or make routing decisions at the individual tuple level. We identify the lower bound to the problem of evaluating a multi-version selection predicate by an ideal individualized plan (IIP), and develop an optimal generalized plan (OGP). We then show how realistic individualized or grouped schemes can produce an intermediate cost between OGP and IIP, if tuples substantially deviate from the average stream behavior. Our algorithms are tested experimentally, identifying many of the issues that arise whenever multi-version predicates are used.
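The pre-filtering idea in the abstract can be illustrated with a minimal sketch of a generalized plan, in which every tuple passes through the same fixed sequence of predicate versions until one of them reaches a decision. The sketch is not the paper's algorithm: the `Version` class, the three-valued verdict (accept, reject, undecided), the cost figures, and the threshold predicate are all hypothetical and chosen only for illustration. It also assumes a cheap version is "less accurate" in the sense that it may fail to decide, not that it decides wrongly.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple

# Assumption: a version answers True (accept), False (reject),
# or None (cannot decide; hand the tuple to the next version).
Verdict = Optional[bool]


@dataclass(frozen=True)
class Version:
    """One version of a selection predicate (hypothetical model)."""
    name: str
    cost: float                           # cost charged per evaluated tuple
    evaluate: Callable[[float], Verdict]


def run_fixed_sequence(value: float,
                       versions: Sequence[Version]) -> Tuple[bool, float]:
    """Evaluate one tuple along a fixed sequence of versions.

    Returns the final verdict and the total cost spent on this tuple.
    """
    spent = 0.0
    for version in versions:
        spent += version.cost
        verdict = version.evaluate(value)
        if verdict is not None:           # decided: skip remaining versions
            return verdict, spent
    raise ValueError("the last version must always decide")


def total_cost(values: Sequence[float],
               versions: Sequence[Version]) -> float:
    """Sum the per-tuple costs of a plan over a batch of tuples."""
    return sum(run_fixed_sequence(v, versions)[1] for v in values)


def cheap(x: float) -> Verdict:
    """Cheap version: decides only when x is far from the threshold 10."""
    if x > 15:
        return True
    if x < 5:
        return False
    return None


def exact(x: float) -> Verdict:
    """Expensive, exact version of the predicate x > 10."""
    return x > 10


# Illustrative costs only: the cheap version is 20x cheaper than the exact one.
PLAN = (Version("cheap", 1.0, cheap), Version("exact", 20.0, exact))
EXACT_ONLY = PLAN[1:]
```

Under these assumed costs, a batch whose tuples mostly fall far from the threshold is handled almost entirely by the cheap version, while a batch concentrated near the threshold pays for both versions on every tuple. That sensitivity to how individual tuples deviate from the average is what the adaptive (grouped or per-tuple) schemes described in the abstract aim to exploit.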


      • Published in

        SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
        June 2007
        1210 pages
        ISBN: 9781595936868
        DOI: 10.1145/1247480
        • General Chairs: Lizhu Zhou, Tok Wang Ling
        • Program Chair: Beng Chin Ooi

        Copyright © 2007 ACM


        Publisher

        Association for Computing Machinery

        New York, NY, United States


        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate: 785 of 4,003 submissions, 20%
