Article

Optimization of multi-version expensive predicates

Authors:
Iosif Lazaridis, University of California: Irvine, Irvine, CA
Sharad Mehrotra, University of California: Irvine, Irvine, CA

SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, June 2007, Pages 797–808
https://doi.org/10.1145/1247480.1247568

Published: 11 June 2007

ABSTRACT

Modern query optimizers need to take into account the performance of expensive user-defined predicates. Existing research has shown how to incorporate such predicates in a traditional cost-based query optimizer. In this paper we deal with the optimization of the expensive predicates themselves, showing how their cost can be reduced by utilizing cheaper, but less accurate, versions of the predicates to pre-filter tuples. We discuss the generalized tuple handling mechanism, which processes tuples along a fixed sequence of versions, as well as adaptive approaches that either split tuple streams into groups, or make routing decisions at the individual tuple level. We identify the lower bound to the problem of evaluating a multi-version selection predicate by an ideal individualized plan (IIP), and develop an optimal generalized plan (OGP). We then show how realistic individualized or grouped schemes can produce an intermediate cost between OGP and IIP, if tuples substantially deviate from the average stream behavior. Our algorithms are tested experimentally, identifying many of the issues that arise whenever multi-version predicates are used.
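
The fixed-sequence idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes each cheap version may answer accept, reject, or "undecided" (modeled here as `None`), and that the last version in the sequence is exact. The names `evaluate_cascade`, `cheap_version`, and `exact_version`, the threshold predicate, and the costs are all hypothetical, chosen only to show how pre-filtering can lower total evaluation cost.

```python
from typing import Callable, Iterable, List, Optional, Tuple

# Hypothetical model: a version is (cost per call, function).
# The function returns True (accept), False (reject), or None (undecided).
Version = Tuple[float, Callable[[int], Optional[bool]]]


def evaluate_cascade(
    tuples: Iterable[int], versions: List[Version]
) -> Tuple[List[int], float]:
    """Send every tuple through the same fixed sequence of versions.

    A tuple stops at the first version that gives a definite verdict.
    Returns the accepted tuples and the total cost paid.
    """
    accepted: List[int] = []
    total_cost = 0.0
    for t in tuples:
        for cost, predicate in versions:
            total_cost += cost
            verdict = predicate(t)
            if verdict is not None:
                if verdict:
                    accepted.append(t)
                break
    return accepted, total_cost


def cheap_version(t: int) -> Optional[bool]:
    """Assumed cheap check: decides only values far from the threshold."""
    if t < 40:
        return False
    if t > 60:
        return True
    return None  # too close to call; defer to the next version


def exact_version(t: int) -> Optional[bool]:
    """Assumed expensive check: always gives a definite answer."""
    return t > 50


stream = [10, 45, 55, 90]
accepted, cascade_cost = evaluate_cascade(
    stream, [(1.0, cheap_version), (10.0, exact_version)]
)
_, exact_only_cost = evaluate_cascade(stream, [(10.0, exact_version)])
print(accepted, cascade_cost, exact_only_cost)
```

In this toy stream, two of the four tuples are settled by the cheap version alone, so the cascade pays less than evaluating the exact version on every tuple. Whether pre-filtering pays off in general depends on how often the cheap versions are decisive relative to their cost, which is the trade-off the paper's plans are meant to optimize.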

Index Terms

Optimization of multi-version expensive predicates
1. Information systems
  1. Data management systems
    1. Database management system engines
      1. Database query processing
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Database theory
      1. Database query processing and optimization (theory)

Published in
SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
June 2007
1210 pages
ISBN: 9781595936868
DOI: 10.1145/1247480
General Chairs: Lizhu Zhou (Tsinghua University, China), Tok Wang Ling (National University of Singapore, Singapore)
Program Chair: Beng Chin Ooi (National University of Singapore, Singapore)
Copyright © 2007 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
adaptive query processing
data streams
expensive methods
multi-version predicates
multimedia sensor networks
query optimization
user-defined predicates
Qualifiers
- Article
Acceptance Rates
Overall Acceptance Rate: 785 of 4,003 submissions, 20%