skip to main content
10.1145/543613.543651acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
Article

Least expected cost query optimization: what can we expect?

Published:03 June 2002Publication History

ABSTRACT

A standard assumption in the database query optimization literature is that it suffices to optimize for the "typical" case---that is, the case in which various parameters (e.g., the amount of available memory, the selectivities of predicates, etc.) take on their "typical" values. It was claimed in [CHS99] that we could do better by choosing plans based on their expected cost. Here we investigate this issue more thoroughly. We show that in many circumstances of interest, a "typical" value of the parameter often does give acceptable answers, provided that it is chosen carefully and we are interested only in minimizing expected running time. However, by minimizing the expected running time, we are effectively assuming that if plan p1 runs three times as long as plan p2, then p1 is exactly three times as bad as p2. An assumption like this is not always appropriate. We show that focusing on least expected cost can lead to significant improvement for a number of cost functions of interest.

References

  1. {CHS99} F. Chu, J. Y. Halpern, and P. Seshadri. Least expected cost query optimization: an exercise in utility. In Proceedings of the 18th ACM Symposium on Principles of Database Systems, pages 138-147, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. {CN00} S. Chaudhuri and V. R. Narasayya. Automating statistics management for query optimizers. In Proceedings of the 16th International Conference on Data Engineering, pages 339-348, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. {GBC98} G. Graefe, R. Bunker, and S. Cooper. Hash joins and hash teams in Microsoft, SQL Server. In A. Gupta, O. Shmueli, and J. Widom, editors, VLDB'98, Proceedings of 24th International Conference on Very Large Data Bases, pages 86-97, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. {HHW97} J. M. Hellerstein, P. J. Haas, and H. Wang. Online aggregation. In J. Peckham, editor, SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, pages 171-182, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. {HKWY97} L. M. Haas, D. Kossmann, E. L. Wimmers, and J. Yang. Optimizing queries across diverse data sources. In M. Jarke, M. J. Carey, K. R. Dittrich, F. H. Lochovsky, P. Loucopoulos, and M. A. Jeusfeld, editors, Proceedings of 23rd International Conference on Very Large Data Bases, pages 276-285, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. {INSS92} Y. Ioannidis, R. Ng, K. Shim, and T. K. Sellis. Parametric Query Optimization. In Proceedings of the 18th International Conference on Very Large Data Bases, pages 103-114, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. {LG98} P. Larson and G. Graefe. Memory management during run generation in external sorting. In L. M. Haas and A. Tiwary, editors, SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, pages 472-483, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. {ROH99) M. T. Roth, F. Özcan, and L. M. Haas. Cost models do matter: Providing cost information for diverse data sources in a federated system. In M. P. Atkinson, M. E. Orlowska, P. Valduriez, S. B. Zdonik, and M. L. Brodie, editors, Proceedings of 25th International Conference on Very Large Data Bases, pages 599-610, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. {SAC+79} P. G. Selinger, M. Astrahan, D. Chamberlin, R. Loric, and T. Price. Access Path Selection in a Relational Database Management System. In Proceedings of ACM SIGMOD '79 International Conference on Management of Data, pages 23-34, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. {SLMK01} M. Stillger, G. M. Lohman, V. Markl, and M. Kandil. LEO---DB2's LEarning Optimizer. In P. M. G. Apers, P. Atzeni, S. Ceri, S. Paraboschi, K. Ramamohanarao, and R. T. Snodgrass, editors, Proceedings of 27th International Conference on Very Large Data Bases, pages 19-28, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. {TPPC99} Transaction Processing Performance Council. TPC Benchmark™ H (Decision Support) Standard Specification Revision 1.2.1. Transaction Processing Performance Council, 1999.Google ScholarGoogle Scholar
  12. {Ull89} J. D. Ullman. Principles of Database and Knowledge Base Systems, Volume II: The New Technologies. Computer Science Press, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Least expected cost query optimization: what can we expect?

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        PODS '02: Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
        June 2002
        311 pages
        ISBN:1581135076
        DOI:10.1145/543613

        Copyright © 2002 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 3 June 2002

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        PODS '02 Paper Acceptance Rate24of109submissions,22%Overall Acceptance Rate642of2,707submissions,24%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader