skip to main content
research-article

CoPhy: a scalable, portable, and interactive index advisor for large workloads

Published:01 March 2011Publication History
Skip Abstract Section

Abstract

Index tuning, i.e., selecting the indexes appropriate for a workload, is a crucial problem in database system tuning. In this paper, we solve index tuning for large problem instances that are common in practice, e.g., thousands of queries in the workload, thousands of candidate indexes and several hard and soft constraints. Our work is the first to reveal that the index tuning problem has a well structured space of solutions, and this space can be explored efficiently with well known techniques from linear optimization. Experimental results demonstrate that our approach outperforms state-of-the-art commercial and research techniques by a significant margin (up to an order of magnitude).

References

  1. S. Agrawal, S. Chaudhuri, L. Kollár, A. P. Marathe, V. R. Narasayya, and M. Syamala. Database tuning advisor for microsoft sql server 2005. In VLDB, pages 930--932, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, March 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. N. Bruno and S. Chaudhuri. Automatic physical database tuning: a relaxation-based approach. In SIGMOD, pages 227--238, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. N. Bruno and S. Chaudhuri. Constrained physical design tuning. PVLDB, 1(1):4--15, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Bruno and R. V. Nehme. Configuration-parametric query optimization for physical design tuning. In SIGMOD, pages 941--952, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Caprara and J. Salazar. A branch-and-cut algorithm for a generalization of the uncapacitated facility location problem. TOP, 1(4):135--163, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  7. S. Chaudhuri, M. Datar, and V. Narasayya. Index selection for databases: A hardness study and a principled heuristic solution. IEEE TKDE, 16(11):1313--1323, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Chaudhuri and V. Narayasa. Program for tpc-d data generation with skew. ftp://ftp.research.microsoft.com/users/viveknar/tpcdskew.Google ScholarGoogle Scholar
  9. C. Daskalakis, I. Diakonikolas, and M. Yannakakis. How good is the chord algorithm? In SODA, pages 978--991, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. J. Finkelstein, M. Schkolnick, and P. Tiberio. Physical database design for relational databases. ACM TODS, 13(1):91--128, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. L. Fisher. The lagrangian relaxation method for solving integer programming problems. Manage. Sci., pages 1861--1871, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. V. Y. Lum and H. Ling. An optimization problem on the selection of secondary keys. In Proceedings of the 26th annual conference, pages 349--356, 1971. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. S. Papadomanolakis. Large-scale data management for the sciences. PhD thesis, CMU, 2007.Google ScholarGoogle Scholar
  14. S. Papadomanolakis and A. Ailamaki. An integer linear programming approach to automated database design. In SMDB, pages 442--449, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Papadomanolakis, D. Dash, and A. Ailamaki. Efficient use of the query optimizer for automated physical design. In VLDB, pages 1093--1104, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. N. Reddy and J. R. Haritsa. Analyzing plan diagrams of database query optimizers. In VLDB, pages 1228--1240, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. Schnaitter and N. Polyzotis. A Benchmark for Online Index Selection. In SMDB, pages 1701--1708, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. Selinger, M. Astrahan, D. Chamberlin, R. Lorie, and T. Price. Access path selection in a relational database management system. In SIGMOD, pages 23--34, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Z. A. Talebi, R. Chirkova, Y. Fathi, and M. Stallmann. Exact and inexact methods for selecting views and indexes for olap performance improvement. In EDBT, pages 311--322, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. C. Zilio, J. Rao, S. Lightstone, G. M. Lohman, A. Storm, C. Garcia-Arellano, and S. Fadden. Db2 design advisor: Integrated automatic physical database design. In VLDB, pages 1087--1097, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. CoPhy: a scalable, portable, and interactive index advisor for large workloads

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image Proceedings of the VLDB Endowment
            Proceedings of the VLDB Endowment  Volume 4, Issue 6
            March 2011
            71 pages

            Publisher

            VLDB Endowment

            Publication History

            • Published: 1 March 2011
            Published in pvldb Volume 4, Issue 6

            Qualifiers

            • research-article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader