skip to main content
10.1145/1142351.1142380acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
Article

Asking the right questions: model-driven optimization using probes

Published:26 June 2006Publication History

ABSTRACT

In several database applications, parameters like selectivities and load are known only with some associated uncertainty, which is specified, or modeled, as a distribution over values. The performance of query optimizers and monitoring schemes can be improved by spending resources like time or bandwidth in observing or resolving these parameters, so that better query plans can be generated. In a resource-constrained situation, deciding which parameters to observe in order to best optimize the expected quality of the plan generated (or in general, optimize the expected value of a certain objective function) itself becomes an interesting optimization problem.We present a framework for studying such problems, and present several scenarios arising in anomaly detection in complex systems, monitoring extreme values in sensor networks, load shedding in data stream systems, and estimating rates in wireless channels and minimum latency routes in networks, which can be modeled in this framework with the appropriate objective functions.Even for several simple objective functions, we show the problems are Np-Hard. We present greedy algorithms with good performance bounds. The proof of the performance bounds are via novel sub-modularity arguments.

References

  1. A. Akella, B. M. Maggs, S. Seshan, A. Shaikh, and R. K. Sitaraman. A measurement-based analysis of multihoming. In ACM SIGCOMM Conference, pages 353--364, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Avnur and J. M. Hellerstein. Eddies: Continuously adaptive query processing. In SIGMOD '00: Proceedings of the 2000 ACM SIGMOD international conference on Management of data, pages 261--272, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. B. Babcock and S. Chaudhuri. Towards a robust query optimizer: A principled and practical approach. In SIGMOD '05: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pages 119--130, 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. B. Babcock and C. Olston. Distributed top-k monitoring. In SIGMOD '03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pages 28--39, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Babu and P. Bizarro. Proactive reoptimization. In Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. Carr, L. Fleischer, V. Leung, and C. Phillips. Strengthening integrality gaps for capacitated network design and covering problems. In Proc. of the Annual ACM-SIAM Symp. on Discrete Algorithms, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Charikar, R. Fagin, J. Kleinberg, P. Raghavan, and A. Sahai. Querying priced information. Proc. of the Annual ACM Symp. on Theory of Computing, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Chu, J. Halpern, and J. Gehrke. Least expected cost query optimization: What can we expect? Proc. of the ACM Symp. on Principles of Database Systems, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. F. Chu, J. Halpern, and P. Seshadri. Least expected cost query optimization: An exercise in utility. In Proc. of the ACM Symp. on Principles of Database Systems, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Dean, M. Goemans, and J. Vondrák. Approximating the stochastic knapsack problem: The benefit of adaptivity. Proc. of the Annual Symp. on Foundations of Computer Science, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Deshpande, C. Guestrin, S. Madden, J. M. Hellerstein, and W. Hong. Model-driven data acquisition in sensor networks. In Proc. of the 2004 Intl. Conf. on Very Large Data Bases, 2004.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Feder, R. Motwani, R. Panigrahy, C. Olston, and J. Widom. Computing the median with uncertainty. SIAM J. Comput., 32(2), 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Goel and P. Indyk. Stochastic load balancing and related problems. Proc. of the Annual Symp. on Foundations of Computer Science, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. P. K. Gummadi, H. V. Madhyastha, S. D. Gribble, H. M. Levy, and D. Wetherall. Improving the reliability of internet paths with one-hop source routing. In 6th Symposium on Operating System Design and Implementation (OSDI), pages 183--198, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Gupta, M. Pál, R. Ravi, and A. Sinha. Boosted sampling: Approximation algorithms for stochastic optimization. Proc. of the Annual ACM Symp. on Theory of Computing, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Gupta, R. Ravi, and A. Sinha. An edge in time saves nine: LP rounding approximation algorithms for stochastic network design. In Proc. of the Annual Symp. on Foundations of Computer Science, pages 218--227, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. N. Immorlica, D. Karger, M. Minkoff, and V. Mirrokni. On the costs and benefits of procrastination: Approximation algorithms for stochastic combinatorial optimization problems. In Proc. of the Annual ACM-SIAM Symp. on Discrete Algorithms, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Khanna and W-C. Tan. On computing functions with uncertainty. In Proc. of the ACM Symp. on Principles of Database Systems, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Kleinberg, Y. Rabani, and É. Tardos. Allocating bandwidth for bursty connections. SIAM J. Comput, 30(1), 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. Kolliopoulos and N. Young. Tight approximation results for general covering integer programs. Proc. of the Annual Symp. on Foundations of Computer Science, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. Krause and C. Guestrin. Near-optimal nonmyopic value of information in graphical models. Twenty-first Conference on Uncertainty in Artificial Intelligence (UAI 2005), 2005.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. L. Massie, B. N. Chun, and D. E. Culler. The Ganglia distributed monitoring system: Design, implementation, and experience. Parallel Computing, 30(7), 2004.]]Google ScholarGoogle Scholar
  23. G. L. Nemhauser, L. A. Wolsey, and M. L. Fisher. An analysis of approximations for maximizing submodular set functions-I. Math Programming, 14(1):265--294, 1978.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Olston. Approximate Replication. PhD thesis, Stanford University, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Shmoys and C. Swamy. Stochastic optimization is (almost) as easy as discrete optimization. Proc. of the Annual Symp. on Foundations of Computer Science, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. Silberstein, R. Braynard, C. Ellis, K. Munagala, and J. Yang. A sampling based approach to optimizing top-k queries in sensor networks. In Proc. of the Intl. Conf. on Data Engineering, 2006.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. N. Tatbul, U. Etintemel, S. Zdonik, M. Chemiack, and M. Stonebraker. Load shedding in a data stream manager. In Proc. of the Intl. Conf. on Very Large Data Bases, 2003.]]Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Asking the right questions: model-driven optimization using probes

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      PODS '06: Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
      June 2006
      382 pages
      ISBN:1595933182
      DOI:10.1145/1142351

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 June 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      PODS '06 Paper Acceptance Rate35of185submissions,19%Overall Acceptance Rate642of2,707submissions,24%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader