skip to main content
10.1145/3366423.3380084acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Solving Billion-Scale Knapsack Problems

Published:20 April 2020Publication History

ABSTRACT

Knapsack problems (KPs) are common in industry, but solving KPs is known to be NP-hard and has been tractable only at a relatively small scale. This paper examines KPs in a slightly generalized form and shows that they can be solved nearly optimally at scale via distributed algorithms. The proposed approach can be implemented fairly easily with off-the-shelf distributed computing frameworks (e.g. MPI, Hadoop, Spark). As an example, our implementation leads to one of the most efficient KP solvers known to date – capable to solve KPs at an unprecedented scale (e.g., KPs with 1 billion decision variables and 1 billion constraints can be solved within 1 hour). The system has been deployed to production and called on a daily basis, yielding significant business impacts at Ant Financial.

References

  1. Deepak Agarwal, Bee-Chung Chen, Pradheep Elango, and Xuanhui Wang. 2012. Personalized Click Shaping through Lagrangian Duality for Online Recommendation. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, August 12 - 16, 2012. Portland, OR, USA, 485 – 494.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Stephen Boyd and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press, Cambridge, UK.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. C. Chu and John E. Beasley. 1998. A Genetic Algorithm for the Multidimensional Knapsack Problem. Journal of Heuristics 4(1998), 63–86.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. 2009. Introduction to Algorithms(3rd ed.). The MIT Press, Cambridge, Massachusetts, USA.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. CPlex. [n.d.]. www.cplex.com.Google ScholarGoogle Scholar
  6. George B. Dantzig. 1957. Discrete-Variable Extremum Problems. Operations Research 5, 2 (April 1957), 266–277.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jeffrey Dean and Sanjay Ghemawat. 2008. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM 51(2008), 107–113.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Fred Glover. 1989. Tabu Search - Part I. ORSA Journal on Computing 1 (1989), 135–206.Google ScholarGoogle ScholarCross RefCross Ref
  9. Rupesh Gupta, Guanfeng Liang, and Rómer Rosales. 2017. Optimizing Email Volume For Sitewide Engagement. In Proceedings of the 26th ACM CIKM International Conference on Information and Knowledge Management, November 6 - 10, 2017. Singapore, 1947 – 1955.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Rupesh Gupta, Guanfeng Liang, Hsiao-Ping Tseng, Ravi Kiran Holur Vijay, Xiaoyu Chen, and Rómer Rosales. 2016. Email Volume Optimization at LinkedIn. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 13 - 17, 2016. San Francisco, CA, USA, 97 – 106.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Gurobi. [n.d.]. www.gurobi.com.Google ScholarGoogle Scholar
  12. Skander Htiouech and Sadok Bouamama. 2013. OSC: Solving the Multidimensional Multi-choice Knapsack Problem with Tight Strategic Oscillation using Surrogate Constraints. International Journal of Computer Applications 73 (2013), 1–22.Google ScholarGoogle ScholarCross RefCross Ref
  13. Hans Kellerer, Ulrich Pferschy, and David Pisinger. 2004. Knapsack Problems. Springer, Berlin.Google ScholarGoogle Scholar
  14. Shahadatullah Khan. 1998. Quality Adaptation in a Multisession Multimedia System: Model, Algorithms and Architecture. Ph.D. Dissertation. Department of Electrical and Computer Engineering, University of Victoria, Canada.Google ScholarGoogle Scholar
  15. S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi. 1983. Optimization by Simulated Annealing. Science 220(1983), 671–680.Google ScholarGoogle ScholarCross RefCross Ref
  16. Soukaina Laabadi, Mohamed Naimi, Hassan E. Amri, and Boujemâa Achchab. 2018. The 0/1 Multidimensional Knapsack Problem and Its Variants: A Survey of Practical Models and Heuristic Approaches. American Journal of Operations Research 8 (2018), 395–439.Google ScholarGoogle ScholarCross RefCross Ref
  17. Silvano Martello and Paolo Toth. 1990. Knapsack Problems: Algorithms and Computer Implementations. John Wiley & Sons, Chichester.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Laurent Perron and Vincent Furnon. [n.d.]. Google OR-Tools, https://developers.google.com/optimization.Google ScholarGoogle Scholar
  19. Jeremy F. Shapiro. 1979. A survey of Lagrangean techniques for discrete optimization. Annals of Discrete Mathematics 5 (1979), 113–138.Google ScholarGoogle ScholarCross RefCross Ref
  20. Stephen J. Wright. 2015. Coordinate Descent Algorithms. Mathematical Programming, Series B 151 (2015), 3–34.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Bo Zhao, Koichiro Narita, Burkay Orten, and John Egan. 2018. Notification Volume Control and Optimization System at Pinterest. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 19 - 23, 2018. London, United Kingdom, 1012 – 1020.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Solving Billion-Scale Knapsack Problems
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            WWW '20: Proceedings of The Web Conference 2020
            April 2020
            3143 pages
            ISBN:9781450370233
            DOI:10.1145/3366423

            Copyright © 2020 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 20 April 2020

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

            Acceptance Rates

            Overall Acceptance Rate1,899of8,196submissions,23%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format