DOI: 10.1145/1835804.1835928
research-article

Combined regression and ranking

Published: 25 July 2010

ABSTRACT

Many real-world data mining tasks require the achievement of two distinct goals when applied to unseen data: first, to induce an accurate preference ranking, and second, to give good regression performance. In this paper, we give an efficient and effective Combined Regression and Ranking method (CRR) that optimizes regression and ranking objectives simultaneously. We demonstrate the effectiveness of CRR for both families of metrics on a range of large-scale tasks, including click prediction for online advertisements. Results show that CRR often achieves performance equivalent to the best of both ranking-only and regression-only approaches. In the case of rare events or skewed distributions, we also find that this combination can actually improve regression performance due to the addition of informative ranking constraints.
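
As a rough illustration of what optimizing a regression objective and a ranking objective simultaneously can look like, the sketch below mixes two kinds of stochastic gradient steps on one linear model: a pointwise step on a single example, and a pairwise step on the difference of two examples with different targets. The abstract does not specify the loss, the optimizer, or how the two objectives are weighted, so everything here is an assumption made for illustration: the mixing parameter `alpha`, the squared loss, the L2 penalty `lam`, the sampling scheme, and all function names are hypothetical and should not be read as the paper's actual algorithm.

```python
import random

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def sgd_step(w, x, y, lr, lam):
    """One squared-loss SGD step with an L2 penalty; returns the new weights."""
    err = dot(w, x) - y
    return [wi - lr * (err * xi + lam * wi) for wi, xi in zip(w, x)]

def train_combined(data, alpha=0.5, lr=0.05, lam=1e-4, steps=5000, seed=0):
    """data is a list of (features, target) pairs.

    alpha is an assumed mixing weight: 1.0 means regression steps only,
    0.0 means pairwise ranking steps only.
    """
    rng = random.Random(seed)
    w = [0.0] * len(data[0][0])
    for _ in range(steps):
        if rng.random() < alpha:
            # Regression step: fit the target of one sampled example.
            x, y = rng.choice(data)
        else:
            # Ranking step: fit the target difference of a sampled pair.
            (xa, ya), (xb, yb) = rng.sample(data, 2)
            if ya == yb:
                continue  # equal targets express no preference
            x = [a - b for a, b in zip(xa, xb)]
            y = ya - yb
        w = sgd_step(w, x, y, lr, lam)
    return w

def mse(w, data):
    """Mean squared error of the linear model on data."""
    return sum((dot(w, x) - y) ** 2 for x, y in data) / len(data)

def pairwise_accuracy(w, data):
    """Fraction of pairs with different targets that the model orders correctly."""
    correct = total = 0
    for i, (xa, ya) in enumerate(data):
        for xb, yb in data[i + 1:]:
            if ya == yb:
                continue
            total += 1
            correct += (dot(w, xa) - dot(w, xb)) * (ya - yb) > 0
    return correct / total

# Toy data (made up for this sketch): target = 2 * x + 0.5, with a constant bias feature.
data = [([x, 1.0], 2.0 * x + 0.5) for x in (0.0, 0.25, 0.5, 0.75, 1.0)]
w_combined = train_combined(data, alpha=0.5)
w_rank_only = train_combined(data, alpha=0.0)
```

On this toy data the ranking-only model can order the examples correctly but never learns the offset, because the constant bias feature cancels in every pairwise difference; the mixed model receives both signals. This is only meant to make the idea of a combined objective concrete, not to reproduce any result reported for CRR.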


      Published in

      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN: 9781450300551
      DOI: 10.1145/1835804

      Copyright © 2010 ACM


      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Acceptance Rates

      Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%
