Top

Published in:

2015 | OriginalPaper | Chapter

A Bayesian Approach to Sparse Learning-to-Rank for Search Engine Optimization

Authors : Olga Krasotkina, Vadim Mottl

Published in: Machine Learning and Data Mining in Pattern Recognition

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Search engine optimization (SEO) is the process of affecting the visibility of a web page in the engine’s search results. SEO specialists must understand how search engines work and which features of the web-page affect its position in the search results. This paper employs machine learning ranking algorithms to constructing the rank model of a web-search engine. Ranking a set of retrieved documents according to their relevance to a given query has become a popular problem at the intersection of web search, machine learning and information retrieval. Feature selection in learning to rank has recently emerged as a crucial issue. Recent work on ranking, focused on a number of different paradigms, namely, point-wise, pair-wise, and list-wise approaches, for which several preprocessing feature section methods have been proposed. Unfortunately, only a few works have been focused on integrating the feature selection into the learning process and all of these embedded methods are based on \( l_{1} \) regularization technique. Such type of regularization does not possess many properties, essential for SEO, such as unbiasedness, grouping effect and oracle property. In this paper we suggest a new Bayesian framework for feature selection in learning-to-rank problem. The proposed approach gives the strong probabilistic statement of shrinkage criterion for features selection. The proposed regularization is unbiased, has grouping and oracle properties, its maximal risk diverges to finite value. Experimental results show that the proposed framework is competitive on both artificial data and publicly available LETOR data sets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter Learning the Relationship Between Corporate Governance and Company Performance Using Data Mining

next chapter Data Driven Geometry for Learning

Liu, T.Y.: Learning to Rank for Information Retrieval. Now Publishers, Breda (2009)

Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hulldender, G.: Learning to rank using gradient descent. In: Proceedings of International Conference on Machine Learning (2005)

Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the ACM Conference on Knowledge Discovery and Data Mining (KDD). ACM (2002)

Zheng, Z., Zha, H., Chen, K., Sun, G.: A regression framework for learning ranking functions using relative relevance judgments. In: Proceedings of Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2007)

Cao, Z., Qin, T., Liu, T.-Y., Tsai, M.-F., Li, H.: Learning to rank: from pairwise approach to listwise approach. In: ICML 2007: Proceedings of the 24th International Conference on Machine Learning, pp. 129–136. ACM, New York (2007)

Weimer, M., Karatzoglou, A., Le, Q., Smola, A.: Cofi rank - maximum margin matrix factorization for collaborative ranking. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems 20. MIT Press, Cambridge (2008)

Taylor, M., Guiver, J., Robertson, S., Minka, T.: SoftRank: optimising non-smooth rank metrics. In: Proceedings of International ACM Conference on Web Search and Data Mining (2008)

Xia, F., Liu, T.Y., Wang, J., Zhang, W., Li, H.: Listwise approach to learning to rank - theory and algorithm. In: International Conference on Machine Learning (ICML) (2008)

Hua, G., Zhang, M., Liu, Y., Ma, S., Ru, L.: Hierarchical feature selection for ranking. In: Proceedings of 19th International Conference on World Wide Web, pp. 1113–1114 (2010)

10.

Yu, H., Oh, J., Han, W.-S.: Efficient feature weighting methods for ranking. In: Proceedings of 18th ACM Conference on Information and Knowledge Management, pp. 1157–1166 (2009)

11.

Pan, F., Converse, T., Ahn, D., Salvetti, F., Donato, G.: Feature selection for ranking using boosted trees. In: Proceedings of 18th ACM Conference on Information and Knowledge Management, pp. 2025–2028 (2009)

12.

Dang, V., Croft, B.: Feature selection for document ranking using best first search and coordinate ascent. In: SIGIR Workshop on Feature Generation and Selection for Information Retrieval (2010)

13.

Pahikkala, T., Airola, A., Naula, P., Salakoski, T.: Greedy RankRLS: a linear time algorithm for learning sparse ranking models. In: SIGIR 2010 Workshop on Feature Generation and Selection for Information Retrieval, pp. 11–18 (2010)

14.

Sun, Z., Qin, T., Tao, Q., Wang, J.: Robust sparse rank learning for non-smooth ranking measures. In: Proceedings of 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 259–266 (2009)

15.

Lai, H., Pan, Y., Liu, C., Lin, L., Wu, J.: Sparse learning-to-rank via an efficient primal-dual algorithm. IEEE Trans. Comput. 99(PrePrints), 1221–1233 (2012)MathSciNet

16.

Lai, H.-J., Pan, Y., Tang, Y., Yu, R.: Fsmrank: feature selection algorithm for learning to rank. IEEE Trans. Neural Netw. Learn. Syst. 24(6), 940–952 (2013)CrossRef

17.

Zou, H.: The adaptive lasso and its oracle properties. J. Amer. Stat. Assoc. 101(476), 1418–1429 (2006)MATHCrossRef

18.

Mothe, J.: Non-convex regularizations for feature selection in ranking with sparse SVM. X(X): 1

19.

Zhang, C.-H.: Nearly unbiaised variable selection under minimax con-cave penalty. Ann. Stat. 38(2), 894–942 (2010)MATHCrossRef

20.

Leeb, H., Pötscher, B.M.: Sparse estimators and the oracle property, or the return of Hodges’ estimator. J. Econometrics 142(1), 201–211 (2008)MathSciNetCrossRef

21.

Platt, J.: Fast training of support vector machines using sequential minimal optimization. In: Scholkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT Press, Cambridge (1998)

22.

Chapelle, O., Keerthi, S.S.: Efficient algorithms for ranking with SVMs. Inf. Retrieval J. 13(3), 201–215 (2010)CrossRef

23.

Moon, T., Smola, A.J., Chang, Y., Zheng, Z.: IntervalRank: isotonic regression with listwise and pairwise constraints. In: WSDM 2010, pp. 151–160

24.

http://research.microsoft.com/en-us/um/beijing/projects/letor/

Title: A Bayesian Approach to Sparse Learning-to-Rank for Search Engine Optimization
Authors: Olga Krasotkina
Vadim Mottl
Publisher: Springer International Publishing
Book: Machine Learning and Data Mining in Pattern Recognition
Print ISBN: 978-3-319-21023-0

Electronic ISBN: 978-3-319-21024-7

Copyright Year: 2015
DOI: https://doi.org/10.1007/978-3-319-21024-7_26

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner