Skip to main content
Top

2015 | OriginalPaper | Chapter

A Bayesian Approach to Sparse Learning-to-Rank for Search Engine Optimization

Authors : Olga Krasotkina, Vadim Mottl

Published in: Machine Learning and Data Mining in Pattern Recognition

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Search engine optimization (SEO) is the process of affecting the visibility of a web page in the engine’s search results. SEO specialists must understand how search engines work and which features of the web-page affect its position in the search results. This paper employs machine learning ranking algorithms to constructing the rank model of a web-search engine. Ranking a set of retrieved documents according to their relevance to a given query has become a popular problem at the intersection of web search, machine learning and information retrieval. Feature selection in learning to rank has recently emerged as a crucial issue. Recent work on ranking, focused on a number of different paradigms, namely, point-wise, pair-wise, and list-wise approaches, for which several preprocessing feature section methods have been proposed. Unfortunately, only a few works have been focused on integrating the feature selection into the learning process and all of these embedded methods are based on \( l_{1} \) regularization technique. Such type of regularization does not possess many properties, essential for SEO, such as unbiasedness, grouping effect and oracle property. In this paper we suggest a new Bayesian framework for feature selection in learning-to-rank problem. The proposed approach gives the strong probabilistic statement of shrinkage criterion for features selection. The proposed regularization is unbiased, has grouping and oracle properties, its maximal risk diverges to finite value. Experimental results show that the proposed framework is competitive on both artificial data and publicly available LETOR data sets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Liu, T.Y.: Learning to Rank for Information Retrieval. Now Publishers, Breda (2009) Liu, T.Y.: Learning to Rank for Information Retrieval. Now Publishers, Breda (2009)
2.
go back to reference Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hulldender, G.: Learning to rank using gradient descent. In: Proceedings of International Conference on Machine Learning (2005) Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hulldender, G.: Learning to rank using gradient descent. In: Proceedings of International Conference on Machine Learning (2005)
3.
go back to reference Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the ACM Conference on Knowledge Discovery and Data Mining (KDD). ACM (2002) Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the ACM Conference on Knowledge Discovery and Data Mining (KDD). ACM (2002)
4.
go back to reference Zheng, Z., Zha, H., Chen, K., Sun, G.: A regression framework for learning ranking functions using relative relevance judgments. In: Proceedings of Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2007) Zheng, Z., Zha, H., Chen, K., Sun, G.: A regression framework for learning ranking functions using relative relevance judgments. In: Proceedings of Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2007)
5.
go back to reference Cao, Z., Qin, T., Liu, T.-Y., Tsai, M.-F., Li, H.: Learning to rank: from pairwise approach to listwise approach. In: ICML 2007: Proceedings of the 24th International Conference on Machine Learning, pp. 129–136. ACM, New York (2007) Cao, Z., Qin, T., Liu, T.-Y., Tsai, M.-F., Li, H.: Learning to rank: from pairwise approach to listwise approach. In: ICML 2007: Proceedings of the 24th International Conference on Machine Learning, pp. 129–136. ACM, New York (2007)
6.
go back to reference Weimer, M., Karatzoglou, A., Le, Q., Smola, A.: Cofi rank - maximum margin matrix factorization for collaborative ranking. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems 20. MIT Press, Cambridge (2008) Weimer, M., Karatzoglou, A., Le, Q., Smola, A.: Cofi rank - maximum margin matrix factorization for collaborative ranking. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems 20. MIT Press, Cambridge (2008)
7.
go back to reference Taylor, M., Guiver, J., Robertson, S., Minka, T.: SoftRank: optimising non-smooth rank metrics. In: Proceedings of International ACM Conference on Web Search and Data Mining (2008) Taylor, M., Guiver, J., Robertson, S., Minka, T.: SoftRank: optimising non-smooth rank metrics. In: Proceedings of International ACM Conference on Web Search and Data Mining (2008)
8.
go back to reference Xia, F., Liu, T.Y., Wang, J., Zhang, W., Li, H.: Listwise approach to learning to rank - theory and algorithm. In: International Conference on Machine Learning (ICML) (2008) Xia, F., Liu, T.Y., Wang, J., Zhang, W., Li, H.: Listwise approach to learning to rank - theory and algorithm. In: International Conference on Machine Learning (ICML) (2008)
9.
go back to reference Hua, G., Zhang, M., Liu, Y., Ma, S., Ru, L.: Hierarchical feature selection for ranking. In: Proceedings of 19th International Conference on World Wide Web, pp. 1113–1114 (2010) Hua, G., Zhang, M., Liu, Y., Ma, S., Ru, L.: Hierarchical feature selection for ranking. In: Proceedings of 19th International Conference on World Wide Web, pp. 1113–1114 (2010)
10.
go back to reference Yu, H., Oh, J., Han, W.-S.: Efficient feature weighting methods for ranking. In: Proceedings of 18th ACM Conference on Information and Knowledge Management, pp. 1157–1166 (2009) Yu, H., Oh, J., Han, W.-S.: Efficient feature weighting methods for ranking. In: Proceedings of 18th ACM Conference on Information and Knowledge Management, pp. 1157–1166 (2009)
11.
go back to reference Pan, F., Converse, T., Ahn, D., Salvetti, F., Donato, G.: Feature selection for ranking using boosted trees. In: Proceedings of 18th ACM Conference on Information and Knowledge Management, pp. 2025–2028 (2009) Pan, F., Converse, T., Ahn, D., Salvetti, F., Donato, G.: Feature selection for ranking using boosted trees. In: Proceedings of 18th ACM Conference on Information and Knowledge Management, pp. 2025–2028 (2009)
12.
go back to reference Dang, V., Croft, B.: Feature selection for document ranking using best first search and coordinate ascent. In: SIGIR Workshop on Feature Generation and Selection for Information Retrieval (2010) Dang, V., Croft, B.: Feature selection for document ranking using best first search and coordinate ascent. In: SIGIR Workshop on Feature Generation and Selection for Information Retrieval (2010)
13.
go back to reference Pahikkala, T., Airola, A., Naula, P., Salakoski, T.: Greedy RankRLS: a linear time algorithm for learning sparse ranking models. In: SIGIR 2010 Workshop on Feature Generation and Selection for Information Retrieval, pp. 11–18 (2010) Pahikkala, T., Airola, A., Naula, P., Salakoski, T.: Greedy RankRLS: a linear time algorithm for learning sparse ranking models. In: SIGIR 2010 Workshop on Feature Generation and Selection for Information Retrieval, pp. 11–18 (2010)
14.
go back to reference Sun, Z., Qin, T., Tao, Q., Wang, J.: Robust sparse rank learning for non-smooth ranking measures. In: Proceedings of 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 259–266 (2009) Sun, Z., Qin, T., Tao, Q., Wang, J.: Robust sparse rank learning for non-smooth ranking measures. In: Proceedings of 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 259–266 (2009)
15.
go back to reference Lai, H., Pan, Y., Liu, C., Lin, L., Wu, J.: Sparse learning-to-rank via an efficient primal-dual algorithm. IEEE Trans. Comput. 99(PrePrints), 1221–1233 (2012)MathSciNet Lai, H., Pan, Y., Liu, C., Lin, L., Wu, J.: Sparse learning-to-rank via an efficient primal-dual algorithm. IEEE Trans. Comput. 99(PrePrints), 1221–1233 (2012)MathSciNet
16.
go back to reference Lai, H.-J., Pan, Y., Tang, Y., Yu, R.: Fsmrank: feature selection algorithm for learning to rank. IEEE Trans. Neural Netw. Learn. Syst. 24(6), 940–952 (2013)CrossRef Lai, H.-J., Pan, Y., Tang, Y., Yu, R.: Fsmrank: feature selection algorithm for learning to rank. IEEE Trans. Neural Netw. Learn. Syst. 24(6), 940–952 (2013)CrossRef
17.
go back to reference Zou, H.: The adaptive lasso and its oracle properties. J. Amer. Stat. Assoc. 101(476), 1418–1429 (2006)MATHCrossRef Zou, H.: The adaptive lasso and its oracle properties. J. Amer. Stat. Assoc. 101(476), 1418–1429 (2006)MATHCrossRef
18.
go back to reference Mothe, J.: Non-convex regularizations for feature selection in ranking with sparse SVM. X(X): 1 Mothe, J.: Non-convex regularizations for feature selection in ranking with sparse SVM. X(X): 1
19.
go back to reference Zhang, C.-H.: Nearly unbiaised variable selection under minimax con-cave penalty. Ann. Stat. 38(2), 894–942 (2010)MATHCrossRef Zhang, C.-H.: Nearly unbiaised variable selection under minimax con-cave penalty. Ann. Stat. 38(2), 894–942 (2010)MATHCrossRef
20.
go back to reference Leeb, H., Pötscher, B.M.: Sparse estimators and the oracle property, or the return of Hodges’ estimator. J. Econometrics 142(1), 201–211 (2008)MathSciNetCrossRef Leeb, H., Pötscher, B.M.: Sparse estimators and the oracle property, or the return of Hodges’ estimator. J. Econometrics 142(1), 201–211 (2008)MathSciNetCrossRef
21.
go back to reference Platt, J.: Fast training of support vector machines using sequential minimal optimization. In: Scholkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT Press, Cambridge (1998) Platt, J.: Fast training of support vector machines using sequential minimal optimization. In: Scholkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT Press, Cambridge (1998)
22.
go back to reference Chapelle, O., Keerthi, S.S.: Efficient algorithms for ranking with SVMs. Inf. Retrieval J. 13(3), 201–215 (2010)CrossRef Chapelle, O., Keerthi, S.S.: Efficient algorithms for ranking with SVMs. Inf. Retrieval J. 13(3), 201–215 (2010)CrossRef
23.
go back to reference Moon, T., Smola, A.J., Chang, Y., Zheng, Z.: IntervalRank: isotonic regression with listwise and pairwise constraints. In: WSDM 2010, pp. 151–160 Moon, T., Smola, A.J., Chang, Y., Zheng, Z.: IntervalRank: isotonic regression with listwise and pairwise constraints. In: WSDM 2010, pp. 151–160
Metadata
Title
A Bayesian Approach to Sparse Learning-to-Rank for Search Engine Optimization
Authors
Olga Krasotkina
Vadim Mottl
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-21024-7_26

Premium Partner