Published in: Soft Computing 10/2018

03-01-2018 | Focus

An evolutionary strategy with machine learning for learning to rank in information retrieval

Authors: Osman Ali Sadek Ibrahim, D. Landa-Silva

Abstract

Learning to rank (LTR) is one of the problems attracting researchers in information retrieval (IR). The LTR problem refers to ranking the retrieved documents for users in search engines, question answering and product recommendation systems. There are a number of LTR approaches based on machine learning and computational intelligence techniques. Most existing LTR methods have limitations: they are too slow, not very effective, or require a large amount of memory to operate. This paper proposes an LTR method that combines a \((1+1)\)-evolutionary strategy with machine learning. Three variants of the method are investigated: ES-Rank, IESR-Rank and IESVM-Rank. They differ in the chromosome initialisation mechanism for the evolutionary process. ES-Rank simply sets all genes in the initial chromosome to the same value. IESR-Rank uses linear regression, and IESVM-Rank uses a support vector machine, for the initialisation process. Experimental results from comparing the proposed method to fourteen other approaches from the literature show that IESR-Rank achieves the overall highest performance. Ten problem instances are used here, obtained from four datasets: MSLR-WEB10K, LETOR 3 and LETOR 4. Performance is measured at the top-10 query–document pairs retrieved, using five metrics: mean average precision (MAP), root-mean-square error (RMSE), precision (P@10), reciprocal rank (RR@10) and normalized discounted cumulative gain (NDCG@10). The contribution of this paper is an effective and efficient LTR method that combines a list-wise evolutionary technique with point-wise and pair-wise machine learning techniques.
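
To make the idea in the abstract concrete, the following is a minimal sketch of a generic \((1+1)\)-evolutionary strategy that evolves the weight vector (the "chromosome") of a linear ranking function and uses mean NDCG@10 as a list-wise fitness. It is not the authors' implementation. The abstract does not specify the mutation operator, step size, acceptance rule or fitness metric, so the Gaussian mutation of every gene, the `sigma` value, the accept-if-not-worse rule, the choice of NDCG@10 and all function names here are assumptions. The all-equal initial chromosome mirrors what the abstract says about ES-Rank; the IESR-Rank and IESVM-Rank variants would instead pass weights obtained from linear regression or a support vector machine as `init_weights`, which is not shown.

```python
import math
import random


def dcg_at_k(labels, k=10):
    """Discounted cumulative gain over the first k relevance labels."""
    return sum((2 ** rel - 1) / math.log2(rank + 2)
               for rank, rel in enumerate(labels[:k]))


def ndcg_at_k(scores, labels, k=10):
    """NDCG@k for one query: rank documents by score, compare to the ideal order."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    ranked = [labels[i] for i in order]
    ideal = dcg_at_k(sorted(labels, reverse=True), k)
    return dcg_at_k(ranked, k) / ideal if ideal > 0 else 0.0


def fitness(weights, queries, k=10):
    """Mean NDCG@k of a linear scorer over all queries (list-wise objective)."""
    total = 0.0
    for features, labels in queries:
        scores = [sum(w * x for w, x in zip(weights, row)) for row in features]
        total += ndcg_at_k(scores, labels, k)
    return total / len(queries)


def one_plus_one_es(queries, init_weights, generations=200, sigma=0.1, seed=0):
    """Generic (1+1)-ES: one parent, one mutated child per generation.

    Assumption: every gene receives Gaussian noise and the child replaces
    the parent when its fitness is not worse. The paper may differ.
    """
    rng = random.Random(seed)
    parent = list(init_weights)
    parent_fit = fitness(parent, queries)
    for _ in range(generations):
        child = [w + rng.gauss(0.0, sigma) for w in parent]
        child_fit = fitness(child, queries)
        if child_fit >= parent_fit:
            parent, parent_fit = child, child_fit
    return parent, parent_fit


# Hypothetical toy data: each query is (feature rows, relevance labels).
toy_queries = [
    ([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]], [2, 0, 1]),
    ([[0.2, 0.9], [0.8, 0.1]], [0, 1]),
]

# ES-Rank-style start: all genes set to the same value (zero here).
start = [0.0, 0.0]
best_weights, best_fitness = one_plus_one_es(toy_queries, start)
print("initial fitness:", fitness(start, toy_queries))
print("evolved fitness:", best_fitness)
```

Because the child is accepted only when it is not worse than the parent, the fitness returned by this sketch can never fall below that of the initial chromosome. A better starting point, such as one produced by a point-wise or pair-wise learner, therefore gives the search a head start.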

Metadata
Title
An evolutionary strategy with machine learning for learning to rank in information retrieval
Authors
Osman Ali Sadek Ibrahim
D. Landa-Silva
Publication date
03-01-2018
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 10/2018
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-017-2988-6