Published in: International Journal of Machine Learning and Cybernetics 6/2014

01.12.2014 | Original Article

A novel prior-based real-time click through rate prediction model

Authors: Yan Fang, Jinzhi Liu



Abstract

Applying traditional machine learning algorithms to click data is challenging because of severely sparse and transient ID features, which tend to bloat the data size and prolong training time considerably. Moreover, because of data size requirements and training and publishing overhead, minute-level incremental model updates have become a bottleneck. We propose a novel real-time click-through rate (CTR) prediction model based on empirical CTRs with a set of pre-learned priors, upon which a Minimum Variance Unbiased Estimator is constructed as the CTR prediction. The empirical CTRs are computed over the sparsest and finest ID-level dimensions, which can be strong indicators but are generally unsuited as machine learning features. Experiments on real-life click data show that our prior-based real-time estimator, combined with a traditional machine learning model, significantly improves both prediction accuracy and ranking capability, especially on the latest data, beyond the time-effectiveness of the machine learning model.
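The idea of combining an empirical CTR with a pre-learned prior can be illustrated with a minimal sketch. This is not the estimator from the paper, whose construction is not given in the abstract: it swaps in a standard Beta-Binomial posterior mean, which is a Bayesian shrinkage estimate rather than a Minimum Variance Unbiased Estimator. The `BetaPrior` class, the `smoothed_ctr` function, and the prior values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BetaPrior:
    """Hypothetical pre-learned prior, expressed as pseudo-counts (assumption)."""
    alpha: float  # prior pseudo-clicks
    beta: float   # prior pseudo-non-clicks

    @property
    def mean(self) -> float:
        return self.alpha / (self.alpha + self.beta)


def smoothed_ctr(clicks: int, impressions: int, prior: BetaPrior) -> float:
    """Shrink the empirical CTR (clicks / impressions) toward the prior mean.

    Illustrative Beta-Binomial posterior mean, not the paper's estimator.
    """
    if clicks < 0 or impressions < clicks:
        raise ValueError("need 0 <= clicks <= impressions")
    return (clicks + prior.alpha) / (impressions + prior.alpha + prior.beta)


prior = BetaPrior(alpha=2.0, beta=98.0)   # assumed prior mean CTR of 2%
cold = smoothed_ctr(0, 0, prior)          # no data: falls back to the prior mean
sparse = smoothed_ctr(1, 3, prior)        # few impressions: stays close to the prior
dense = smoothed_ctr(500, 10_000, prior)  # many impressions: close to the empirical 5%
```

With few impressions the estimate is dominated by the prior, and with many impressions it approaches the raw empirical CTR, which is the behavior one would want from a counter kept at a sparse ID level such as cookie, URL, or ad-id.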

Footnotes
1
Depending on the application, ad sponsoring engines usually wait from several minutes to an hour for a user's click.
 
2
In sponsored search, search session IDs and queries can be considered the counterparts of the former two.
 
3
A category of features; most of the paper is focused on three dimensions: cookie, URL, and ad-id.
 
4
We limit our attention to Google-like rank-by-revenue auctions, i.e., ads are displayed in decreasing order of CTR × bid.
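The rank-by-revenue ordering can be sketched in a few lines. The `Ad` record, its field names, and the sample values are hypothetical; treating the bid as a per-click bid is also an assumption, since the footnote only gives the product of CTR and bid.

```python
from typing import NamedTuple


class Ad(NamedTuple):
    ad_id: str
    ctr: float  # predicted click-through rate
    bid: float  # advertiser's bid (assumed here to be per click)


def rank_by_revenue(ads: list[Ad]) -> list[Ad]:
    """Order ads by CTR * bid, highest first."""
    return sorted(ads, key=lambda ad: ad.ctr * ad.bid, reverse=True)


ads = [Ad("a", 0.02, 1.0), Ad("b", 0.01, 3.0), Ad("c", 0.05, 0.3)]
ranking = [ad.ad_id for ad in rank_by_revenue(ads)]  # ["b", "a", "c"]
```

Ad "b" has the lowest CTR of the three but ranks first because its bid is high, which shows why errors in the predicted CTR translate directly into ranking errors.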
 
5
Most commercial implementations of machine learning models (e.g., Google's and Baidu's Logistic Regression, Microsoft's Bayesian Probit Regression) binarize all features. For example, the PLSA score between a query and an ad is a common feature (class). After binarization, initial PLSA values of 0.1 and 0.099 might become two totally unrelated features.
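The effect described in this footnote can be shown with a small sketch. The binarization scheme below (formatting the value to a fixed precision and using the resulting string as an indicator name) is an assumption made for illustration; the footnote does not say how the systems it mentions binarize their features, and the `binarize` helper is hypothetical.

```python
def binarize(feature_class: str, value: float, precision: int = 3) -> str:
    """Turn a real-valued feature into the name of a single binary indicator."""
    return f"{feature_class}={value:.{precision}f}"


f1 = binarize("plsa", 0.1)    # "plsa=0.100"
f2 = binarize("plsa", 0.099)  # "plsa=0.099"

# Each distinct indicator gets its own column and hence its own weight;
# nothing in the model ties the two columns together.
index = {name: i for i, name in enumerate(sorted({f1, f2}))}
```

Although the two scores differ by only 0.001, they map to different columns, so a weight learned for one carries no information about the other.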
 
Metadata
Title
A novel prior-based real-time click through rate prediction model
Authors
Yan Fang
Jinzhi Liu
Publication date
01.12.2014
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 6/2014
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-014-0231-7
