Embedding-based News Recommendation for Millions of Users

Authors:
Shumpei Okura, Yahoo Japan Corporation, Tokyo, Japan
Yukihiro Tagami, Yahoo Japan Corporation, Tokyo, Japan
Shingo Ono, Yahoo Japan Corporation, Tokyo, Japan
Akira Tajima, Yahoo Japan Corporation, Tokyo, Japan

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2017
Pages 1933–1942
https://doi.org/10.1145/3097983.3098108

Published: 13 August 2017

ABSTRACT

Effective news recommendation requires understanding both the content of articles and the preferences of users. ID-based methods, such as collaborative filtering and low-rank factorization, are well known for making recommendations, but they are not suitable for news because candidate articles expire quickly and are replaced with new ones within short spans of time. Word-based methods, which are often used in information retrieval settings, are good candidates in terms of system performance but have difficulty coping with synonyms and orthographical variants and defining "queries" from users' historical activities. This paper proposes an embedding-based method that uses distributed representations in a three-step end-to-end manner: (i) start with distributed representations of articles based on a variant of a denoising autoencoder, (ii) generate user representations by using a recurrent neural network (RNN) with browsing histories as input sequences, and (iii) match and list articles for users based on inner-product operations, taking system performance into consideration. The proposed method performed well in an offline evaluation using past access data on Yahoo! JAPAN's homepage. Based on these experimental results, we implemented it on our actual news distribution system and compared its online performance with the method conventionally incorporated into the system. The click-through rate (CTR) improved by 23% and the total duration improved by 10% compared with the conventional method. Services that incorporate the proposed method are already open to all users and provide recommendations to over ten million individual users per day, who make billions of accesses per month.
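The three steps named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a single sigmoid encoder layer with masking noise in place of the paper's autoencoder variant, a plain Elman-style RNN in place of whatever recurrent unit the paper uses, and random untrained weights. All function names and dimensions are hypothetical, and training is omitted entirely.

```python
import numpy as np


def corrupt(x, rate, rng):
    """Masking noise as used by denoising autoencoders: zero out a random subset of inputs."""
    mask = rng.random(x.shape) >= rate
    return x * mask


def encode_article(x, W, b):
    """Step (i), assumed form: map a bag-of-words vector to a dense article vector."""
    return 1.0 / (1.0 + np.exp(-(W @ x + b)))


def user_state(article_vecs, W_in, W_rec, b):
    """Step (ii), assumed form: run a plain RNN over the browsing history, keep the last state."""
    h = np.zeros(W_rec.shape[0])
    for a in article_vecs:
        h = np.tanh(W_in @ a + W_rec @ h + b)
    return h


def rank_articles(u, candidates, k):
    """Step (iii): score candidates by inner product with the user vector, return the top k."""
    scores = candidates @ u
    order = np.argsort(-scores, kind="stable")[:k]
    return order, scores[order]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vocab, dim = 50, 8  # hypothetical sizes, chosen only for the demo

    # Random, untrained parameters; a real system would learn these from data.
    W_enc, b_enc = rng.normal(size=(dim, vocab)), np.zeros(dim)
    W_in, W_rec, b_rnn = (rng.normal(size=(dim, dim)) * 0.1,
                          rng.normal(size=(dim, dim)) * 0.1,
                          np.zeros(dim))

    # Ten fake articles as binary bag-of-words vectors.
    articles = (rng.random((10, vocab)) < 0.1).astype(float)

    # During training the encoder would see corrupted inputs; shown here for one article.
    noisy = corrupt(articles[0], rate=0.3, rng=rng)
    print("nonzero words before/after corruption:", int(articles[0].sum()), int(noisy.sum()))

    embedded = np.stack([encode_article(x, W_enc, b_enc) for x in articles])

    # The user browsed articles 0, 3 and 7, in that order.
    u = user_state(embedded[[0, 3, 7]], W_in, W_rec, b_rnn)

    top, top_scores = rank_articles(u, embedded, k=3)
    print("recommended article indices:", top.tolist())
```

Because matching reduces to an inner product between one user vector and precomputed article vectors, the expensive encoding work can be done ahead of time and serving amounts to a matrix-vector product followed by a sort.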

Index Terms

1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Learning to rank
    2. Retrieval tasks and goals
      1. Recommender systems
  2. World Wide Web
    1. Web searching and information discovery
      1. Personalization

Published in
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2017
2240 pages
ISBN: 9781450348874
DOI: 10.1145/3097983
General Chairs: Stan Matwin (Dalhousie University), Shipeng Yu (LinkedIn), Faisal Farooq (IBM)
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
distributed representations
large-scale services
neural networks
news recommendations
Qualifiers
- research-article
Acceptance Rates
- KDD '17 paper acceptance rate: 64 of 748 submissions (9%)
- Overall acceptance rate: 1,133 of 8,635 submissions (13%)
Article Metrics
- Total citations: 293
- Total downloads: 6,126
- Downloads (last 12 months): 223
- Downloads (last 6 weeks): 37