skip to main content
10.1145/3097983.3098108acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Embedding-based News Recommendation for Millions of Users

Published:13 August 2017Publication History

ABSTRACT

It is necessary to understand the content of articles and user preferences to make effective news recommendations. While ID-based methods, such as collaborative filtering and low-rank factorization, are well known for making recommendations, they are not suitable for news recommendations because candidate articles expire quickly and are replaced with new ones within short spans of time. Word-based methods, which are often used in information retrieval settings, are good candidates in terms of system performance but have issues such as their ability to cope with synonyms and orthographical variants and define "queries" from users' historical activities. This paper proposes an embedding-based method to use distributed representations in a three step end-to-end manner: (i) start with distributed representations of articles based on a variant of a denoising autoencoder, (ii) generate user representations by using a recurrent neural network (RNN) with browsing histories as input sequences, and (iii) match and list articles for users based on inner-product operations by taking system performance into consideration. The proposed method performed well in an experimental offline evaluation using past access data on Yahoo! JAPAN's homepage. We implemented it on our actual news distribution system based on these experimental results and compared its online performance with a method that was conventionally incorporated into the system. As a result, the click-through rate (CTR) improved by 23% and the total duration improved by 10%, compared with the conventionally incorporated method. Services that incorporated the method we propose are already open to all users and provide recommendations to over ten million individual users per day who make billions of accesses per month.

References

  1. Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Google ScholarGoogle ScholarCross RefCross Ref
  2. Henriette Cramer. 2015. Effects of Ad Quality & Content-Relevance on Perceived Content Quality Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems.Google ScholarGoogle Scholar
  3. Abhinandan S. Das, Mayur Datar, Ashutosh Garg, and Shyam Rajaram 2007. Google News Personalization: Scalable Online Collaborative Filtering Proceedings of the 16th International Conference on World Wide Web.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin 2008. LIBLINEAR: A library for large linear classification. The Journal of Machine Learning Research (2008).Google ScholarGoogle Scholar
  5. Sepp Hochreiter. 1991. Untersuchungen zu dynamischen neuronalen Netzen. Diploma thesis. bibinfoschoolInstitut für Informatik, Lehrstuhl Prof. Brauer, Technische Universit"at München.Google ScholarGoogle Scholar
  6. Sepp Hochreiter and Jürgen Schmidhuber 1997. Long short-term memory. Neural computation (1997).Google ScholarGoogle Scholar
  7. Thorsten Joachims. 2002. Optimizing Search Engines Using Clickthrough Data. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Rafal Jozefowicz, Wojciech Zaremba, and Ilya Sutskever. 2015. An empirical exploration of recurrent network architectures Proceedings of the 32nd International Conference on Machine Learning.Google ScholarGoogle Scholar
  9. Andrej Karpathy and Li Fei-Fei 2015. Deep visual-semantic alignments for generating image descriptions Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle Scholar
  10. Mounia Lalmas, Janette Lehmann, Guy Shaked, Fabrizio Silvestri, and Gabriele Tolomei 2015. Promoting Positive Post-Click Experience for In-Stream Yahoo Gemini Users Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Google ScholarGoogle Scholar
  11. Quoc Le and Tomas Mikolov 2014. Distributed Representations of Sentences and Documents Proceedings of The 31st International Conference on Machine Learning.Google ScholarGoogle Scholar
  12. Shumpei Okura, Yukihiro Tagami, and Akira Tajima. 2016. Article De-duplication Using Distributed Representations Proceedings of the 25th International Conference Companion on World Wide Web.Google ScholarGoogle Scholar
  13. Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. 2014. On the difficulty of training recurrent neural networks Proceedings of The 30th International Conference on Machine Learning.Google ScholarGoogle Scholar
  14. Jay Adams Paul Covington and Emre Sargin 2016. Deep Neural Networks for YouTube Recommendations. Proceedings of the 10th ACM Conference on Recommender Systems. New York, NY, USA.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Tara N Sainath, Oriol Vinyals, Andrew Senior, and Hasim Sak 2015. Convolutional, long short-term memory, fully connected deep neural networks Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on.Google ScholarGoogle ScholarCross RefCross Ref
  16. Tobias Schnabel, Igor Labutov, David Mimno, and Thorsten Joachims 2015. Evaluation methods for unsupervised word embeddings Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing.Google ScholarGoogle Scholar
  17. Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to Sequence Learning with Neural Networks Proceedings of Advances in Neural Information Processing Systems 27.Google ScholarGoogle Scholar
  18. Yukihiro Tagami, Hayato Kobayashi, Shingo Ono, and Akira Tajima 2015. Modeling User Activities on the Web Using Paragraph Vector Proceedings of the 24th International Conference on World Wide Web.Google ScholarGoogle Scholar
  19. Pascal Vincent, Hugo Larochelle, Yoshua Bengio, and Pierre-Antoine Manzagol 2008. Extracting and composing robust features with denoising autoencoders Proceedings of the 25th international conference on Machine learning.Google ScholarGoogle Scholar
  20. Pascal Vincent, Hugo Larochelle, Isabelle Lajoie, Yoshua Bengio, and Pierre-Antoine Manzagol. 2010. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. The Journal of Machine Learning Research (2010).Google ScholarGoogle Scholar
  21. Yuyu Zhang, Hanjun Dai, Chang Xu, Jun Feng, Taifeng Wang, Jiang Bian, Bin Wang, and Tie-Yan Liu. 2014. Sequential click prediction for sponsored search with recurrent neural networks Proceedings of the 28th AAAI Conference on Artificial Intelligence.Google ScholarGoogle Scholar
  22. Erheng Zhong, Nathan Liu, Yue Shi, and Suju Rajan. 2015. Building Discriminative User Profiles for Large-scale Content Recommendation Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. endthebibliographyGoogle ScholarGoogle Scholar

Index Terms

  1. Embedding-based News Recommendation for Millions of Users

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
          August 2017
          2240 pages
          ISBN:9781450348874
          DOI:10.1145/3097983

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 August 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          KDD '17 Paper Acceptance Rate64of748submissions,9%Overall Acceptance Rate1,133of8,635submissions,13%

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader