skip to main content
10.1145/3132847.3132926acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Neural Attentive Session-based Recommendation

Published:06 November 2017Publication History

ABSTRACT

Given e-commerce scenarios that user profiles are invisible, session-based recommendation is proposed to generate recommendation results from short sessions. Previous work only considers the user's sequential behavior in the current session, whereas the user's main purpose in the current session is not emphasized. In this paper, we propose a novel neural networks framework, i.e., Neural Attentive Recommendation Machine (NARM), to tackle this problem. Specifically, we explore a hybrid encoder with an attention mechanism to model the user's sequential behavior and capture the user's main purpose in the current session, which are combined as a unified session representation later. We then compute the recommendation scores for each candidate item with a bi-linear matching scheme based on this unified session representation. We train NARM by jointly learning the item and session representations as well as their matchings. We carried out extensive experiments on two benchmark datasets. Our experimental results show that NARM outperforms state-of-the-art baselines on both datasets. Furthermore, we also find that NARM achieves a significant improvement on long sessions, which demonstrates its advantages in modeling the user's sequential behavior and main purpose simultaneously.

References

  1. G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17 (6): 734--749, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Amodei, R. Anubhai, E. Battenberg, C. Case, J. Casper, B. Catanzaro, J. Chen, M. Chrzanowski, A. Coates, G. Diamos, et al. Deep speech 2: end-to-end speech recognition in english and mandarin. In Proceedings of the 33rd. International Conference on Machine Learning, pages 173--182, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Chen, J. L. Moore, D. Turnbull, and T. Joachims. Playlist prediction via metric embedding. In Proceedings of the 18th. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 714--722, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Davidson, B. Liebald, J. Liu, P. Nandy, T. Van Vleet, U. Gargi, S. Gupta, Y. He, M. Lambert, B. Livingston, et al. The youtube video recommendation system. In Proceedings of the 4th. ACM Conference on Recommender Systems, pages 293--296, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. De Vine, G. Zuccon, B. Koopman, L. Sitbon, and P. Bruza. Medical semantic similarity with a neural language model. In Proceedings of the 23rd. ACM International Conference on Conference on Information and Knowledge Management, pages 1819--1822, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. M. Elkahky, Y. Song, and X. He. A multi-view deep learning approach for cross domain user modeling in recommendation systems. In Proceedings of the 24th. International Conference on World Wide Web, pages 278--288, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Graves, A.-r. Mohamed, and G. Hinton. Speech recognition with deep recurrent neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 6645--6649, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  8. He, Zhang, Ren, and Sun}he2016deepK. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 770--778, 2016 a.Google ScholarGoogle ScholarCross RefCross Ref
  9. X. He and T.-S. Chua. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th. International ACM SIGIR conference on Research and Development in Information Retrieval, pages 355--364, 2017. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. He, Zhang, Kan, and Chua}he2016fastX. He, H. Zhang, M.-Y. Kan, and T.-S. Chua. Fast matrix factorization for online recommendation with implicit feedback. In Proceedings of the 39th. International ACM SIGIR conference on Research and Development in Information Retrieval, pages 549--558, 2016 b. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. X. He, L. Liao, H. Zhang, L. Nie, X. Hu, and T.-S. Chua. Neural collaborative filtering. In Proceedings of the 26th. International Conference on World Wide Web, pages 173--182, 2017. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. Hidasi, A. Karatzoglou, L. Baltrunas, and D. Tikk. Session-based recommendations with recurrent neural networks. In Proceedings of the 4th. International Conference on Learning Representations, 2016.Google ScholarGoogle Scholar
  13. G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, and T. N. Sainath. Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Processing Magazine, 29 (6): 82--97, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  14. S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9 (8): 1735--1780, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Kingma and J. Ba. Adam: a method for stochastic optimization. In Proceedings of the 4th. International Conference on Learning Representations, 2015.Google ScholarGoogle Scholar
  16. Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 42 (8): 30--37, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Proceedings of the 25th. International Conference on Neural Information Processing Systems, pages 1097--1105, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Li, Wang, Lam, Ren, and Bing}li2017salienceP. Li, Z. Wang, W. Lam, Z. Ren, and L. Bing. Salience estimation via variational auto-encoders for multi-document summarization. In Proceedings of the 31st. AAAI Conference on Artificial Intelligence, pages 3497--3503, 2017 a.Google ScholarGoogle Scholar
  19. Li, Wang, Ren, Bing, and Lam}li2017neuralP. Li, Z. Wang, Z. Ren, L. Bing, and W. Lam. Neural rating regression with abstractive tips generation for recommendation. In Proceedings of the 40th. International ACM SIGIR conference on Research and Development in Information Retrieval, pages 345--354, 2017 b. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Linden, B. Smith, and J. York. Amazon.com recommendations: item-to-item collaborative filtering. IEEE Internet Computing, 7 (1): 76--80, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Q. Liu, T. Chen, J. Cai, and D. Yu. Enlister: baidu's recommender system for the biggest chinese q&a website. In Proceedings of the 6th. ACM Conference on Recommender Systems, pages 285--288, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th. International Conference on Neural Information Processing Systems, pages 3111--3119, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. Mild and T. Reutterer. An improved collaborative filtering approach for predicting cross-category purchases based on binary market basket data. Journal of Retailing and Consumer Services, 10 (3): 123--133, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  24. A. Mnih and G. Hinton. A scalable hierarchical distributed language model. In Proceedings of the 21st. International Conference on Neural Information Processing Systems, pages 1081--1088, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. B. Mobasher, H. Dai, T. Luo, and M. Nakagawa. Using sequential and non-sequential patterns in predictive web usage mining tasks. In Proceedings of the IEEE International Conference on Data Mining, pages 669--672, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Ren, Chen, Ren, Wei, Ma, and de Rijke}ren2017leveragingP. Ren, Z. Chen, Z. Ren, F. Wei, J. Ma, and M. de Rijke. Leveraging contextual sentence relations for extractive summarization using a neural attention model. In Proceedings of the 40th. International ACM SIGIR conference on Research and Development in Information Retrieval, pages 95--104, 2017 a. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Ren, Liang, Li, Wang, and de Rijke}ren2017socialZ. Ren, S. Liang, P. Li, S. Wang, and M. de Rijke. Social collaborative viewpoint regression with explainable recommendations. In Proceedings of the 10th. ACM International Conference on Web Search and Data Mining, pages 485--494, 2017 b. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. S. Rendle, C. Freudenthaler, Z. Gantner, and L. Schmidt-Thieme. Bpr: bayesian personalized ranking from implicit feedback. In Proceedings of the 25th. Conference on Uncertainty in Artificial Intelligence, pages 452--461, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. S. Rendle, C. Freudenthaler, and L. Schmidt-Thieme. Factorizing personalized markov chains for next-basket recommendation. In Proceedings of the 19th. International Conference on World Wide Web, pages 811--820, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. O. Rsoy and C. Cardie. Deep recursive neural networks for compositionality in language. In Proceedings of the 27th. International Conference on Neural Information Processing Systems, pages 2096--2104, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. R. Salakhutdinov, A. Mnih, and G. Hinton. Restricted boltzmann machines for collaborative filtering. In Proceedings of the 24th. International Conference on Machine Learning, pages 791--798, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th. International Conference on World Wide Web, pages 285--295, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. J. B. Schafer, J. Konstan, and J. Riedl. Recommender systems in e-commerce. In Proceedings of the 1st. ACM Conference on Electronic Commerce, pages 158--166, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. S. Sedhain, A. K. Menon, S. Sanner, and L. Xie. Autorec: autoencoders meet collaborative filtering. In Proceedings of the 24th. International Conference on World Wide Web, pages 111--112, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. L. Shang, Z. Lu, and H. Li. Neural responding machine for short-text conversation. In Proceedings of the 53rd. Annual Meeting of the Association for Computational Linguistics, pages 1577--1586, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  36. G. Shani, D. Heckerman, and R. I. Brafman. An mdp-based recommender system. Journal of Machine Learning Research, 6 (1): 1265--1295, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. R. Socher, C. Y. Lin, A. Y. Ng, and C. D. Manning. Parsing natural scenes and natural language with recursive neural networks. In Proceedings of the 28th. International Conference on Machine Learning, pages 129--136, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. H. Song, Z. Ren, S. Liang, P. Li, J. Ma, and M. de Rijke. Summarizing answers in non-factoid community question-answering. In Proceedings of the 10th. ACM International Conference on Web Search and Data Mining, pages 405--414, 2017. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. X. Su and T. M. Khoshgoftaar. A survey of collaborative filtering techniques. Advances in Artificial Intelligence, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Y. K. Tan, X. Xu, and Y. Liu. Improved recurrent neural networks for session-based recommendations. In Proceedings of the 1st. Workshop on Deep Learning for Recommender Systems, pages 17--22, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. P. Wang, J. Guo, Y. Lan, J. Xu, S. Wan, and X. Cheng. Learning hierarchical representation model for nextbasket recommendation. In Proceedings of the 38th. International ACM SIGIR conference on Research and Development in Information Retrieval, pages 403--412, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. M. Weimer, A. Karatzoglou, Q. V. Le, and A. Smola. Maximum margin matrix factorization for collaborative ranking. In Proceedings of the 20th. International Conference on Neural Information Processing Systems, pages 1--8, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Y. Wu, C. Dubois, A. X. Zheng, and M. Ester. Collaborative denoising auto-encoders for top-n recommender systems. In Proceedings of the 9th. ACM International Conference on Web Search and Data Mining, pages 153--162, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. G. E. Yap, X. L. Li, and P. S. Yu. Effective next-items recommendation via personalized sequential pattern mining. In Proceedings of the 17th. International Conference on Database Systems for Advanced Applications, pages 48--64, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Y. Zhang, H. Dai, C. Xu, J. Feng, T. Wang, J. Bian, B. Wang, and T.-Y. Liu. Sequential click prediction for sponsored search with recurrent neural networks. In Proceedings of the 28th. AAAI Conference on Artificial Intelligence, pages 1369--1375, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. A. Zimdars, D. M. Chickering, and C. Meek. Using temporal data for making recommendations. In Proceedings of the 17th. Conference on Uncertainty in Artificial Intelligence, pages 580--588, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Neural Attentive Session-based Recommendation

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
          November 2017
          2604 pages
          ISBN:9781450349185
          DOI:10.1145/3132847

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 6 November 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          CIKM '17 Paper Acceptance Rate171of855submissions,20%Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader