DOI: 10.1145/3298689.3347058
research-article
Best Paper

Are we really making much progress? A worrying analysis of recent neural recommendation approaches

Published: 10 September 2019

ABSTRACT

Deep learning techniques have become the method of choice for researchers working on algorithmic aspects of recommender systems. With the sharply increased interest in machine learning in general, it has become difficult to keep track of what currently represents the state of the art, e.g., for top-n recommendation tasks. At the same time, several recent publications point out problems in today's research practice in applied machine learning, e.g., in terms of the reproducibility of results or the choice of baselines when proposing new models.

In this work, we report the results of a systematic analysis of algorithmic proposals for top-n recommendation tasks. Specifically, we considered 18 algorithms that were presented at top-level research conferences in recent years. Only 7 of them could be reproduced with reasonable effort. Of these 7, however, 6 can often be outperformed by comparably simple heuristic methods, e.g., based on nearest-neighbor or graph-based techniques. The remaining one clearly outperformed the baselines but did not consistently outperform a well-tuned non-neural linear ranking method. Overall, our work sheds light on a number of potential problems in today's machine learning scholarship and calls for improved scientific practices in this area.
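
To give a sense of how simple a nearest-neighbor heuristic for top-n recommendation can be, the following is a minimal sketch of an item-based variant using cosine similarity on implicit feedback. It is not the implementation evaluated in the paper. The function names, the toy interaction matrix, and the parameters `k`, `n`, and `shrink` are all assumptions made for illustration.

```python
import numpy as np


def item_knn_scores(interactions, k=2, shrink=0.0):
    """Score every item for every user with an item-based kNN heuristic.

    interactions: (n_users, n_items) binary array of implicit feedback.
    k: neighbors kept per item (hypothetical default, would need tuning).
    shrink: optional shrinkage added to the cosine denominator.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    X = np.asarray(interactions, dtype=float)
    co = X.T @ X                           # item-item co-occurrence counts
    norms = np.sqrt(np.diag(co))           # L2 norm of each item column
    sim = co / (np.outer(norms, norms) + shrink + 1e-9)
    np.fill_diagonal(sim, 0.0)             # an item is not its own neighbor
    if k < sim.shape[0]:
        # Zero out everything except the k largest similarities per row.
        drop = np.argsort(sim, axis=1)[:, :-k]
        np.put_along_axis(sim, drop, 0.0, axis=1)
    # score(u, j) = sum of sim(j, i) over the items i in the user's profile
    return X @ sim.T


def recommend_top_n(interactions, n=2, k=2):
    """Return the indices of the n best-scoring unseen items per user."""
    X = np.asarray(interactions, dtype=float)
    scores = item_knn_scores(X, k=k)
    scores[X > 0] = -np.inf                # never recommend seen items
    return np.argsort(-scores, axis=1, kind="stable")[:, :n]


# Toy data (made up): 4 users, 4 items, 1 = user interacted with item.
toy = np.array([
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 1, 1, 0],
])
top1 = recommend_top_n(toy, n=1, k=2)
print(top1.ravel().tolist())
```

In a sketch like this, the neighborhood size and the shrinkage term are the main hyperparameters, and a baseline of this kind is only a fair comparison point if they are tuned on validation data.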


Published in

RecSys '19: Proceedings of the 13th ACM Conference on Recommender Systems
September 2019
635 pages
ISBN: 9781450362436
DOI: 10.1145/3298689

Copyright © 2019 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery
New York, NY, United States

Acceptance Rates

RecSys '19 paper acceptance rate: 36 of 189 submissions, 19%
Overall acceptance rate: 254 of 1,295 submissions, 20%
