research-article

HybridCite: A Hybrid Model for Context-Aware Citation Recommendation

Authors:
Michael Färber, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Ashwath Sampath, University of Freiburg, Freiburg, Germany

JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, August 2020, Pages 117–126
https://doi.org/10.1145/3383583.3398534

Published: 01 August 2020

ABSTRACT

Citation recommendation systems aim to recommend citations for either a complete paper or a small portion of text called a citation context. Recommending citations for citation contexts is called local citation recommendation and is the focus of this paper. We first develop citation recommendation approaches based on embeddings, topic modeling, and information retrieval techniques. We then combine the best-performing algorithms into a semi-genetic hybrid recommender system for citation recommendation, which, to the best of our knowledge, has not been done before. We evaluate the individual approaches and the hybrid approach offline on several data sets, such as the Microsoft Academic Graph (MAG) and the MAG in combination with arXiv and ACL. We further conduct a user study to evaluate our approaches online. Our evaluation results show that a hybrid model containing embedding-based and information retrieval-based components outperforms its individual components and further algorithms by a large margin.
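
The abstract does not spell out how the component rankings are merged, so the following is only a minimal sketch of the general idea behind a hybrid recommender: two components each return a ranked list of candidate papers for a citation context, and the lists are fused into one ranking. The sketch uses reciprocal rank fusion, a standard rank-merging technique swapped in here for illustration; it is not the paper's semi-genetic procedure. The function name, the constant `k=60`, and the paper identifiers are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(ranked_lists, k=60, top_n=10):
    """Fuse several ranked candidate lists into a single ranking.

    Each candidate receives 1 / (k + rank) from every list it appears in,
    so candidates ranked highly by several components rise to the top.
    """
    scores = defaultdict(float)
    for ranking in ranked_lists:
        for rank, paper_id in enumerate(ranking, start=1):
            scores[paper_id] += 1.0 / (k + rank)
    # Sort by descending fused score; break ties by identifier so the
    # result is deterministic.
    fused = sorted(scores, key=lambda paper_id: (-scores[paper_id], paper_id))
    return fused[:top_n]


# Hypothetical outputs of an embedding-based component and an
# information retrieval-based component for one citation context.
embedding_hits = ["p3", "p1", "p7"]
retrieval_hits = ["p1", "p9", "p3"]

fused = reciprocal_rank_fusion([embedding_hits, retrieval_hits], top_n=3)
print(fused)
```

In this toy input, papers returned by both components outrank papers returned by only one, which is the behavior a hybrid of complementary components is meant to exploit.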

Published in
JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020
August 2020
611 pages
ISBN: 9781450375856
DOI: 10.1145/3383583
General Chairs:
Ruhua Huang, Wuhan University, China
Dan Wu, Wuhan University, China
Gary Marchionini, University of North Carolina at Chapel Hill, USA
Program Chairs:
Daqing He, University of Pittsburgh, USA
Sally Jo Cunningham, University of Waikato, New Zealand
Preben Hansen, Stockholm University, Sweden
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
digital libraries
machine learning
recommender systems
Acceptance Rates
Overall Acceptance Rate: 415 of 1,482 submissions, 28%