Published in: International Journal on Digital Libraries, Issue 2/2015

1 June 2015

A comprehensive evaluation of scholarly paper recommendation using potential citation papers

Authors: Kazunari Sugiyama, Min-Yen Kan


Abstract

To help generate relevant suggestions for researchers, recommendation systems have started to leverage the latent interests in the publication profiles of the researchers themselves. While using such a publication citation network has been shown to enhance performance, the network is often sparse, making recommendation difficult. To alleviate this sparsity, in our earlier work we identified “potential citation papers” through collaborative filtering. As a secondary contribution, because different logical sections of a paper carry different significance, we also investigated which sections can be leveraged to represent papers effectively. While this initial approach works well for researchers vested in a single discipline, it generates poor predictions for scientists who work on several different topics within the discipline (hereafter, “intra-disciplinary”). In this paper, we therefore extend our previous work by proposing an adaptive neighbor selection method that overcomes this problem in our imputation-based collaborative filtering framework. On a publicly available scholarly paper recommendation dataset, we show that our adaptive neighbor selection method significantly outperforms state-of-the-art recommendation baselines in recommendation accuracy, as measured by nDCG and MRR. While recommendation performance improves for all researchers, the gains are more marked for intra-disciplinary researchers, showing that our method does address the targeted audience.
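The abstract reports accuracy in terms of nDCG and MRR. As a rough illustration of what these two ranking metrics measure, the following Python sketch computes both for toy relevance lists. The function names, the log2 rank discount, the choice to normalize against the re-sorted input list, and the example data are all assumptions made for illustration; this page does not state the exact formulation used in the paper.

```python
import math


def dcg(relevances, k):
    """Discounted cumulative gain over the top-k items.

    Assumes the common rel / log2(rank + 1) discount; other variants exist.
    """
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


def ndcg(relevances, k):
    """DCG divided by the DCG of the same items in ideal (descending) order."""
    ideal = dcg(sorted(relevances, reverse=True), k)
    return dcg(relevances, k) / ideal if ideal > 0 else 0.0


def mrr(ranked_lists):
    """Mean reciprocal rank of the first relevant item in each ranked list.

    A list with no relevant item contributes 0 to the mean.
    """
    if not ranked_lists:
        return 0.0
    total = 0.0
    for relevances in ranked_lists:
        for rank, rel in enumerate(relevances, start=1):
            if rel > 0:
                total += 1.0 / rank
                break
    return total / len(ranked_lists)


# Hypothetical toy data: 1 = relevant paper, 0 = not relevant,
# listed in the order a recommender ranked them.
one_researcher = [1, 0, 1]
all_researchers = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]

print(ndcg(one_researcher, k=3))
print(mrr(all_researchers))
```

Both metrics reward placing relevant papers near the top of the list: nDCG considers every relevant item in the top k, while MRR looks only at the position of the first relevant one.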


Metadata
Title
A comprehensive evaluation of scholarly paper recommendation using potential citation papers
Authors
Kazunari Sugiyama
Min-Yen Kan
Publication date
1 June 2015
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Digital Libraries / Issue 2/2015
Print ISSN: 1432-5012
Electronic ISSN: 1432-1300
DOI
https://doi.org/10.1007/s00799-014-0122-2
