skip to main content
10.1145/1935826.1935926acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
poster

Citation recommendation without author supervision

Authors Info & Claims
Published:09 February 2011Publication History

ABSTRACT

Automatic recommendation of citations for a manuscript is highly valuable for scholarly activities since it can substantially improve the efficiency and quality of literature search. The prior techniques placed a considerable burden on users, who were required to provide a representative bibliography or to mark passages where citations are needed. In this paper we present a system that considerably reduces this burden: a user simply inputs a query manuscript (without a bibliography) and our system automatically finds locations where citations are needed. We show that naïve approaches do not work well due to massive noise in the document corpus. We produce a successful approach by carefully examining the relevance between segments in a query manuscript and the representative segments extracted from a document corpus. An extensive empirical evaluation using the CiteSeerX data set shows that our approach is effective.

References

  1. S. Aya, C. Lagoze, and T. Joachims. Citation classification and its applications. ICKM, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  2. C. Basu, H. Hirsh, W. Cohen, and C. Nevill-Manning. Technical paper recommendation: A study in combining multiple information sources. J. of Artificial Intelligence Research, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. J. Machine Learning Research, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Broder, M. Fontoura, V. Josifovski, and L. Riedel. A semantic approach to contextual advertising. SIGIR, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. K. Chandrasekaran, S. Gauch, P. Lakkaraju, and H. Luong. Concept-Based Document Recommendations for CiteSeer Authors. Adaptive Hypermedia and Adaptive Web-Based Systems, Springer, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Cohn and T. Hofmann. The missing link -- a probabilistic model of document content and hypertext connectivity. NIPS, 2001.Google ScholarGoogle Scholar
  7. R. Durrett. Probability: Theory and Examples. Duxbury Press, 2nd edition, 1995.Google ScholarGoogle Scholar
  8. E. Erosheva, S. Fienberg, and J. Lafferty. Mixed membership models of scientific publications. PNAS, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  9. Q. He, J. Pei, D. Kifer, P. Mitra, and L. Giles. Context-aware citation recommendation. WWW, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Huang, G. Xue, B. Zhang, Z. Chen, Y. Yu, and W. Ma. Tssp: A reinforcement algorithm to find related papers. WI, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Kataria, P. Mitra, and S. Bhatia. Utilizing context in generative bayesian models for linked corpus. AAAI, 2010.Google ScholarGoogle Scholar
  12. S. M. Katz. Estimation of probabilities from sparse data for the language model component of a speech recogniser. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987.Google ScholarGoogle Scholar
  13. J. Kleinberg. Bursty and hierarchical structure in streams. SIGKDD, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. CIKM, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. McNee, I. Albert, D. Cosley, P. Gopalkrishnan, S. Lam, A. Rashid, J. Konstan, and J. Riedl. On the recommending of citations for research papers. CSCW, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Nallapati, A. Ahmed, E. Xing, and W. Cohen. Joint latent topic models for text and citations. SIGKDD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. Research and Development in Information Retrieval, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. R. Quinlan. C4.5: programs for machine learning. Morgan Kaufmann, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Ribeiro-Neto, M. Cristo, P. B. Golgher, and E. S. de Moura. Impedance coupling in contenttargeted advertising. SIGIR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Rijsbergen. The Geometry of Information Retrieval. Cambridge University Press, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. Ritchie. Citation context analysis for information retrieval. PhD thesis, University of Cambridge, 2008.Google ScholarGoogle Scholar
  22. B. Shaparenko and T. Joachims. Identifying the original contribution of a document via language modeling. ECML, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. Simon. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches. Wiley-Interscience, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. T. Strohman, B. Croft, and D. Jensen. Recommending citations for academic papers. SIGIR, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Tang and J. Zhang. A discriminative approach to topic-based citation recommendations. PAKDD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. R. Torres, S. McNee, M. Abel, J. Konstan, and J. Riedl. Enhancing digitial libraries with techlens. JCDL, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. V. von Brzeski, U. Irmak, and R. Kraft. Leveraging context in user-centric entity detection systems. CIKM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. F. Wang, B. Chen, and Z. Miao. A survey on reviewer assignment problem. IEA/AIE, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. W. Yih, J. Goodman, and V. R. Carvalho. Finding advertising keywords on web pages. WWW, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Y. Zhao and G. Karypis. Hierarchical clustering algorithms for document datasets. Data Mining and Knowledge Discovery, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. D. Zhou, S. Zhu, K. Yu, X. Song, B. Tseng, H. Zha, and L. Giles. Learning multiple graphs for document recommendations. WWW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Citation recommendation without author supervision

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WSDM '11: Proceedings of the fourth ACM international conference on Web search and data mining
      February 2011
      870 pages
      ISBN:9781450304931
      DOI:10.1145/1935826

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 February 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      WSDM '11 Paper Acceptance Rate83of372submissions,22%Overall Acceptance Rate498of2,863submissions,17%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader