skip to main content
10.1145/2339530.2339730acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Cross-domain collaboration recommendation

Authors Info & Claims
Published:12 August 2012Publication History

ABSTRACT

Interdisciplinary collaborations have generated huge impact to society. However, it is often hard for researchers to establish such cross-domain collaborations. What are the patterns of cross-domain collaborations? How do those collaborations form? Can we predict this type of collaborations?

Cross-domain collaborations exhibit very different patterns compared to traditional collaborations in the same domain: 1) sparse connection: cross-domain collaborations are rare; 2) complementary expertise: cross-domain collaborators often have different expertise and interest; 3) topic skewness: cross-domain collaboration topics are focused on a subset of topics. All these patterns violate fundamental assumptions of traditional recommendation systems.

In this paper, we analyze the cross-domain collaboration data from research publications and confirm the above patterns. We propose the Cross-domain Topic Learning (CTL) model to address these challenges. For handling sparse connections, CTL consolidates the existing cross-domain collaborations through topic layers instead of at author layers, which alleviates the sparseness issue. For handling complementary expertise, CTL models topic distributions from source and target domains separately, as well as the correlation across domains. For handling topic skewness, CTL only models relevant topics to the cross-domain collaboration.

We compare CTL with several baseline approaches on large publication datasets from different domains. CTL outperforms baselines significantly on multiple recommendation metrics. Beyond accurate recommendation performance, CTL is also insensitive to parameter tuning as confirmed in the sensitivity analysis.

Skip Supplemental Material Section

Supplemental Material

311a_w_talk_3.mp4

mp4

384.6 MB

References

  1. R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. ACM Press, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Balabanović and Y. Shoham. Fab: content-based, collaborative recommendation. Commun. ACM, 40:66--72, March 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. K. Balog, L. Azzopardi, and M. de Rijke. Formal models for expert finding in enterprise corpora. In SIGIR'06, pages 43--55, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. W. Buntine and A. Jakulin. Applying discrete pca in data analysis. In UAI'04, pages 59--66, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Chakrabarti and C. Faloutsos. Graph mining: Laws, generators, and algorithms. ACM Comput. Surv., 38(1):2, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H.-H. Chen, L. Gou, X. Zhang, and C. L. Giles. Collabseer: a search engine for collaboration discovery. In JCDL'11, pages 231--240, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Das, M. Datar, A. Garg, and S. Rajaram. Google news personalization: Scalable online collaborative filtering. In WWW'07, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Deshpande and G. Karypis. Item-based top-n recommendation algorithms. ACM Trans. Inf. Syst., 22(1):143--177, Jan. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Doucet, N. de Freitas, K. Murphy, and S. Russell. Rao-blackwellised particle filtering for dynamic bayesian networks. In UAI'00, pages 176--183, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Faloutsos, P. Faloutsos, and C. Faloutsos. On power-law relationships of the internet topology. In SIGCOMM'99, pages 251--262, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Granovetter. The strength of weak ties. American Journal of Sociology, 78(6):1360--1380, 1973.Google ScholarGoogle ScholarCross RefCross Ref
  13. T. L. Griffiths and M. Steyvers. Finding scientific topics. In PNAS'04, pages 5228--5235, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  14. G. Heinrich. Parameter estimation for text analysis. Technical report, University of Leipzig, Germany, 2004.Google ScholarGoogle Scholar
  15. T. Hofmann. Probabilistic latent semantic indexing. In SIGIR'99, pages 50--57, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. Kautz, B. Selman, and M. Shah. Referral web: Combining social networks and collaborative filtering. Communications of the ACM, 40(3):63--65, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. I. Konstas, V. Stathopoulos, and J. M. Jose. On social networks and collaborative recommendation. In SIGIR'09, pages 195--202, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Leskovec and C. Faloutsos. Sampling from large graphs. In KDD'06, pages 631--636, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Leskovec, D. Huttenlocher, and J. Kleinberg. Predicting positive and negative links in online social networks. In WWW'10, pages 641--650, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Liben-Nowell and J. M. Kleinberg. The link-prediction problem for social networks. JASIST, 58(7):1019--1031, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Lichtenwalter, J. T. Lussier, and N. V. Chawla. New perspectives and methods in link prediction. In KDD'10, pages 243--252, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. L. Lovasz. Random walks on graphs: A survey. Combinatorics, 2(1):1?6, 1993.Google ScholarGoogle Scholar
  23. D. Mimno and A. McCallum. Expertise modeling for matching papers with reviewers. In KDD'07, pages 500--509, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Quackenbush. Microarray analysis and tumor classification. New England Journal of Medicine, 354:2463--2472, June 2006.Google ScholarGoogle ScholarCross RefCross Ref
  25. D. Sculley. Combined regression and ranking. In KDD'10, pages 979--988, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Y. Shi, D. Ye, A. Goder, and S. Narayanan. A large scale machine learning system for recommending heterogeneous content in social networks. In SIGIR'11, pages 1337--1338, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Steyvers, P. Smyth, and T. Griffiths. Probabilistic author-topic models for information discovery. In KDD'04, pages 306--315, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Sun, H. Qu, D. Chakrabarti, and C. Faloutsos. Neighborhood formation and anomaly detection in bipartite graphs. In ICDM'05, pages 418--425, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Tang, L. Yao, and D. Chen. Multi-topic based query-oriented summarization. In SDM'09, pages 1147--1158, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  30. J. Tang, J. Zhang, R. Jin, Z. Yang, K. Cai, L. Zhang, and Z. Su. Topic level expertise search over heterogeneous networks. Machine Learning Journal, 82(2):211--237, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. J. Tang, J. Zhang, L. Yao, J. Li, L. Zhang, and Z. Su. Arnetminer: Extraction and mining of academic social networks. In KDD'08, pages 990--998, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. L. Tang and H. Liu. Relational learning via latent social dimensions. In KDD'09, pages 817--826, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. W. Tang, J. Tang, T. Lei, C. Tan, B. Gao, and T. Li. On optimization of expertise matching with various constraints. Neurocomputing, 76(1):71--83, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In KDD'11, pages 448--456, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Q. Yuan, L. Chen, and S. Zhao. Factorization vs. regularization: fusing heterogeneous social relationships in top-n recommendation. In RecSys'11, pages 245--252, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. J. Zhang, J. Tang, and J. Li. Expert finding in a social network. In DASFAA'07, pages 1066--1069, 2007.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Cross-domain collaboration recommendation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2012
        1616 pages
        ISBN:9781450314626
        DOI:10.1145/2339530

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 12 August 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader