skip to main content
10.1145/1835449.1835577acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
poster

Closed form solution of similarity algorithms

Authors Info & Claims
Published:19 July 2010Publication History

ABSTRACT

Algorithms defining similarities between objects of an information network are important of many IR tasks. SimRank algorithm and its variations are popularly used in many applications. Many fast algorithms are also developed. In this note, we first reformulate them as random walks on the network and express them using forward and backward transition probably in a matrix form. Second, we show that P-Rank (SimRank is only the special case of P-Rank) has a unique solution of eeT when decay factor c is equal to 1. We also show that SimFusion algorithm is a special case of P-Rank algorithm and prove that the similarity matrix of SimFusion is the product of PageRank vector. Our experiments on the web datasets show that for P-Rank the decay factor c doesn't seriously affect the similarity accuracy and accuracy of P-Rank is also higher than SimFusion and SimRank.

References

  1. Glen, J. and Widom, J., 2002, SimRank: a measure of structural-context similarity., SIGKDD, 538--543. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Zhao, P.X., Han, J.W. and Sun, Y.Z., 2009, P-Rank: a comprehensive structural similarity measure over information networks., CIKM, 553--562. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Xi, W.S., Fox, E.A. and Fan, W.G., Zhang, B.Y., Chen, Z., Yan, J., Zhuang, D., 2005, SimFusion: measuring similarity using unified relationship matrix, SIGIR, 130--137. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. CMU Four University Dataset, http://www.cs.cmu.edu/afs/cs/project/theo-20/www/data/Google ScholarGoogle Scholar
  5. Jvelin K. and Kenen J., 2002, Cumulated gain-based evaluation of IR techniques, ACM Transactions on Information Systems, 422 -- 446. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Closed form solution of similarity algorithms

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
      July 2010
      944 pages
      ISBN:9781450301534
      DOI:10.1145/1835449

      Copyright © 2010 Copyright is held by the owner/author(s)

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 19 July 2010

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      SIGIR '10 Paper Acceptance Rate87of520submissions,17%Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader