skip to main content
10.1145/1060745.1060828acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

Object-level ranking: bringing order to Web objects

Published:10 May 2005Publication History

ABSTRACT

In contrast with the current Web search methods that essentially do document-level ranking and retrieval, we are exploring a new paradigm to enable Web search at the object level. We collect Web information for objects relevant for a specific application domain and rank these objects in terms of their relevance and popularity to answer user queries. Traditional PageRank model is no longer valid for object popularity calculation because of the existence of heterogeneous relationships between objects. This paper introduces PopRank, a domain-independent object-level link analysis model to rank the objects within a specific domain. Specifically we assign a popularity propagation factor to each type of object relationship, study how different popularity propagation factors for these heterogeneous relationships could affect the popularity ranking, and propose efficient approaches to automatically decide these factors. Our experiments are done using 1 million CS papers, and the experimental results show that PopRank can achieve significantly better ranking results than naively applying PageRank on the object graph.

References

  1. Citeseer. Scientific Literature Digital Library. http://citeseer.ist.psu.edu.Google ScholarGoogle Scholar
  2. N. Ashish and C. Knoblock. Wrapper generation for semi-structured internet sources. In Proc. Workshop on Management of Semistructured Data, Tucson, 1997.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Andrey Balmin, Vagelis Hristidis, and Yannis Papakonstantinou. Authority-based keyword queries in databases using objectrank. In Very Large Data Bases (VLDB), 2004.Google ScholarGoogle Scholar
  4. Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 30(1--7):107--117, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Deng Cai, Xiaofei He, Ji-Rong Wen, and Wei-Ying Ma. Block-level link analysis. In ACM SIGIR Conference (SIGIR), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Junghoo Cho and Sourashis Roy. Impact of web search engines on page popularity. In In Proceedings of the World-Wide Web Conference (WWW), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. Xrank: Ranked keyword search over xml documents. In ACM SIGMOD, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Bin He, Kevin Chen chuan Chang, and Jiawei Han. Discovering complex matchings across web query interfaces: a correlation mining approach. In Knowledge Discovery and Data Mining (KDD), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi. Optimization by simulated annealing. Science, 220(4598), 1983.Google ScholarGoogle Scholar
  10. J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5), 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Nickolas Kushmerick, Daniel S. Weld, and Robert B. Doorenbos. Wrapper induction for information extraction. In Intl. Joint Conference on Artificial Intelligence (IJCAI), pages 729--737, 1997.Google ScholarGoogle Scholar
  12. Bing Liu, Robert Grossman, and Yanhong Zhai. Mining data records in web pages. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Libraries, 1998.Google ScholarGoogle Scholar
  14. Ruihua Song, Haifeng Liu, Ji-Rong Wen, and Wei-Ying Ma. Learning block importance models for web pages. In World Wide Web conference (WWW), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Sheila Tejada, Craig A. Knoblock, and Steven Minton. Learning domain-independent string transformation weights for high accuracy object identification. In Knowledge Discovery and Data Mining (KDD), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Jiying Wang, Ji-Rong Wen, Frederick H. Lochovsky, and Wei-Ying Ma. Instance-based schema matching for web databases by domain-specific query probing. In Very Large Data Bases (VLDB), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Wensi Xi, Benyu Zhang, Yizhou Lu, Zheng Chen, Shuicheng Yan, Huajun Zeng, Wei-Ying Ma, and Edward A. Fox. Link fusion: A unified link analysis framework for multi-type interrelated data objects. In WWW, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Object-level ranking: bringing order to Web objects

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            WWW '05: Proceedings of the 14th international conference on World Wide Web
            May 2005
            781 pages
            ISBN:1595930469
            DOI:10.1145/1060745

            Copyright © 2005 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 10 May 2005

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            Overall Acceptance Rate1,899of8,196submissions,23%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader