Expert agreement and content based reranking in a meta search environment using Mearf

Authors: B. Uygar Oztekin, George Karypis, Vipin Kumar (all University of Minnesota)

WWW '02: Proceedings of the 11th international conference on World Wide Web, May 2002, Pages 333–344. https://doi.org/10.1145/511446.511490

Published: 07 May 2002

ABSTRACT

The recent increase in the number of search engines on the Web, and the availability of meta search engines that can query several of them at once, make it important to find effective methods for combining results from different sources. In this paper we introduce novel methods for reranking in a meta search environment based on expert agreement and the contents of the snippets. We also introduce an objective way of evaluating different methods for ranking search results, based on implicit user judgements. We incorporated our methods and two variations of commonly used merging methods into our meta search engine, Mearf, and carried out an experimental study using logs accumulated over a period of twelve months. Our experiments show that the choice of method for merging the output of different search engines plays a significant role in the overall quality of the search results. In almost all cases examined, results produced by some of the newly introduced methods were consistently better than those produced by traditional methods commonly used in various meta search engines. These observations suggest that the proposed methods offer a relatively inexpensive way of improving the meta search experience over existing methods.
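
To make the idea of merging by agreement concrete, here is a minimal Python sketch. It is not one of the methods introduced in the paper, which the abstract does not spell out. It is a generic reciprocal-rank sum, chosen only as an illustration: a URL gains score each time an engine returns it, and more so when it is ranked near the top. The function name `merge_by_agreement` and the sample URLs are hypothetical.

```python
from collections import defaultdict


def merge_by_agreement(result_lists):
    """Merge ranked URL lists from several engines into one ranking.

    Illustrative assumption, not the paper's method: each engine
    contributes 1/rank to a URL's score, so URLs that several engines
    agree on, and rank highly, rise to the top.
    """
    scores = defaultdict(float)
    for results in result_lists:
        for rank, url in enumerate(results, start=1):
            scores[url] += 1.0 / rank
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda url: (-scores[url], url))


if __name__ == "__main__":
    # Three hypothetical engines returning short ranked lists.
    engines = [["a", "b", "c"], ["b", "a", "d"], ["b", "e"]]
    print(merge_by_agreement(engines))
```

In this toy input, "b" is returned by all three engines and so ends up first, ahead of "a", which only two engines return. A content-based variant would additionally need the snippet text, which this sketch ignores.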

Published in
WWW '02: Proceedings of the 11th international conference on World Wide Web
May 2002
754 pages
ISBN: 1581134495
DOI: 10.1145/511446
Conference Chairs: David Lassner (University of Hawaii), Dave De Roure (University of Southampton), Arun Iyengar (IBM T.J. Watson Research Center)
Copyright © 2002 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
collection fusion
expert agreement
merging
meta search
reranking
Acceptance Rates
Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%