DOI: 10.1145/383952
SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
ACM 2001 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
SIGIR01: 24th ACM/SIGIR International Conference on Research and Development in Information Retrieval, New Orleans, Louisiana, USA
ISBN:
978-1-58113-331-8
Published:
01 September 2001

Article
Applying summarization techniques for term selection in relevance feedback

Query-expansion is an effective Relevance Feedback technique for improving performance in Information Retrieval. In general query-expansion methods select terms from the complete contents of relevant documents. One problem with this approach is that ...

Article
Temporal summaries of news topics

We discuss technology to help a person monitor changes in news coverage over time. We define temporal summaries of news stories as extracting a single sentence from each event within a news topic, where the stories are presented one at a time and ...

Article
Generic text summarization using relevance measure and latent semantic analysis

In this paper, we propose two generic text summarization methods that create text summaries by ranking and extracting sentences from the original documents. The first method uses standard IR methods to rank sentence relevances, while the second method ...

Article
A new approach to unsupervised text summarization

The paper presents a novel approach to unsupervised text summarization. The novelty lies in exploiting the diversity of concepts in text for summarization, which has not received much attention in the summarization literature. A diversity-based approach ...

Article
Vector-space ranking with effective early termination

Considerable research effort has been invested in improving the effectiveness of information retrieval systems. Techniques such as relevance feedback, thesaural expansion, and pivoting all provide better quality responses to queries when tested in ...

Article
Static index pruning for information retrieval systems

We introduce static index pruning methods that significantly reduce the index size in information retrieval systems. We investigate uniform and term-based methods that each remove selected entries from the index and yet have only a minor effect on ...

Article
Using event segmentation to improve indexing of consumer photographs

Automatic albuming (the automatic organization of photographs, either as an end in itself or for use in other applications) is an application that promises to be of great assistance to photographers. Relatively sophisticated image content analysis ...

Article
Ranking retrieval systems without relevance judgments

The most prevalent experimental methodology for comparing the effectiveness of information retrieval systems requires a test collection, composed of a set of documents, a set of query topics, and a set of relevance judgments indicating which documents ...

Article
Evaluation by highly relevant documents

Given the size of the web, the search engine industry has argued that engines should be evaluated by their ability to retrieve highly relevant pages rather than all possible relevant pages. To explore the role highly relevant documents play in retrieval ...

Article
Meta-scoring: automatically evaluating term weighting schemes in IR without precision-recall

In this paper, we present a method that can automatically evaluate performance of different term weighting schemes in information retrieval without resorting to precision-recall based on human relevance judgments. Specifically, the problem is: given two ...

Article
Improving cross language retrieval with triangulated translation

Most approaches to cross language information retrieval assume that resources providing a direct translation between the query and document languages exist. This paper presents research examining the situation where such an assumption is false. Here, ...

Article
Improving query translation for cross-language information retrieval using statistical models

Dictionaries have often been used for query translation in cross-language information retrieval (CLIR). However, we are faced with the problem of translation ambiguity, i.e. multiple translations are stored in a dictionary for a word. In addition, a ...

Article
Evaluating a probabilistic model for cross-lingual information retrieval

This work proposes and evaluates a probabilistic cross-lingual retrieval system. The system uses a generative model to estimate the probability that a document in one language is relevant, given a query in another language. An important component of ...

Article
Document language models, query models, and risk minimization for information retrieval

We present a framework for information retrieval that combines document models and query models using a probabilistic ranking function based on Bayesian decision theory. The framework suggests an operational retrieval model that extends recent ...

Article
Relevance based language models

We explore the relation between classical probabilistic models of information retrieval and the emerging language modeling approaches. It has long been recognized that the primary obstacle to effective performance of classical models is the need to ...

Article
A statistical learning model of text classification for support vector machines

This paper develops a theoretical learning model of text classification for Support Vector Machines (SVMs). It connects the statistical properties of text-classification tasks with the generalization performance of a SVM in a quantitative way. Unlike ...

Article
A study of thresholding strategies for text categorization

Thresholding strategies in automated text categorization are an underexplored area of research. This paper presents an examination of the effect of thresholding strategies on the performance of a classifier under various conditions. Using k-Nearest ...

Article
On feature distributional clustering for text categorization

We describe a text categorization approach that is based on a combination of feature distributional clusters with a support vector machine (SVM) classifier. Our feature selection approach employs distributional clustering of words via the recently ...

Article
Iterative residual rescaling

We consider the problem of creating document representations in which inter-document similarity measurements correspond to semantic similarity. We first present a novel subspace-based framework for formalizing this task. Using this framework, we derive a ...

Article
Expressive retrieval from XML documents

The emergence of XML as a standard interchange format for structured documents/data has given rise to many XML query language proposals. However, some of these languages do not support information retrieval-style ranked queries based on textual ...

Article
XIRQL: a query language for information retrieval in XML documents

Based on the document-centric view of XML, we present the query language XIRQL. Current proposals for XML query languages lack most IR-related features, which are weighting and ranking, relevance-oriented search, datatypes with vague predicates, and ...

Article
Empirical investigations on query modification using abductive explanations

In this paper we report on a series of experiments designed to investigate query modification techniques motivated by the area of abductive reasoning. In particular we use the notion of abductive explanation, explanations being a description of data ...

Article
Generic summaries for indexing in information retrieval

This paper examines the use of generic summaries for indexing in information retrieval. Our main observations are that: (1) With or without pseudo-relevance feedback, a summary index may be as effective as the corresponding fulltext index for precision-...

Article
Automatic generation of concise summaries of spoken dialogues in unrestricted domains

Automatic summarization of open domain spoken dialogues is a new research area. This paper introduces the task, the challenges involved, and presents an approach to obtain automatic extract summaries for multi-party dialogues of four different genres, ...

Article
Enhanced topic distillation using text, markup tags, and hyperlinks

Topic distillation is the analysis of hyperlink graph structure to identify mutually reinforcing authorities (popular pages) and hubs (comprehensive lists of links to authorities). Topic distillation is becoming common in Web search engines, but the ...

Article
Transparent Queries: investigating users' mental models of search engines

Typically, commercial Web search engines provide very little feedback to the user concerning how a particular query is processed and interpreted. Specifically, they apply key query transformations without the user's knowledge. Although these ...

Article
Why batch and user evaluations do not give the same results

Much system-oriented evaluation of information retrieval systems has used the Cranfield approach based upon queries run against test collections in a batch mode. Some researchers have questioned whether this approach can be applied to the real world, ...

Article
Evaluating a content based image retrieval system

Content Based Image Retrieval (CBIR) presents special challenges in terms of how image data is indexed, accessed, and how end systems are evaluated. This paper discusses the design of a CBIR system that uses global colour as the primary indexing key, ...

Article
Evaluating topic-driven web crawlers

Due to limited bandwidth, storage, and computational resources, and to the dynamic nature of the Web, search engines cannot index every Web page, and even the covered portion of the Web cannot be monitored continuously for changes. Therefore it is ...

Contributors
  • Colorado Technical University
  • University of Massachusetts Amherst
  • Robert Gordon University
  • University of Melbourne

Acceptance Rates

SIGIR '01 Paper Acceptance Rate: 47 of 201 submissions, 23%
Overall Acceptance Rate: 792 of 3,983 submissions, 20%

Year        Submitted  Accepted  Rate
SIGIR '19         426        84   20%
SIGIR '18         409        86   21%
SIGIR '17         362        78   22%
SIGIR '16         341        62   18%
SIGIR '15         351        70   20%
SIGIR '14         387        82   21%
SIGIR '13         366        73   20%
SIGIR '10         520        87   17%
SIGIR '03         266        46   17%
SIGIR '02         219        44   20%
SIGIR '01         201        47   23%
SIGIR '99         135        33   24%
Overall         3,983       792   20%