Skip to main content

2002 | OriginalPaper | Buchkapitel

The Accessibility Dimension for Structured Document Retrieval

verfasst von : Thomas Roelleke, Mounia Lalmas, Gabriella Kazai, Ian Ruthven, Stefan Quicker

Erschienen in: Advances in Information Retrieval

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Structured document retrieval aims at retrieving the document components that best satisfy a query, instead of merely retrieving pre-defined document units. This paper reports on an investigation of a tf -idf -acc approach, where tf and idf are the classical term frequency and inverse document frequency, and acc, a new parameter called accessibility, that captures the structure of documents. The tf -idf -acc approach is defined using a probabilistic relational algebra. To investigate the retrieval quality and estimate the acc values, we developed a method that automatically constructs diverse test collections of structured documents from a standard test collection, with which experiments were carried out. The analysis of the experiments provides estimates of the acc values.

Metadaten
Titel
The Accessibility Dimension for Structured Document Retrieval
verfasst von
Thomas Roelleke
Mounia Lalmas
Gabriella Kazai
Ian Ruthven
Stefan Quicker
Copyright-Jahr
2002
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-45886-7_19

Premium Partner