2006 | OriginalPaper | Buchkapitel
Creating Synthetic Temporal Document Collections for Web Archive Benchmarking
verfasst von : Kjetil Nørvåg, Albert Overskeid Nybø
Erschienen in: Advances in Web Intelligence and Data Mining
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
In research in web archives, large temporal document collections are necessary in order to be able to compare and evaluate new strategies and algorithms. Large temporal document collections are not easily available, and an alternative is to create synthetic document collections. In this paper we will describe how to generate synthetic temporal document collections, how this is realized in the
TDocGen
temporal document generator, and we will also present a study of the quality of the document collections created by TDocGen.