2015 | OriginalPaper | Buchkapitel
Faster Exact Search Using Document Clustering
verfasst von : Jonathan Dimond, Peter Sanders
Erschienen in: String Processing and Information Retrieval
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
We show how full-text search based on inverted indices can be accelerated by clustering the documents without losing results (SeCluD –
Se
arch with
Clu
stered
D
ocuments). We develop a fast multilevel clustering algorithm that uses query cost of conjunctive queries as an objective function. Depending on the inputs we get up to four times faster than non-clustered search. The resulting clusters are also useful for data compression and for distributing the work over many machines.