Skip to main content

1993 | ReviewPaper | Buchkapitel

Efficient similarity search in sequence databases

verfasst von : Rakesh Agrawal, Christos Faloutsos, Arun Swami

Erschienen in: Foundations of Data Organization and Algorithms

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

We propose an indexing method for time sequences for processing similarity queries. We use the Discrete Fourier Transform (DFT) to map time sequences to the frequency domain, the crucial observation being that, for most sequences of practical interest, only the first few frequencies are strong. Another important observation is Parseval's theorem, which specifies that the Fourier transform preserves the Euclidean distance in the time or frequency domain. Having thus mapped sequences to a lower-dimensionality space by using only the first few Fourier coefficients, we use R*-trees to index the sequences and efficiently answer similarity queries. We provide experimental results which show that our method is superior to search based on sequential scanning. Our experiments show that a few coefficients (1–3) are adequate to provide good performance. The performance gain of our method increases with the number and length of sequences.

Metadaten
Titel
Efficient similarity search in sequence databases
verfasst von
Rakesh Agrawal
Christos Faloutsos
Arun Swami
Copyright-Jahr
1993
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-57301-1_5

Premium Partner