2002 | OriginalPaper | Book chapter

A Novel Web Text Mining Method Using the Discrete Cosine Transform

Written by: Laurence A. F. Park, Marimuthu Palaniswami, Kotagiri Ramamohanarao

Published in: Principles of Data Mining and Knowledge Discovery

Publisher: Springer Berlin Heidelberg

Fourier Domain Scoring (FDS) has been shown to give a 60% improvement in precision over existing vector space methods, but its index requires a large amount of storage. We propose a new Web text mining method that uses the discrete cosine transform (DCT) to extract useful information from text documents and to improve document ranking without storing excessive data. The new method preserves the performance of FDS: it gives a 40% improvement in precision over the established text mining methods while using only 20% of the storage space required by FDS.
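
The abstract does not say how the DCT is applied, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that each term in a document is represented by a positional signal (occurrence counts in equal-width bins across the document), that a DCT-II is taken of that signal, and that only the first few coefficients are stored. The function names, the bin count, and the `keep` parameter are all hypothetical and chosen for illustration.

```python
import math


def term_signal(tokens, term, num_bins=4):
    """Count occurrences of `term` in each of `num_bins` equal-width position bins.

    Assumption: a document is summarised per term by where the term occurs.
    """
    signal = [0.0] * num_bins
    n = len(tokens)
    for pos, tok in enumerate(tokens):
        if tok == term:
            signal[pos * num_bins // n] += 1.0
    return signal


def dct2(signal):
    """Unnormalised DCT-II: X[k] = sum_n x[n] * cos(pi/N * (n + 0.5) * k)."""
    size = len(signal)
    return [
        sum(x * math.cos(math.pi / size * (n + 0.5) * k) for n, x in enumerate(signal))
        for k in range(size)
    ]


def truncate(coeffs, keep):
    """Keep only the first `keep` coefficients (hypothetical storage-saving step)."""
    return coeffs[:keep]


# Toy document: "web" occurs at positions 0 and 4 of 8 tokens.
tokens = ["web", "a", "b", "c", "web", "d", "e", "f"]
signal = term_signal(tokens, "web", num_bins=4)
coeffs = dct2(signal)
stored = truncate(coeffs, keep=2)
print(signal, stored)
```

In this sketch the zeroth coefficient equals the total term count, while higher coefficients describe how the occurrences are spread through the document. How many coefficients to keep, and how they enter the ranking score, are not given in the abstract and are left open here.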

Metadata
Title
A Novel Web Text Mining Method Using the Discrete Cosine Transform
Written by
Laurence A. F. Park
Marimuthu Palaniswami
Kotagiri Ramamohanarao
Copyright year
2002
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-45681-3_32