2012 | OriginalPaper | Buchkapitel
Improved Counter Based Algorithms for Frequent Pairs Mining in Transactional Data Streams
verfasst von : Konstantin Kutzkov
Erschienen in: Machine Learning and Knowledge Discovery in Databases
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
A straightforward approach to frequent pairs mining in transactional streams is to generate all pairs occurring in transactions and apply a frequent items mining algorithm to the resulting stream. The well-known counter based algorithms
Frequent
and
Space-Saving
are known to achieve a very good approximation when the frequencies of the items in the stream adhere to a skewed distribution.
Motivated by observations on real datasets, we present a general technique for applying
Frequent
and
Space-Saving
to transactional data streams for the case when the transactions considerably vary in their lengths. Despite of its simplicity, we show through extensive experiments that our approach is considerably more efficient and precise than the naïve application of
Frequent
and
Space-Saving
.