2011 | OriginalPaper | Buchkapitel
The Algorithm APT to Classify in Concurrence of Latency and Drift
verfasst von : Georg Krempl
Erschienen in: Advances in Intelligent Data Analysis X
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
Population drift is a challenging problem in classification, and denotes changes in probability distributions over time. Known drift-adaptive classification methods such as incremental learning rely on current, labelled data for classification model updates, assuming that such labelled data are available without verification latency. However, verification latency is a relevant problem in some application domains, where predictions have to be made far into the future. This concurrence of drift and latency requires new approaches in machine learning. We propose a two-stage learning strategy: First, the nature of drift in temporal data needs to be identified. This requires the formulation of explicit drift models for the underlying data generating process. In a second step, these models are used to substitute scarce labelled data for updating classification models.
This paper contributes an explicit drift model, which is characterising a mixture of independently evolving sub-populations. In this model, the joint distribution is a mixture of arbitrarily distributed sub-populations drifting over time. An
a
rbitrary sub-
p
opulation
t
racker algorithm is presented, which can track and predict the distributions by the use of
unlabelled
data. Experimental evaluation shows that the presented
APT
algorithm is capable of tracking and predicting changes in the posterior distribution of class labels accurately.