2008 | OriginalPaper | Buchkapitel
Missing Value Imputation Based on Data Clustering
verfasst von : Shichao Zhang, Jilian Zhang, Xiaofeng Zhu, Yongsong Qin, Chengqi Zhang
Erschienen in: Transactions on Computational Science I
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
We propose an efficient nonparametric missing value imputation method based on clustering, called CMI (Clustering-based Missing value Imputation), for dealing with missing values in target attributes. In our approach, we impute the missing values of an instance
A
with plausible values that are generated from the data in the instances which do not contain missing values and are most similar to the instance
A
using a kernel-based method. Specifically, we first divide the dataset (including the instances with missing values) into clusters. Next, missing values of an instance
A
are patched up with the plausible values generated from
A
’s cluster. Extensive experiments show the effectiveness of the proposed method in missing value imputation task.