Skip to main content

2002 | OriginalPaper | Buchkapitel

Modeling and Imputation of Large Incomplete Multidimensional Datasets

verfasst von : Xintao Wu, Daniel Barbará

Erschienen in: Data Warehousing and Knowledge Discovery

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

The presence of missing or incomplete data is a commonplace in large real-word databases. In this paper, we study the problem of missing values which occur at the measure dimension of data cube. We propose a two-part mixture model, which combines the logistic model and loglinear model together, to predict and impute the missing values. The logistic model here is applied to predict missing positions while the loglinear model is applied to compute the estimation. Experimental results on real datasets and synthetic datasets are presented.

Metadaten
Titel
Modeling and Imputation of Large Incomplete Multidimensional Datasets
verfasst von
Xintao Wu
Daniel Barbará
Copyright-Jahr
2002
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-46145-0_28

Premium Partner