2014 | OriginalPaper | Chapter

Algorithmic-Type Imputation Techniques with Different Data Structures: Alternative Approaches in Comparison

Authors: Nadia Solaro, Alessandro Barbiero, Giancarlo Manzi, Pier Alda Ferrari

Published in: Analysis and Modeling of Complex Data in Behavioral and Social Sciences

Publisher: Springer International Publishing

Abstract

In recent years, with the widespread availability of large datasets from multiple sources, increasing attention has been devoted to the treatment of missing information. Recent approaches have paved the way for the development of powerful new algorithmic techniques in which imputation is performed through computer-intensive procedures. Although most of these approaches are attractive for many reasons, less attention has been paid to the question of which method should be preferred for the data structure at hand. This work addresses the problem by comparing two methods, missForest and IPCA, with a new method we developed within the forward imputation approach. We carried out the comparisons on different data patterns with varying skewness and correlation of variables, in order to ascertain in which situations a given method produces more satisfactory results.
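
The kind of comparison described in the abstract can be illustrated with a small simulation. The sketch below is not the chapter's experimental design and does not implement missForest, IPCA, or forward imputation. It uses two deliberately simple stand-in methods (mean imputation and single-predictor regression imputation) and varies only the correlation between two normal variables; skewness is not varied here. All function names, the sample size, the missingness rate, and the missing-completely-at-random mechanism are assumptions made for illustration.

```python
import math
import random


def simulate(n, rho, rng):
    """Draw n pairs (x, y) from a standard bivariate normal with correlation rho."""
    data = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        e = rng.gauss(0.0, 1.0)
        y = rho * x + math.sqrt(1.0 - rho ** 2) * e
        data.append((x, y))
    return data


def mask_y(data, frac, rng):
    """Choose the rows whose y value is treated as missing completely at random."""
    k = int(round(frac * len(data)))
    return set(rng.sample(range(len(data)), k))


def mean_impute(data, missing):
    """Stand-in method 1: replace each missing y with the mean of the observed y."""
    observed = [y for i, (_, y) in enumerate(data) if i not in missing]
    mean_y = sum(observed) / len(observed)
    return {i: mean_y for i in missing}


def regression_impute(data, missing):
    """Stand-in method 2: predict each missing y from x by least squares on complete rows."""
    obs = [(x, y) for i, (x, y) in enumerate(data) if i not in missing]
    mx = sum(x for x, _ in obs) / len(obs)
    my = sum(y for _, y in obs) / len(obs)
    sxx = sum((x - mx) ** 2 for x, _ in obs)
    sxy = sum((x - mx) * (y - my) for x, y in obs)
    slope = sxy / sxx
    return {i: my + slope * (data[i][0] - mx) for i in missing}


def rmse(data, imputed):
    """Root mean squared error between the imputed values and the true (hidden) y values."""
    total = sum((data[i][1] - v) ** 2 for i, v in imputed.items())
    return math.sqrt(total / len(imputed))


def compare(rho, n=2000, frac=0.2, seed=0):
    """Run both stand-in methods on one simulated dataset and report their RMSE."""
    rng = random.Random(seed)
    data = simulate(n, rho, rng)
    missing = mask_y(data, frac, rng)
    return {
        "mean": rmse(data, mean_impute(data, missing)),
        "regression": rmse(data, regression_impute(data, missing)),
    }


if __name__ == "__main__":
    for rho in (0.0, 0.5, 0.9):
        print(rho, compare(rho))
```

Under these assumptions, the gap between the two stand-in methods should widen as the correlation grows, which is the sort of dependence on data structure that the chapter sets out to study for the actual methods.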

Literature
Aluja-Banet, T., Daunis-i-Estadella, J., & Pellicer, D. (2007). GRAFT, a complete system for data fusion. Computational Statistics & Data Analysis, 52, 635–649.
Azzalini, A., & Capitanio, A. (1999). Statistical applications of the multivariate skew normal distribution. Journal of the Royal Statistical Society: Series B, 61(3), 579–602.
Ferrari, P. A., Annoni, P., Barbiero, A., & Manzi, G. (2011). An imputation method for categorical variables with application to nonlinear principal component analysis. Computational Statistics & Data Analysis, 55, 2410–2420.
Greenacre, M. (1984). Theory and applications of correspondence analysis. London: Academic.
Josse, J., Pagès, J., & Husson, F. (2011). Multiple imputation in principal component analysis. Advances in Data Analysis and Classification, 5, 231–246.
Little, R. J. A., & Rubin, D. B. (2002). Statistical analysis with missing data (2nd ed.). New York: Wiley.
Nora-Chouteau, C. (1974). Une méthode de reconstitution et d’analyse de données incomplètes. Ph.D. thesis, Université Pierre et Marie Curie.
R Development Core Team (2012). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.
Stekhoven, D. J., & Bühlmann, P. (2012). MissForest - non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118.
Metadata
DOI
https://doi.org/10.1007/978-3-319-06692-9_27
