Skip to main content
Top

2016 | OriginalPaper | Chapter

10. Appendix

Authors : Tilo Wendler, Sören Gröttrup

Published in: Data Mining with SPSS Modeler

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

To show the functionalities of the IBM SPSS Modeler, different datasets are used in this book. The first step in data analytics is knowing the data, which involves familiarizing with the meaning of the different variables in the dataset. This chapter lists all datasets used in this book together with an explanation of their background as well as a description and the meaning of the different variables included.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Beer-Shop-Hamburg. (2014). Beer from all over the world. Accessed 26/08/2014, from http://www.biershop-hamburg.de/Biere-aus-aller-Welt-17 Beer-Shop-Hamburg. (2014). Beer from all over the world. Accessed 26/08/2014, from http://​www.​biershop-hamburg.​de/​Biere-aus-aller-Welt-17
go back to reference Breiman, L., & Friedman, J. H. (1985). Estimating optimal transformations for multiple regression and correlation. Journal of the American Statistical Association, 80(391), 580–598.CrossRefMATHMathSciNet Breiman, L., & Friedman, J. H. (1985). Estimating optimal transformations for multiple regression and correlation. Journal of the American Statistical Association, 80(391), 580–598.CrossRefMATHMathSciNet
go back to reference Bühl, A. (2012). SPSS 20: Einführung in die moderne Datenanalyse, Scientific tools (13th ed.). München: Pearson. Bühl, A. (2012). SPSS 20: Einführung in die moderne Datenanalyse, Scientific tools (13th ed.). München: Pearson.
go back to reference c’t Magazine for IT Technology. (2008). CPU-Wegweiser: x86-Prozessoren im Überblick, Vol. 2008 No. 7, pp. 178–182. c’t Magazine for IT Technology. (2008). CPU-Wegweiser: x86-Prozessoren im Überblick, Vol. 2008 No. 7, pp. 178–182.
go back to reference Fisher, R. A. (1936). The use of multiple measurement in taxonomic problems. Annals of Eugenics, 7(2), 179–188.CrossRef Fisher, R. A. (1936). The use of multiple measurement in taxonomic problems. Annals of Eugenics, 7(2), 179–188.CrossRef
go back to reference Futreal, P. A., Coin, L., Marshall, M., Down, T., Hubbard, T., Wooster, R., Rahman, N., & Stratton, M. R. (2004). A census of human cancer genes. Nature Reviews Cancer, 4(3), 177–183.CrossRef Futreal, P. A., Coin, L., Marshall, M., Down, T., Hubbard, T., Wooster, R., Rahman, N., & Stratton, M. R. (2004). A census of human cancer genes. Nature Reviews Cancer, 4(3), 177–183.CrossRef
go back to reference Gilley, O. W., & Pace, R. (1996). On the Harrison and Rubinfeld Data. Journal of Environmental Economics and Management, 31(3), 403–405.CrossRefMATH Gilley, O. W., & Pace, R. (1996). On the Harrison and Rubinfeld Data. Journal of Environmental Economics and Management, 31(3), 403–405.CrossRefMATH
go back to reference Haferlach, T., Kohlmann, A., Wieczorek, L., Basso, G., Kronnie, G. T., Béné, M.-C., de Vos, J., Hernández, J. M., Hofmann, W.-K., Mills, K. I., Gilkes, A., Chiaretti, S., Shurtleff, S. A., Kipps, T. J., Rassenti, L. Z., Yeoh, A. E., Papenhausen, P. R., Liu, W.-M., Williams, P. M., & Foà, R. (2010). Clinical utility of microarray-based gene expression profiling in the diagnosis and subclassification of leukemia: report from the International Microarray Innovations in Leukemia Study Group. Journal of Clinical Oncology Official Journal of the American Society of Clinical Oncology, 28(15), 2529–2537.CrossRef Haferlach, T., Kohlmann, A., Wieczorek, L., Basso, G., Kronnie, G. T., Béné, M.-C., de Vos, J., Hernández, J. M., Hofmann, W.-K., Mills, K. I., Gilkes, A., Chiaretti, S., Shurtleff, S. A., Kipps, T. J., Rassenti, L. Z., Yeoh, A. E., Papenhausen, P. R., Liu, W.-M., Williams, P. M., & Foà, R. (2010). Clinical utility of microarray-based gene expression profiling in the diagnosis and subclassification of leukemia: report from the International Microarray Innovations in Leukemia Study Group. Journal of Clinical Oncology Official Journal of the American Society of Clinical Oncology, 28(15), 2529–2537.CrossRef
go back to reference Handl, A. (2010). Multivariate Analysemethoden: Theorie und Praxis multivariater Verfahren unter besonderer Berücksichtigung von S-PLUS, Statistik und ihre Anwendungen (2nd ed.). Heidelberg: Springer.CrossRef Handl, A. (2010). Multivariate Analysemethoden: Theorie und Praxis multivariater Verfahren unter besonderer Berücksichtigung von S-PLUS, Statistik und ihre Anwendungen (2nd ed.). Heidelberg: Springer.CrossRef
go back to reference Harrison, D., & Rubinfeld, D. L. (1978). Hedonic housing prices and the demand for clean air. Journal of Environmental Economics and Management, 5(1), 81–102.CrossRefMATH Harrison, D., & Rubinfeld, D. L. (1978). Hedonic housing prices and the demand for clean air. Journal of Environmental Economics and Management, 5(1), 81–102.CrossRefMATH
go back to reference Hebestreit, K., Gröttrup, S., Emden, D., Veerkamp, J., Ruckert, C., Klein, H.-U., Müller-Tidow, C., Dugas, M., & Speletas, M. (2012). Leukemia Gene Atlas – A Public Platform for Integrative Exploration of Genome-Wide Molecular Data. PLoS One, 7(6), e39148.CrossRef Hebestreit, K., Gröttrup, S., Emden, D., Veerkamp, J., Ruckert, C., Klein, H.-U., Müller-Tidow, C., Dugas, M., & Speletas, M. (2012). Leukemia Gene Atlas – A Public Platform for Integrative Exploration of Genome-Wide Molecular Data. PLoS One, 7(6), e39148.CrossRef
go back to reference Heinrich, L. J. (2002a). Informationsmanagement: Planung, Überwachung und Steuerung der Informationsinfrastruktur, Wirtschaftsinformatik (7th ed.). München: Oldenbourg. Heinrich, L. J. (2002a). Informationsmanagement: Planung, Überwachung und Steuerung der Informationsinfrastruktur, Wirtschaftsinformatik (7th ed.). München: Oldenbourg.
go back to reference Heinrich, L. J. (2002b). Questionnaire for a success factor analysis in SME. Heinrich, L. J. (2002b). Questionnaire for a success factor analysis in SME.
go back to reference Henderson, H. V., & Velleman, P. F. (1981). Building multiple regression models interactively. Biometrics, 37, 391–411.CrossRefMATH Henderson, H. V., & Velleman, P. F. (1981). Building multiple regression models interactively. Biometrics, 37, 391–411.CrossRefMATH
go back to reference Hoffmann-Beverages. (2014). Beverage-details. Accessed 27/08/2014, from http://www.getraenke-hoffmann.de/download/durstexpress/DurstExpress_Katalog.pdf Hoffmann-Beverages. (2014). Beverage-details. Accessed 27/08/2014, from http://​www.​getraenke-hoffmann.​de/​download/​durstexpress/​DurstExpress_​Katalog.​pdf
go back to reference IBM. (2014a). SPSS Modeler 16 Applications Guide. Accessed 18/09/2015, from ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/16.0/en/modeler_applications_guide_book.pdf IBM. (2014a). SPSS Modeler 16 Applications Guide. Accessed 18/09/2015, from ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/16.0/en/modeler_applications_guide_book.pdf
go back to reference IBM. (2014b). SPSS Modeler 16 Source, Process, and Output Nodes. Accessed 18/09/2015, from ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/16.0/en/modeler_nodes_general.pdf IBM. (2014b). SPSS Modeler 16 Source, Process, and Output Nodes. Accessed 18/09/2015, from ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/16.0/en/modeler_nodes_general.pdf
go back to reference IBM. (2014c). Test scores dataset. Accessed 18/09/2015, from http://www-01.ibm.com/support/knowledgecenter/SSLVMB_22.0.0/com.ibm.spss.statistics.cs/components/glmm/glmm_testscores_intro.htm IBM. (2014c). Test scores dataset. Accessed 18/09/2015, from http://​www-01.​ibm.​com/​support/​knowledgecenter/​SSLVMB_​22.​0.​0/​com.​ibm.​spss.​statistics.​cs/​components/​glmm/​glmm_​testscores_​intro.​htm
go back to reference IBM. (2015). SPSS Modeler 17 Applications guide. ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/17.0/en/ModelerApplications.pdf IBM. (2015). SPSS Modeler 17 Applications guide. ftp://public.dhe.ibm.com/software/analytics/spss/documentation/modeler/17.0/en/ModelerApplications.pdf
go back to reference IBM Website. (2014). Customer segmentation analytics with IBM SPSS. Accessed 08/05/2015, from http://www.ibm.com/developerworks/library/ba-spss-pds-db2luw/index.html IBM Website. (2014). Customer segmentation analytics with IBM SPSS. Accessed 08/05/2015, from http://​www.​ibm.​com/​developerworks/​library/​ba-spss-pds-db2luw/​index.​html
go back to reference Journal of Statistical Education Data Archive. (2009). LPGA Performance Statistics for 2009. Accessed 12/06/2015, from http://www.stat.ufl.edu/~winner/data/lpga2009.dat Journal of Statistical Education Data Archive. (2009). LPGA Performance Statistics for 2009. Accessed 12/06/2015, from http://​www.​stat.​ufl.​edu/​~winner/​data/​lpga2009.​dat
go back to reference Longley, J. W. (1967). An appraisal of least squares programs for the electronic computer from the point of view of the user. Journal of the American Statistical Association, 62(319), 819–841.CrossRefMathSciNet Longley, J. W. (1967). An appraisal of least squares programs for the electronic computer from the point of view of the user. Journal of the American Statistical Association, 62(319), 819–841.CrossRefMathSciNet
go back to reference Lichman, M. (2013). UCI Machine learning repository. http://archive.ics.uci.edu/ml Lichman, M. (2013). UCI Machine learning repository. http://​archive.​ics.​uci.​edu/​ml
go back to reference Machine Learning Repository. (1990). Pima Indians Diabetes Data Set. Accessed 18/09/2015, from http://archive.ics.uci.edu/ml/datasets/Pima+Indians+Diabetes Machine Learning Repository. (1990). Pima Indians Diabetes Data Set. Accessed 18/09/2015, from http://​archive.​ics.​uci.​edu/​ml/​datasets/​Pima+Indians+Dia​betes
go back to reference Machine Learning Repository. (1991). Wine Data Set. Accessed 2015, from http://archive.ics.uci.edu/ml/datasets/Wine Machine Learning Repository. (1991). Wine Data Set. Accessed 2015, from http://​archive.​ics.​uci.​edu/​ml/​datasets/​Wine
go back to reference Machine Learning Repository. (1992). Breast Cancer Wisconsin (Original) Data Set. Accessed 29/10/2015, from https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Original%29 Machine Learning Repository. (1992). Breast Cancer Wisconsin (Original) Data Set. Accessed 29/10/2015, from https://​archive.​ics.​uci.​edu/​ml/​datasets/​Breast+Cancer+Wi​sconsin+%28Original%29
go back to reference Machine Learning Repository. (1993). Boston Housing Data Set. Accessed 12/06/2015, from https://archive.ics.uci.edu/ml/datasets/Housing Machine Learning Repository. (1993). Boston Housing Data Set. Accessed 12/06/2015, from https://​archive.​ics.​uci.​edu/​ml/​datasets/​Housing
go back to reference Machine Learning Repository. (1994). Chess Endgame Database for White King and Rook against Black King (KRK). Accessed 2015, from https://archive.ics.uci.edu/ml/datasets/Chess+(King-Rook+vs.+King) Machine Learning Repository. (1994). Chess Endgame Database for White King and Rook against Black King (KRK). Accessed 2015, from https://​archive.​ics.​uci.​edu/​ml/​datasets/​Chess+(King-Rook+vs.+King)
go back to reference Machine Learning Repository. (1998). Optical Recognition of Handwritten Digits. Accessed 2015, from https://archive.ics.uci.edu/ml/datasets/Optical+Recognition+of+Handwritten+Digits Machine Learning Repository. (1998). Optical Recognition of Handwritten Digits. Accessed 2015, from https://​archive.​ics.​uci.​edu/​ml/​datasets/​Optical+Recognit​ion+of+Handwritt​en+Digits
go back to reference McCullagh, P., & Nelder, J. A. (1983). Generalized linear models, Monographs on statistics and applied probability. London: Chapman and Hall.CrossRefMATH McCullagh, P., & Nelder, J. A. (1983). Generalized linear models, Monographs on statistics and applied probability. London: Chapman and Hall.CrossRefMATH
go back to reference National cancer Institute. (2013). What you need to know about leukemia, NIH publication, no. 13-3775. Revised September 2013, digital edition. National cancer Institute. (2013). What you need to know about leukemia, NIH publication, no. 13-3775. Revised September 2013, digital edition.
go back to reference Niedermeyer, E., Schomer, D. L., & Lopes da Silva, F. H. (2011). Niedermeyer’s electroencephalography: Basic principles, clinical applications, and related fields (6th ed.). Philadelphia: Wolters Kluwer/Lippincott Williams & Wilkins Health. Niedermeyer, E., Schomer, D. L., & Lopes da Silva, F. H. (2011). Niedermeyer’s electroencephalography: Basic principles, clinical applications, and related fields (6th ed.). Philadelphia: Wolters Kluwer/Lippincott Williams & Wilkins Health.
go back to reference NOMIS UK. (2014). Official Labour Market Statistics – Annual Survey of Hours and Earnings – Workplace Analysis. Accessed 18/09/2015, from http://nmtest.dur.ac.uk/ NOMIS UK. (2014). Official Labour Market Statistics – Annual Survey of Hours and Earnings – Workplace Analysis. Accessed 18/09/2015, from http://​nmtest.​dur.​ac.​uk/​
go back to reference O’Connor, C. M., & Adams, J. U. (2010). Essentials of cell biology. Cambridge, MA: NPG Education. O’Connor, C. M., & Adams, J. U. (2010). Essentials of cell biology. Cambridge, MA: NPG Education.
go back to reference OECD. (2012b). Programm for International Student Assessment (PISA) 2012. Accessed 02/03/2015, from http://pisa2012.acer.edu.au/downloads.php OECD. (2012b). Programm for International Student Assessment (PISA) 2012. Accessed 02/03/2015, from http://​pisa2012.​acer.​edu.​au/​downloads.​php
go back to reference Oh, S.-H., Lee, Y.-R., & Kim, H.-N. (2014). A novel EEG feature extraction method using Hjorth parameter. International Journal of Electronics and Electrical Engineering, 2(2), 106–110.CrossRef Oh, S.-H., Lee, Y.-R., & Kim, H.-N. (2014). A novel EEG feature extraction method using Hjorth parameter. International Journal of Electronics and Electrical Engineering, 2(2), 106–110.CrossRef
go back to reference Potthoff, R. F., & Roy, S. N. (1964). A generalized multivariate analysis of variance model useful especially for growth curve problems. Biometrika, 51, 313–326.CrossRefMATHMathSciNet Potthoff, R. F., & Roy, S. N. (1964). A generalized multivariate analysis of variance model useful especially for growth curve problems. Biometrika, 51, 313–326.CrossRefMATHMathSciNet
go back to reference Schulz, L. O., Bennett, P. H., Ravussin, E., Kidd, J. R., Kidd, K. K., Esparza, J., & Valencia, M. E. (2006). Effects of traditional and western environments on prevalence of type 2 diabetes in Pima Indians in Mexico and the U.S. Diabetes Care, 29(8), 1866–1871.CrossRef Schulz, L. O., Bennett, P. H., Ravussin, E., Kidd, J. R., Kidd, K. K., Esparza, J., & Valencia, M. E. (2006). Effects of traditional and western environments on prevalence of type 2 diabetes in Pima Indians in Mexico and the U.S. Diabetes Care, 29(8), 1866–1871.CrossRef
go back to reference Smith, J. W., Everhart, J. E., Dickson, W. C., Knowler, W. C., & Johannes, R. S. (1988). Using the ADAP learning algorithm to forecast the onset of diabetes mellitus. Proceedings of the Annual Symposium on Computer Application in Medical Care, 261–265. Smith, J. W., Everhart, J. E., Dickson, W. C., Knowler, W. C., & Johannes, R. S. (1988). Using the ADAP learning algorithm to forecast the onset of diabetes mellitus. Proceedings of the Annual Symposium on Computer Application in Medical Care, 261–265.
go back to reference Stacey, K., & Turner, R. (2015). Assessing mathematical literacy: The PISA experience. Stacey, K., & Turner, R. (2015). Assessing mathematical literacy: The PISA experience.
go back to reference UCI Machine Learning Repository. (1996). UCI Machine Learning Repository – Adult Data Set. Accessed 12/09/2015, from https://archive.ics.uci.edu/ml/datasets/Adult UCI Machine Learning Repository. (1996). UCI Machine Learning Repository – Adult Data Set. Accessed 12/09/2015, from https://​archive.​ics.​uci.​edu/​ml/​datasets/​Adult
go back to reference Vanderbilt University School of Medicine. (2004). Department of Biostatistics – Titanic Data. Accessed 12/09/2015, from http://biostat.mc.vanderbilt.edu/wiki/pub/Main/DataSets/titanic.html Vanderbilt University School of Medicine. (2004). Department of Biostatistics – Titanic Data. Accessed 12/09/2015, from http://​biostat.​mc.​vanderbilt.​edu/​wiki/​pub/​Main/​DataSets/​titanic.​html
go back to reference Wendler, T. (2004). Modellierung und Bewertung von IT-Kosten: Empirische Analyse mit Hilfe multivariater mathematischer Methoden, Wirtschaftsinformatik. Wiesbaden: Deutscher Universitäts-Verlag.CrossRef Wendler, T. (2004). Modellierung und Bewertung von IT-Kosten: Empirische Analyse mit Hilfe multivariater mathematischer Methoden, Wirtschaftsinformatik. Wiesbaden: Deutscher Universitäts-Verlag.CrossRef
go back to reference Wolberg, W. H. (2003). Wisconsin breast cancer data. Accessed 12/06/2015, from http://www.stat.yale.edu/~pollard/Courses/230.spring03/WBC/ Wolberg, W. H. (2003). Wisconsin breast cancer data. Accessed 12/06/2015, from http://​www.​stat.​yale.​edu/​~pollard/​Courses/​230.​spring03/​WBC/​
go back to reference Wolberg, W. H., & Mangasarian, O. L. (1990). Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc Natl Acad Sci USA, 87(23), 9193–9196. Wolberg, W. H., & Mangasarian, O. L. (1990). Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc Natl Acad Sci USA, 87(23), 9193–9196.
Metadata
Title
Appendix
Authors
Tilo Wendler
Sören Gröttrup
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-28709-6_10

Premium Partner