2017 | OriginalPaper | Chapter

4. Measuring and Visualizing Associations

Authors: Jean-Michel Josselin, Benoît Le Maux

Published in: Statistical Tools for Program Evaluation

Publisher: Springer International Publishing

Abstract

One goal of statistical studies is to highlight associations between pairs of variables. This is particularly useful when one wants to get a clear picture of a multidimensional data set and to motivate a specific policy intervention (Sect. 4.1). Yet the choice of method is not straightforward. Testing for correlation is the relevant approach for investigating a linear association between two numerical variables (Sect. 4.2). The chi-square test is an inferential test that uses sample data to draw conclusions about the relationship between two categorical variables (Sect. 4.3). When one variable is numerical and the other is categorical, the usual approach is to test for differences between means or to carry out an analysis of variance (Sect. 4.4). With more than two variables, the problem can also be given a multidimensional representation using methods such as principal component analysis (Sect. 4.5) and multiple correspondence analysis (Sect. 4.6). The idea is to reduce the dimensionality of the data set by plotting all the observations on 2D graphs that show how they cluster with respect to various characteristics. The resulting groups can, for instance, serve to identify the beneficiaries of a particular intervention. Several examples using R-CRAN are included in this chapter to illustrate the different methods.
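
As a rough illustration of the first two tests named in the abstract, the sketch below computes a Pearson correlation coefficient with its t statistic (two numerical variables) and a Pearson chi-square statistic (two categorical variables summarised in a contingency table). The chapter itself works in R; Python is used here only for illustration. The function names and the small data sets are hypothetical and are not taken from the chapter.

```python
import math


def pearson_r(x, y):
    """Sample Pearson correlation coefficient between two numeric sequences."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("x and y must have the same length (at least 3)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_t_stat(r, n):
    """t statistic for H0 'no linear association', with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1 - r ** 2))


def chi_square_stat(table):
    """Pearson chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of the two variables.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat


# Made-up data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)
t = correlation_t_stat(r, len(x))

counts = [[10, 20], [20, 10]]  # hypothetical 2x2 contingency table
chi2 = chi_square_stat(counts)
```

Each statistic would then be compared with its reference distribution to obtain a p-value: a Student t distribution with n - 2 degrees of freedom for the correlation, and a chi-square distribution with (rows - 1) × (columns - 1) degrees of freedom for the contingency table.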

Metadata

Title: Measuring and Visualizing Associations
Authors: Jean-Michel Josselin, Benoît Le Maux
Copyright Year: 2017
DOI: https://doi.org/10.1007/978-3-319-52827-4_4