Skip to main content
Top
Published in:
Cover of the book

2016 | OriginalPaper | Chapter

1. An Introduction to Modern Statistical Methods in HCI

Authors : Judy Robertson, Maurits Kaptein

Published in: Modern Statistical Methods for HCI

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter explains why we think statistical methodology matters so much to the HCI community and why we should attempt to improve it. It introduces some flaws in the well-accepted methodology of Null Hypothesis Significance Testing and briefly introduces some alternatives. Throughout the book we aim to critically evaluate current practices in HCI and support a less rigid, procedural view of statistics in favour of “fair statistical communication”. Each chapter provides scholars and practitioners with the methods and tools to do so.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
While technically the often applied methods are an—arguably erroneous—hybrid between methods introduced by Neymann-Pearson and Fisher, we focus here on the common practice.
 
Literature
go back to reference American Psychological Association (2009) Publication Manual of the American Psychological Association, APA, 6th edn, p. 272 American Psychological Association (2009) Publication Manual of the American Psychological Association, APA, 6th edn, p. 272
go back to reference Baguley T (2012) Serious stats: a guide to advanced statistics for the behavioral sciences. Palgrave Macmillan, Basingstoke Baguley T (2012) Serious stats: a guide to advanced statistics for the behavioral sciences. Palgrave Macmillan, Basingstoke
go back to reference Barnard GA, Bayes T (1958) Studies in the history of probability and statistics: IX. In: Thomas Bayes’s essay towards solving a problem in the doctrine of chances. Biometrika 45(3/4):293–315 Barnard GA, Bayes T (1958) Studies in the history of probability and statistics: IX. In: Thomas Bayes’s essay towards solving a problem in the doctrine of chances. Biometrika 45(3/4):293–315
go back to reference Box JF (1987) Guinness, gosset, fisher, and small samples. Stat Sci 2(1):45–52 Box JF (1987) Guinness, gosset, fisher, and small samples. Stat Sci 2(1):45–52
go back to reference Cairns P (2007) HCI... not as it should be: inferential statistics in HCI research. In: Proceedings of HCI 2007. Lancaster, UK, pp 195–201 Cairns P (2007) HCI... not as it should be: inferential statistics in HCI research. In: Proceedings of HCI 2007. Lancaster, UK, pp 195–201
go back to reference Chan ISF, Zhang Z (1999) Test-based exact confidence intervals for the difference of two binomial proportions. Biometrics 55(4):1202–1209 Chan ISF, Zhang Z (1999) Test-based exact confidence intervals for the difference of two binomial proportions. Biometrics 55(4):1202–1209
go back to reference Cohen J (1992) Statistical power analysis. In: Current directions in psychological science, pp 98–101 Cohen J (1992) Statistical power analysis. In: Current directions in psychological science, pp 98–101
go back to reference Conley T, Hansen C, Mcculloch R, Rossi P (2008) A semi-parametric Bayesian approach to the instrumental variable problem. J Econom 144(1):276–305 Conley T, Hansen C, Mcculloch R, Rossi P (2008) A semi-parametric Bayesian approach to the instrumental variable problem. J Econom 144(1):276–305
go back to reference Dienes Z (2008) Understanding psychology as a science: an introduction to scientific and statistical inference, 1st edn. Palgrave Macmillan, Basingstoke, p 150 Dienes Z (2008) Understanding psychology as a science: an introduction to scientific and statistical inference, 1st edn. Palgrave Macmillan, Basingstoke, p 150
go back to reference Dunlop MD, Baillie M(2009) Paper rejected (p> 0.05): an introduction to the debate on appropriateness of null-hypothesis testing. Int J Mob Hum Comput Interact 1(3):86 Dunlop MD, Baillie M(2009) Paper rejected (p> 0.05): an introduction to the debate on appropriateness of null-hypothesis testing. Int J Mob Hum Comput Interact 1(3):86
go back to reference Fisher RA (1934) Statistical methods for research workers, 5th edn. Oliver and Boyd, EdinburghMATH Fisher RA (1934) Statistical methods for research workers, 5th edn. Oliver and Boyd, EdinburghMATH
go back to reference Fox RJ, Dimmic MW (2006) A two-sample Bayesian t-test for microarray data. BMC Bioinform 7:126 Fox RJ, Dimmic MW (2006) A two-sample Bayesian t-test for microarray data. BMC Bioinform 7:126
go back to reference Gelman A (2005) Analysis of variance: why it is more important than ever. Ann Stat 33(1):1–31 Gelman A (2005) Analysis of variance: why it is more important than ever. Ann Stat 33(1):1–31
go back to reference Gelman A, David W (2009) Of beauty, sex, and power: statistical challenges in estimating small effects. Am Sci 97:310–316CrossRef Gelman A, David W (2009) Of beauty, sex, and power: statistical challenges in estimating small effects. Am Sci 97:310–316CrossRef
go back to reference Gray WD, Salzman MC (1998) Damaged merchandise? a review of experiments that compare usability evaluation methods. Hum Comput Interact 13:203–261 Gray WD, Salzman MC (1998) Damaged merchandise? a review of experiments that compare usability evaluation methods. Hum Comput Interact 13:203–261
go back to reference Ioannidis’s JPA (2005) Why most published research findings are false. In: Jantsch W, Schaffler F (eds.) PLoS Med 2(8), e124 Ioannidis’s JPA (2005) Why most published research findings are false. In: Jantsch W, Schaffler F (eds.) PLoS Med 2(8), e124
go back to reference Kampenes V, Dyba T, Hannay J, Sjoberg D (2007) A systematic review of effect size in software engineering experiments. Inform Softw Technol 49(11–12):1073–1086 Kampenes V, Dyba T, Hannay J, Sjoberg D (2007) A systematic review of effect size in software engineering experiments. Inform Softw Technol 49(11–12):1073–1086
go back to reference Kaptein MC, Robertson J (2012) Rethinking statistical methods for HCI. In: Proceedings of the 2011 Annual Conference on Human Factors in Computing Systems, CHI 212. ACM Press, New York, New York, USA, pp 1105–1114 Kaptein MC, Robertson J (2012) Rethinking statistical methods for HCI. In: Proceedings of the 2011 Annual Conference on Human Factors in Computing Systems, CHI 212. ACM Press, New York, New York, USA, pp 1105–1114
go back to reference Kruschke JK (2010) What to believe: Bayesian methods for data analysis. Trends Cogn Sci 14(7):293–300 Kruschke JK (2010) What to believe: Bayesian methods for data analysis. Trends Cogn Sci 14(7):293–300
go back to reference Kruschke JK (2011a) Bayesian assessment of null values via parameter estimation and model comparison. Perspect Psychol Sci 6(3):299–312 Kruschke JK (2011a) Bayesian assessment of null values via parameter estimation and model comparison. Perspect Psychol Sci 6(3):299–312
go back to reference Kruschke JK (2011b) Doing bayesian data analysis: a tutorial with R and BUGS. Elsevier, BurlingtonMATH Kruschke JK (2011b) Doing bayesian data analysis: a tutorial with R and BUGS. Elsevier, BurlingtonMATH
go back to reference Maxwell SE (2004) The persistence of underpowered studies in psychological research: causes, consequences, and remedies. Psychol Methods 9(2):147 Maxwell SE (2004) The persistence of underpowered studies in psychological research: causes, consequences, and remedies. Psychol Methods 9(2):147
go back to reference Oakes M (1986) Statistical inference: a commentary for the social and behavioural sciences, p. 196. Wiley, New York Oakes M (1986) Statistical inference: a commentary for the social and behavioural sciences, p. 196. Wiley, New York
go back to reference Paternoster R, Brame R, Mazerolle P, Piquero A (1998) Using the correct statistical test for the equality of regression coefficients. Criminology 36(4):859–866 Paternoster R, Brame R, Mazerolle P, Piquero A (1998) Using the correct statistical test for the equality of regression coefficients. Criminology 36(4):859–866
go back to reference Pearson K, Fisher RA, Inman HF (1994) Karl Pearson and R.A. Fisher on statistical tests: a 1935 exchange from nature. Am Stat 48(1):2–11 Pearson K, Fisher RA, Inman HF (1994) Karl Pearson and R.A. Fisher on statistical tests: a 1935 exchange from nature. Am Stat 48(1):2–11
go back to reference Rosenthal R (1979) The file drawer problem and tolerance for null results. Psychol Bull 86(3):638 Rosenthal R (1979) The file drawer problem and tolerance for null results. Psychol Bull 86(3):638
go back to reference Rouder JN, Speckman PL, Sun D, Morey RD, Iverson G (2009) Bayesian t tests for accepting and rejecting the null hypothesis. Psychon Bull Rev 16(2):225–237 Rouder JN, Speckman PL, Sun D, Morey RD, Iverson G (2009) Bayesian t tests for accepting and rejecting the null hypothesis. Psychon Bull Rev 16(2):225–237
go back to reference Seaman, C. B. (1999). Qualitative methods in empirical studies of software engineering. IEEE Trans Softw Eng 25(4):557–572 Seaman, C. B. (1999). Qualitative methods in empirical studies of software engineering. IEEE Trans Softw Eng 25(4):557–572
go back to reference Simmons JP, Nelson LD, Simonsohn U (2011) False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychol Sci 22(11):1359–1366 Simmons JP, Nelson LD, Simonsohn U (2011) False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychol Sci 22(11):1359–1366
go back to reference Student (1908) The probable error of a mean. Biometrika 6:1–25 Student (1908) The probable error of a mean. Biometrika 6:1–25
go back to reference Wagenmakers E, Wetzels R, Borsboom D, Van der Maas HLJ (2011) Why psychologists must change the way they analyze their data: The case of psi: Comment on Bem. J Pers Soc Psychol 100:426–443 Wagenmakers E, Wetzels R, Borsboom D, Van der Maas HLJ (2011) Why psychologists must change the way they analyze their data: The case of psi: Comment on Bem. J Pers Soc Psychol 100:426–443
go back to reference Ziliak S, McCloskey D (2008) The cult of statistical significance: how the standard error costs us jobs, justice and lives. University of Michigan Press, Ann ArborMATH Ziliak S, McCloskey D (2008) The cult of statistical significance: how the standard error costs us jobs, justice and lives. University of Michigan Press, Ann ArborMATH
go back to reference Ziliak ST (2008) The Cult of statistical significance: how the standard error costs us jobs, justice, and lives. J Econ Lit 47(2):499–503 Ziliak ST (2008) The Cult of statistical significance: how the standard error costs us jobs, justice, and lives. J Econ Lit 47(2):499–503
go back to reference Ziliak ST, McCloskey DN (2004) Size matters: the standard error. J Socio-Econ 33(5):527–546 Ziliak ST, McCloskey DN (2004) Size matters: the standard error. J Socio-Econ 33(5):527–546
Metadata
Title
An Introduction to Modern Statistical Methods in HCI
Authors
Judy Robertson
Maurits Kaptein
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-26633-6_1