Skip to main content
Erschienen in:

07.10.2020

A Unified Treatment of Agreement Coefficients and their Asymptotic Results: the Formula of the Weighted Mean of Weighted Ratios

verfasst von: Haruhiko Ogasawara

Erschienen in: Journal of Classification | Ausgabe 2/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Eine einheitliche Behandlung der Übereinkommenskoeffizienten für mehrere Ratingagenturen wird gezeigt, wobei die zufallserwarteten Anteile des Bennett et al.-Typs, des Scott-Typs, seiner neuen Variation und des Cohen-Typs mittels vollständiger oder untergeordneter Übereinstimmung zwischen Ratingagenturen behandelt werden. Wenn nur paarweise Übereinstimmung für mehrere Ratingagenturen verwendet wird, werden auch Zufallskorrekturen des Gwet-Typs und seiner neuen Variation in Betracht gezogen. Für die einheitliche Behandlung werden Congers zwei Formeln des Mittelwerts und des Mittelwerts der Raten zu einer übergeordneten Formel des gewichteten Mittelwerts der gewichteten Raten kombiniert. Die entsprechenden einheitlichen Ausdrücke ihrer neuen asymptotischen Ergebnisse werden präsentiert.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Agresti, A. (2013). Categorical data analysis (3rd ed.). Hoboken: Wiley.MATH Agresti, A. (2013). Categorical data analysis (3rd ed.). Hoboken: Wiley.MATH
Zurück zum Zitat Bennett, E. M., Alpert, R., & Goldstein, A. C. (1954). Communications through limited-response questioning. The Public Opinion Quarterly, 18, 303–308.CrossRef Bennett, E. M., Alpert, R., & Goldstein, A. C. (1954). Communications through limited-response questioning. The Public Opinion Quarterly, 18, 303–308.CrossRef
Zurück zum Zitat Berry, K. J., Johnston, J. E., & Mielke Jr., P. W. (2005). Exact and resampling probability values for weighted kappa. Psychological Reports, 96, 243–252.CrossRef Berry, K. J., Johnston, J. E., & Mielke Jr., P. W. (2005). Exact and resampling probability values for weighted kappa. Psychological Reports, 96, 243–252.CrossRef
Zurück zum Zitat Berry, K. J., Johnston, J. E., & Mielke Jr., P. W. (2018). The measurement of association: A permutation statistical approach. Cham: Springer Nature Switzerland AG.MATHCrossRef Berry, K. J., Johnston, J. E., & Mielke Jr., P. W. (2018). The measurement of association: A permutation statistical approach. Cham: Springer Nature Switzerland AG.MATHCrossRef
Zurück zum Zitat Brennan, R. L., & Prediger, D. (1981). Coefficient kappa: Some uses, misuses, and alternatives. Educational and Psychological Measurement, 41, 687–699.CrossRef Brennan, R. L., & Prediger, D. (1981). Coefficient kappa: Some uses, misuses, and alternatives. Educational and Psychological Measurement, 41, 687–699.CrossRef
Zurück zum Zitat Brown, L. D., Cai, T. T., & DasGupta, A. (2002). Confidence interval for a binomial proportion and asymptotic expansions. The Annals of Statistics, 30, 160–201.MathSciNetMATH Brown, L. D., Cai, T. T., & DasGupta, A. (2002). Confidence interval for a binomial proportion and asymptotic expansions. The Annals of Statistics, 30, 160–201.MathSciNetMATH
Zurück zum Zitat Cicchetti, D. V., & Allison, T. (1971). A new procedure for assessing reliability of scoring EEG sleep recordings. American Journal of EEG Technology, 11, 101–109.CrossRef Cicchetti, D. V., & Allison, T. (1971). A new procedure for assessing reliability of scoring EEG sleep recordings. American Journal of EEG Technology, 11, 101–109.CrossRef
Zurück zum Zitat Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37–46.CrossRef Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37–46.CrossRef
Zurück zum Zitat Cohen, J. (1968). Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213–220.CrossRef Cohen, J. (1968). Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213–220.CrossRef
Zurück zum Zitat Conger, A. J. (1980). Integration and generalization of kappas for multiple raters. Psychological Bulletin, 88, 322–328.CrossRef Conger, A. J. (1980). Integration and generalization of kappas for multiple raters. Psychological Bulletin, 88, 322–328.CrossRef
Zurück zum Zitat Donner, A., & Eliasziw, M. (1992). A goodness-of-fit approach to inference procedures for the kappa statistic: confidence interval construction, significance testing and sample size estimation. Statistics in Medicine, 11, 1511–1519.CrossRef Donner, A., & Eliasziw, M. (1992). A goodness-of-fit approach to inference procedures for the kappa statistic: confidence interval construction, significance testing and sample size estimation. Statistics in Medicine, 11, 1511–1519.CrossRef
Zurück zum Zitat Everitt, B. S. (1968). Moments of the statistics kappa and weighted kappa. The British Journal of Mathematical and Statistical Psychology, 21, 97–103.CrossRef Everitt, B. S. (1968). Moments of the statistics kappa and weighted kappa. The British Journal of Mathematical and Statistical Psychology, 21, 97–103.CrossRef
Zurück zum Zitat Fisher, R. A. (1956). Statistical methods and scientific inference. Edinburgh: Oliver and Boyd.MATH Fisher, R. A. (1956). Statistical methods and scientific inference. Edinburgh: Oliver and Boyd.MATH
Zurück zum Zitat Fisher, R. A. (1970). Statistical methods for research workers (14th ed.). Edinburgh: Oliver & Boyd.MATH Fisher, R. A. (1970). Statistical methods for research workers (14th ed.). Edinburgh: Oliver & Boyd.MATH
Zurück zum Zitat Flack, V. F. (1987). Confidence intervals for the interrater agreement measure kappa. Communications in Statistics – Theory and Methods, 16, 953–968.MATHCrossRef Flack, V. F. (1987). Confidence intervals for the interrater agreement measure kappa. Communications in Statistics – Theory and Methods, 16, 953–968.MATHCrossRef
Zurück zum Zitat Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378–382.CrossRef Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378–382.CrossRef
Zurück zum Zitat Fleiss, J. L., & Cicchetti, D. V. (1978). Inference about weighted kappa in the non-null case. Applied Psychological Measurement, 2, 113–117.CrossRef Fleiss, J. L., & Cicchetti, D. V. (1978). Inference about weighted kappa in the non-null case. Applied Psychological Measurement, 2, 113–117.CrossRef
Zurück zum Zitat Fleiss, J. L., & Cohen, J. (1973). The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educational and Psychological Measurement, 33, 613–619.CrossRef Fleiss, J. L., & Cohen, J. (1973). The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educational and Psychological Measurement, 33, 613–619.CrossRef
Zurück zum Zitat Fleiss, J. L., Cohen, J., & Everitt, B. S. (1969). Large sample standard errors of kappa and weighted kappa. Psychological Bulletin, 72, 323–327.CrossRef Fleiss, J. L., Cohen, J., & Everitt, B. S. (1969). Large sample standard errors of kappa and weighted kappa. Psychological Bulletin, 72, 323–327.CrossRef
Zurück zum Zitat Fleiss, J. L., & Davies, M. (1982). Jackknifing functions of multinomial frequencies, with an application to a measure of concordance. American Journal of Epidemiology, 115, 841–845.CrossRef Fleiss, J. L., & Davies, M. (1982). Jackknifing functions of multinomial frequencies, with an application to a measure of concordance. American Journal of Epidemiology, 115, 841–845.CrossRef
Zurück zum Zitat Fleiss, J. L., Levin, B., & Paik, M. C. (2003). Statistical methods for rates and proportions (3rd ed.). Hoboken: Wiley.MATHCrossRef Fleiss, J. L., Levin, B., & Paik, M. C. (2003). Statistical methods for rates and proportions (3rd ed.). Hoboken: Wiley.MATHCrossRef
Zurück zum Zitat Fleiss, J. L., Nee, J. C. M., & Landis, J. R. (1979). Large sample variance of kappa in the case of different sets of raters. Psychological Bulletin, 86, 974–977.CrossRef Fleiss, J. L., Nee, J. C. M., & Landis, J. R. (1979). Large sample variance of kappa in the case of different sets of raters. Psychological Bulletin, 86, 974–977.CrossRef
Zurück zum Zitat Grizzle, J. E., Starmer, C. F., & Koch, G. G. (1969). Analysis of categorical data by linear models. Biometrics, 25, 489–504.MathSciNetMATHCrossRef Grizzle, J. E., Starmer, C. F., & Koch, G. G. (1969). Analysis of categorical data by linear models. Biometrics, 25, 489–504.MathSciNetMATHCrossRef
Zurück zum Zitat Gwet, K. L. (2002). Inter-rater reliability: dependency on trait prevalence and marginal homogeneity. In Statistical Methods for Inter-Rater Reliability Assessment No. 2. Gwet, K. L. (2002). Inter-rater reliability: dependency on trait prevalence and marginal homogeneity. In Statistical Methods for Inter-Rater Reliability Assessment No. 2.
Zurück zum Zitat Gwet, K. L. (2008a). Computing inter-rater reliability and its variance in the presence of high agreement. British Journal of Mathematical and Statistical Psychology, 61, 29–48.MathSciNetCrossRef Gwet, K. L. (2008a). Computing inter-rater reliability and its variance in the presence of high agreement. British Journal of Mathematical and Statistical Psychology, 61, 29–48.MathSciNetCrossRef
Zurück zum Zitat Gwet, K. L. (2008b). Variance estimation of nominal-scale inter-rater reliability with random selection of raters. Psychometrika, 73, 407–430.MathSciNetMATHCrossRef Gwet, K. L. (2008b). Variance estimation of nominal-scale inter-rater reliability with random selection of raters. Psychometrika, 73, 407–430.MathSciNetMATHCrossRef
Zurück zum Zitat Gwet, K. L. (2014). Handbook of inter-rater reliability: the definitive guide to measuring the extent of agreement among raters (4th ed.). Gaithersburg: Advanced Analytics. Gwet, K. L. (2014). Handbook of inter-rater reliability: the definitive guide to measuring the extent of agreement among raters (4th ed.). Gaithersburg: Advanced Analytics.
Zurück zum Zitat Hale, C. A., & Fleiss, J. L. (1993). Interval estimation under two study designs for kappa with binary classifications. Biometrics, 49, 523–533.CrossRef Hale, C. A., & Fleiss, J. L. (1993). Interval estimation under two study designs for kappa with binary classifications. Biometrics, 49, 523–533.CrossRef
Zurück zum Zitat Hall, P. (1982). Improving the normal approximation when constructing one-sided confidence intervals for binomial or Poisson parameters. Biometrika, 69, 647–652.MathSciNetMATHCrossRef Hall, P. (1982). Improving the normal approximation when constructing one-sided confidence intervals for binomial or Poisson parameters. Biometrika, 69, 647–652.MathSciNetMATHCrossRef
Zurück zum Zitat Hall, P. (1992). The bootstrap and Edgeworth expansion. New York: Springer. Corrected printing, 1997. Hall, P. (1992). The bootstrap and Edgeworth expansion. New York: Springer. Corrected printing, 1997.
Zurück zum Zitat Hallgren, K. A. (2012). Computing inter-rater reliability for observational data: an overview and tutorial. Tutorials in Quantitative Methods for Psychology, 8, 23–34.CrossRef Hallgren, K. A. (2012). Computing inter-rater reliability for observational data: an overview and tutorial. Tutorials in Quantitative Methods for Psychology, 8, 23–34.CrossRef
Zurück zum Zitat Holley, W., & Guilford, J. P. (1964). A note on the G-index of agreement. Educational and Psychological Measurement, 24, 749–753.CrossRef Holley, W., & Guilford, J. P. (1964). A note on the G-index of agreement. Educational and Psychological Measurement, 24, 749–753.CrossRef
Zurück zum Zitat Hsu, L. M., & Field, R. (2003). Interrater agreement measures: Comments on Kappan, Cohen’s kappa, Scott’s π and Aickin’s α. Understanding Statistics, 2, 205–219.CrossRef Hsu, L. M., & Field, R. (2003). Interrater agreement measures: Comments on Kappan, Cohen’s kappa, Scott’s π and Aickin’s α. Understanding Statistics, 2, 205–219.CrossRef
Zurück zum Zitat Hubert, L. (1977). Kappa revisited. Psychological Bulletin, 84, 289–297.CrossRef Hubert, L. (1977). Kappa revisited. Psychological Bulletin, 84, 289–297.CrossRef
Zurück zum Zitat Janson, S., & Vegelius, J. (1979). On generalizations of the G index and the phi coefficient to nominal scales. Multivariate Behavioral Research, 14, 255–269.CrossRef Janson, S., & Vegelius, J. (1979). On generalizations of the G index and the phi coefficient to nominal scales. Multivariate Behavioral Research, 14, 255–269.CrossRef
Zurück zum Zitat Koch, G. G., Landis, J. R., Freeman, J. L., Freeman Jr., D. H., & Lehnen, R. G. (1977). A general methodology for the analysis of experiments with repeated measurement of categorical data. Biometrics, 33, 133–158.MATHCrossRef Koch, G. G., Landis, J. R., Freeman, J. L., Freeman Jr., D. H., & Lehnen, R. G. (1977). A general methodology for the analysis of experiments with repeated measurement of categorical data. Biometrics, 33, 133–158.MATHCrossRef
Zurück zum Zitat Kraemer, H. C., Periyakoil, V. S., & Noda, A. (2002). Kappa coefficients in medical research. Statistics in Medicine, 21, 2109–2129.CrossRef Kraemer, H. C., Periyakoil, V. S., & Noda, A. (2002). Kappa coefficients in medical research. Statistics in Medicine, 21, 2109–2129.CrossRef
Zurück zum Zitat Landis, J. R., & Koch, G. G. (1977a). The measurement of observer agreement for categorical data. Biometrics, 33, 159–174.MATHCrossRef Landis, J. R., & Koch, G. G. (1977a). The measurement of observer agreement for categorical data. Biometrics, 33, 159–174.MATHCrossRef
Zurück zum Zitat Landis, J. R., & Koch, G. G. (1977b). An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics, 33, 363–374.MATHCrossRef Landis, J. R., & Koch, G. G. (1977b). An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics, 33, 363–374.MATHCrossRef
Zurück zum Zitat Light, R. J. (1971). Measures of response agreement for qualitative data: some generalizations and alternatives. Psychological Bulletin, 76, 365–377.CrossRef Light, R. J. (1971). Measures of response agreement for qualitative data: some generalizations and alternatives. Psychological Bulletin, 76, 365–377.CrossRef
Zurück zum Zitat Lipsitz, S. R., Laird, N. M., & Brennan, T. A. (1994). Simple moment estimates of the k-coefficient and its variance. Applied Statistics, 43, 309–323.MathSciNetMATHCrossRef Lipsitz, S. R., Laird, N. M., & Brennan, T. A. (1994). Simple moment estimates of the k-coefficient and its variance. Applied Statistics, 43, 309–323.MathSciNetMATHCrossRef
Zurück zum Zitat Maclure, M., & Willett, W. C. (1987). Misinterpretation and misuse of the kappa statistic. American Journal of Epidemiology, 126, 161–169.CrossRef Maclure, M., & Willett, W. C. (1987). Misinterpretation and misuse of the kappa statistic. American Journal of Epidemiology, 126, 161–169.CrossRef
Zurück zum Zitat Maxwell, A. E. (1977). Comparing the classification of subjects by two independent judges. British Journal of Psychiatry, 116, 651–655.CrossRef Maxwell, A. E. (1977). Comparing the classification of subjects by two independent judges. British Journal of Psychiatry, 116, 651–655.CrossRef
Zurück zum Zitat McGraw, K. O., & Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30–46 (Correction to McGraw and Wong (1996). 1, 390.).CrossRef McGraw, K. O., & Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30–46 (Correction to McGraw and Wong (1996). 1, 390.).CrossRef
Zurück zum Zitat Mielke Jr., P. W., Berry, K. J., & Johnston, J. E. (2007). The exact variance of weighted kappa with multiple raters. Psychological Reports, 101, 655–660.CrossRef Mielke Jr., P. W., Berry, K. J., & Johnston, J. E. (2007). The exact variance of weighted kappa with multiple raters. Psychological Reports, 101, 655–660.CrossRef
Zurück zum Zitat Ogasawara, H. (2010). Asymptotic expansions for the pivots using log-likelihood derivatives with an application in item response theory. Journal of Multivariate Analysis, 101, 2149–2167.MathSciNetMATHCrossRef Ogasawara, H. (2010). Asymptotic expansions for the pivots using log-likelihood derivatives with an application in item response theory. Journal of Multivariate Analysis, 101, 2149–2167.MathSciNetMATHCrossRef
Zurück zum Zitat Ogasawara, H. (2012). Cornish-Fisher expansions using sample cumulants and monotonic transformations. Journal of Multivariate Analysis, 103, 1–18.MathSciNetMATHCrossRef Ogasawara, H. (2012). Cornish-Fisher expansions using sample cumulants and monotonic transformations. Journal of Multivariate Analysis, 103, 1–18.MathSciNetMATHCrossRef
Zurück zum Zitat Ogasawara, H. (2020a). Asymptotic cumulants of the minimum phi-divergence estimator for categorical data under possible model misspecification. Communications in Statistics – Theory and Methods, 49, 2448–2465.MathSciNetCrossRef Ogasawara, H. (2020a). Asymptotic cumulants of the minimum phi-divergence estimator for categorical data under possible model misspecification. Communications in Statistics – Theory and Methods, 49, 2448–2465.MathSciNetCrossRef
Zurück zum Zitat Scott, W. A. (1955). Reliability of content analysis: the case of nominal scale coding. The Public Opinion Quarterly, 19, 321–325.CrossRef Scott, W. A. (1955). Reliability of content analysis: the case of nominal scale coding. The Public Opinion Quarterly, 19, 321–325.CrossRef
Zurück zum Zitat Shrout, P. E., & Fleiss, J. L. (1979). Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin, 86, 420–428.CrossRef Shrout, P. E., & Fleiss, J. L. (1979). Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin, 86, 420–428.CrossRef
Zurück zum Zitat Spitzer, R. L., Cohen, J., Fleiss, J. L., & Endicott, J. (1967). Quantification of agreement in psychiatric diagnosis: a new approach. Archives of General Psychiatry, 17, 83–87.CrossRef Spitzer, R. L., Cohen, J., Fleiss, J. L., & Endicott, J. (1967). Quantification of agreement in psychiatric diagnosis: a new approach. Archives of General Psychiatry, 17, 83–87.CrossRef
Zurück zum Zitat Stuart, A., & Ord, J. K. (1994). Kendall’s advanced theory of statistics: distribution theory (Vol. 1, 6th ed.). London: Arnold. Stuart, A., & Ord, J. K. (1994). Kendall’s advanced theory of statistics: distribution theory (Vol. 1, 6th ed.). London: Arnold.
Zurück zum Zitat Zhao, X., Liu, J. S., & Deng, K. (2013). Assumptions behind intercoder reliability indices. In C. T. Salmon (Ed.), Communication yearbook 36 (pp. 419–480). New York: Routledge. Zhao, X., Liu, J. S., & Deng, K. (2013). Assumptions behind intercoder reliability indices. In C. T. Salmon (Ed.), Communication yearbook 36 (pp. 419–480). New York: Routledge.
Metadaten
Titel
A Unified Treatment of Agreement Coefficients and their Asymptotic Results: the Formula of the Weighted Mean of Weighted Ratios
verfasst von
Haruhiko Ogasawara
Publikationsdatum
07.10.2020
Verlag
Springer US
Erschienen in
Journal of Classification / Ausgabe 2/2021
Print ISSN: 0176-4268
Elektronische ISSN: 1432-1343
DOI
https://doi.org/10.1007/s00357-020-09366-1