Skip to main content
Erschienen in: Advances in Data Analysis and Classification 3/2019

25.07.2018 | Regular Article

Regression trees for detecting preference patterns from rank data

verfasst von: Yu-Shan Shih, Kuang-Hsun Liu

Erschienen in: Advances in Data Analysis and Classification | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A regression tree method for analyzing rank data is proposed. A key ingredient of the methodology is to convert ranks into scores by paired comparison. We then utilize the GUIDE tree method on the score vectors to identify the preference patterns in the data. This method is exempt from selection bias and the simulation results show that it is good with respect to the selection of split variables and has a better prediction accuracy than the two other investigated methods in some cases. Furthermore, it is applicable to complex data which may contain incomplete ranks and missing covariate values. We demonstrate its usefulness in two real data studies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Pearson’s Chi-Square test of independence and its Bonferroni-adjusted p value were used in CHAID, a classical tree method (Kass 1980).
 
Literatur
Zurück zum Zitat Bradley RA, Terry ME (1952) Rank analysis of incomplete block designs. I. The method of paired comparisons. Biometrika 39:324–345MathSciNetMATH Bradley RA, Terry ME (1952) Rank analysis of incomplete block designs. I. The method of paired comparisons. Biometrika 39:324–345MathSciNetMATH
Zurück zum Zitat Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, BelmontMATH Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, BelmontMATH
Zurück zum Zitat Cheng W, Hühn J, Hüllermeier E (2009) Decision tree and instance-based learning for label ranking. In: International conference on machine learning, Montreal Cheng W, Hühn J, Hüllermeier E (2009) Decision tree and instance-based learning for label ranking. In: International conference on machine learning, Montreal
Zurück zum Zitat Critchlow DE (1985) Metric methods for analyzing partially ranked data. Springer, New YorkCrossRefMATH Critchlow DE (1985) Metric methods for analyzing partially ranked data. Springer, New YorkCrossRefMATH
Zurück zum Zitat D’Ambrosio A, Heiser WJ (2016) A recursive partitioning method for the prediction of preference rankings based upon Kemeny distances. Psychometrika 81:774–794MathSciNetCrossRefMATH D’Ambrosio A, Heiser WJ (2016) A recursive partitioning method for the prediction of preference rankings based upon Kemeny distances. Psychometrika 81:774–794MathSciNetCrossRefMATH
Zurück zum Zitat Davidson RR (1970) On extending the Bradley–Terry model to accommodate ties in paired comparison experiments. J Am Stat Assoc 65:317–328CrossRef Davidson RR (1970) On extending the Bradley–Terry model to accommodate ties in paired comparison experiments. J Am Stat Assoc 65:317–328CrossRef
Zurück zum Zitat De’ath G (2002) Multivariate regression trees: a new technique for modeling species-environment relationships. Ecology 83:1105–1117 De’ath G (2002) Multivariate regression trees: a new technique for modeling species-environment relationships. Ecology 83:1105–1117
Zurück zum Zitat Diaconis P (1988) Group representations in probability and statistics. Institute of Mathematical Statistics, HaywardMATH Diaconis P (1988) Group representations in probability and statistics. Institute of Mathematical Statistics, HaywardMATH
Zurück zum Zitat Emond EJ, Mason DW (2002) A new rank correlation coefficient with application to the consensus ranking problem. J Multi-Criteria Decis Anal 11:17–28CrossRefMATH Emond EJ, Mason DW (2002) A new rank correlation coefficient with application to the consensus ranking problem. J Multi-Criteria Decis Anal 11:17–28CrossRefMATH
Zurück zum Zitat Francis B, Dittrich R, Hatzinger R, Penn R (2002) Analysing partial ranks by using smoothed paired comparison methods: an investigation of value orientation in Europe. J R Stat Soc Ser C (Appl Stat) 51:319–336MathSciNetCrossRefMATH Francis B, Dittrich R, Hatzinger R, Penn R (2002) Analysing partial ranks by using smoothed paired comparison methods: an investigation of value orientation in Europe. J R Stat Soc Ser C (Appl Stat) 51:319–336MathSciNetCrossRefMATH
Zurück zum Zitat Francis B, Dittrich R, Hatzinger R, Humphreys L (2014) A mixture model for longitudinal partially ranked data. Commun Stat Theory Methods 43:722–734MathSciNetCrossRefMATH Francis B, Dittrich R, Hatzinger R, Humphreys L (2014) A mixture model for longitudinal partially ranked data. Commun Stat Theory Methods 43:722–734MathSciNetCrossRefMATH
Zurück zum Zitat Hatzinger R, Dittrich R (2012) prefmod: an R package for modeling preferences based on paired comparisons, rankings, or ratings. J Stat Softw 48:1–31CrossRef Hatzinger R, Dittrich R (2012) prefmod: an R package for modeling preferences based on paired comparisons, rankings, or ratings. J Stat Softw 48:1–31CrossRef
Zurück zum Zitat Hothorn T, Hornik K, Zeileis A (2006) Unbiased recursive partitioning: a conditional inference framework. J Comput Graph Stat 15:651–674MathSciNetCrossRef Hothorn T, Hornik K, Zeileis A (2006) Unbiased recursive partitioning: a conditional inference framework. J Comput Graph Stat 15:651–674MathSciNetCrossRef
Zurück zum Zitat Inglehart R (1977) The silent revolution: changing values and political styles among western publics. Princeton University Press, Princeton Inglehart R (1977) The silent revolution: changing values and political styles among western publics. Princeton University Press, Princeton
Zurück zum Zitat Kass GV (1980) An exploratory technique for investigating large quantities of categorical data. Appl Stat 29:119–127CrossRef Kass GV (1980) An exploratory technique for investigating large quantities of categorical data. Appl Stat 29:119–127CrossRef
Zurück zum Zitat Kemeny JG, Snell JL (eds) (1962) Preference rankings: an axiomatic approach. In: Mathematical models in the social sciences. The MIT press, Cambridge, pp 9–23 Kemeny JG, Snell JL (eds) (1962) Preference rankings: an axiomatic approach. In: Mathematical models in the social sciences. The MIT press, Cambridge, pp 9–23
Zurück zum Zitat Kung YH, Lin CT, Shih YS (2012) Split variable selection for tree modeling on rank data. Comput Stat Data Anal 56:2830–2836MathSciNetCrossRefMATH Kung YH, Lin CT, Shih YS (2012) Split variable selection for tree modeling on rank data. Comput Stat Data Anal 56:2830–2836MathSciNetCrossRefMATH
Zurück zum Zitat Liu KH, Shih YS (2016) Score-scale decision tree for paired comparison data. Statistica Sinica 26:429–444MathSciNetMATH Liu KH, Shih YS (2016) Score-scale decision tree for paired comparison data. Statistica Sinica 26:429–444MathSciNetMATH
Zurück zum Zitat Loh WY (2014) Fifty years of classification and regression trees (with discussion). Int Stat Rev 34:329–370CrossRefMATH Loh WY (2014) Fifty years of classification and regression trees (with discussion). Int Stat Rev 34:329–370CrossRefMATH
Zurück zum Zitat Marden JI (1995) Analyzing and modeling rank data. Chapman & Hall, LondonMATH Marden JI (1995) Analyzing and modeling rank data. Chapman & Hall, LondonMATH
Zurück zum Zitat Qinglong L (2015) StatMethRank: statistical methods for ranking data. R package version 1.3 Qinglong L (2015) StatMethRank: statistical methods for ranking data. R package version 1.3
Zurück zum Zitat Strobl C, Wickelmaier F, Zeileis A (2011) Accounting for individual differences in Bradley–Terry models by means of recursive partitioning. J Educ Behav Stat 36:135–153CrossRef Strobl C, Wickelmaier F, Zeileis A (2011) Accounting for individual differences in Bradley–Terry models by means of recursive partitioning. J Educ Behav Stat 36:135–153CrossRef
Zurück zum Zitat Vermunt J (2003) Multilevel latent class models. Sociol Methodol 33:213–239CrossRef Vermunt J (2003) Multilevel latent class models. Sociol Methodol 33:213–239CrossRef
Zurück zum Zitat Yandell BS (1997) Practical data analysis for designed experiments. Chapman & Hall, Boca RatonCrossRefMATH Yandell BS (1997) Practical data analysis for designed experiments. Chapman & Hall, Boca RatonCrossRefMATH
Zurück zum Zitat Yu PLH, Wan WM, Lee PH (2010) Decision tree modeling for ranking data. In: Fürnkranz J, Hüllermeier E (eds) Preference learning. Springer, New York, pp 83–106CrossRef Yu PLH, Wan WM, Lee PH (2010) Decision tree modeling for ranking data. In: Fürnkranz J, Hüllermeier E (eds) Preference learning. Springer, New York, pp 83–106CrossRef
Zurück zum Zitat Yu PLH, Lee PH, Cheung SF, Lau EYY, Mok DSY, Hui HC (2016) Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians. Comput Stat 31:799–827MathSciNetCrossRefMATH Yu PLH, Lee PH, Cheung SF, Lau EYY, Mok DSY, Hui HC (2016) Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians. Comput Stat 31:799–827MathSciNetCrossRefMATH
Metadaten
Titel
Regression trees for detecting preference patterns from rank data
verfasst von
Yu-Shan Shih
Kuang-Hsun Liu
Publikationsdatum
25.07.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
Advances in Data Analysis and Classification / Ausgabe 3/2019
Print ISSN: 1862-5347
Elektronische ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-018-0332-3

Weitere Artikel der Ausgabe 3/2019

Advances in Data Analysis and Classification 3/2019 Zur Ausgabe

Premium Partner