Skip to main content

1985 | Buch

Metric Methods for Analyzing Partially Ranked Data

verfasst von: Douglas E. Critchlow

Verlag: Springer New York

Buchreihe : Lecture Notes in Statistics

insite
SUCHEN

Über dieses Buch

A full ranking of n items is simply an ordering of all these items, of the form: first choice, second choice, •. . , n-th choice. If two judges each rank the same n items, statisticians have used various metrics to measure the closeness of the two rankings, including Ken­ dall's tau, Spearman's rho, Spearman's footrule, Ulam's metric, Hal1l11ing distance, and Cayley distance. These metrics have been em­ ployed in many contexts, in many applied statistical and scientific problems. Thi s monograph presents genera 1 methods for extendi ng these metri cs to partially ranked data. Here "partially ranked data" refers, for instance, to the situation in which there are n distinct items, but each judge specifies only his first through k-th choices, where k < n. More complex types of partially ranked data are also investigated. Group theory is an important tool for extending the metrics. Full rankings are identified with elements of the permutation group, whereas partial rankings are identified with points in a coset space of the permutation group. The problem thus becomes one of ex­ tending metrics on the permutation group to metrics on a coset space of the permutation group. To carry out the extens"ions, two novel methods -- the so-called Hausdorff and fixed vector methods -- are introduced and implemented, which exploit this group-theoretic structure. Various data-analytic applications of metrics on fully ranked data have been presented in the statistical literature.

Inhaltsverzeichnis

Frontmatter
Chapter I. Introduction and Outline
Abstract
There are many instances of partially ranked data, where several items are ranked according to some criterion, but the ordering is not complete. In its simplest form, such data arises when there are n distinct items, and each judge lists in order his k favorite items, where k < n. An example with n = 5 and k = 3 is afforded by the De troit Area study, which asks people to specify the first, second, and third most important out of five named parts of marriage [B4]. More complex types of partially ranked data are also possible. The General Social Survey [D1] lists thirteen qualities that a child could possess, and from this list, respondents are asked to choose the most desirable quality, the two next most desirable qualities, the least desirable quality, and the two next least desirable qualities.
Douglas E. Critchlow
Chapter II. Metrics on Fully Ranked Data
Abstract
This chapter reviews, in part, an approach to permutation data presented in Diaconis’ monograph, Group Theory in Statistics [D2].
Douglas E. Critchlow
Chapter III. Metrics on Partially Ranked Data: The Case Where Each Judge Lists His k Favorite Items Out of n
Abstract
An illustration of this type of partially ranked data, with n = 5 and k = 3, is obtained by modifying the example of Section II.C: there are still five flavors of ice cream, but now each judge specifies only his first, second, and third choices. In this chapter, the six metrics of Section II.B are extended to metrics on this type of partially ranked data. Furthermore, the extensions preserve the necessary property of right-invariance, discussed in Section II.C.
Douglas E. Critchlow
Chapter IV. Metrics on Other Types of Partially Ranked Data
Abstract
In this chapter, the results of Chapter III are extended to the following more general type of partially ranked data: each judge is given a list of n items. He partitions these n items into r groups: the first group contains his n1 favorite items, the second group con tains his n2 next most preferred items, and so on; the final group contains his nr least favorite items. The judge does not state any preferences among members of the same group. Here n1,…, nr are strictly positive integers, satisfying n; the partial ranking is said to be “of type n1,…nr.
Douglas E. Critchlow
Chapter V. Distributional Properties of the Metrics
Abstract
Given two partial rankings π and σ , both of the same type, we might ask whether the two rankings are “significantly” correlated. That is, do the two rankings show more agreement than one might ex pect, under the assumption that they were generated, independently, from a uniform distribution on all possible partial rankings?
Douglas E. Critchlow
Chapter VI. Data Analysis, Using the Metrics
Abstract
Several applications of metrics on fully ranked data have already been proposed by various authors, especially Mallows [Ml], Diaconis [D2], and Friedman and Rafsky [F3, F4]. These applications include fitting probability models to fully ranked data, clustering ranked data for a goodness-of-fit test, multidimensional scaling for fully ranked data, and testing for significant differences between two populations of rankers. A nice presentation of the uses of metrics on fully ranked data is in Diaconis’ monograph [D2].
Douglas E. Critchlow
Backmatter
Metadaten
Titel
Metric Methods for Analyzing Partially Ranked Data
verfasst von
Douglas E. Critchlow
Copyright-Jahr
1985
Verlag
Springer New York
Electronic ISBN
978-1-4612-1106-8
Print ISBN
978-0-387-96288-7
DOI
https://doi.org/10.1007/978-1-4612-1106-8