1 Introduction
2 Statistical Model and Optimal Classifiers
3 Motivating Example
Country | Posterior probabilities | Total | ||
---|---|---|---|---|
Sweden | \(p_1=0.23\) | \(p_2=0.10\) | \(p_3=0.07\) | \(P_1=0.40\) |
Norway | \(p_4=0.22\) | \(p_5=0.20\) | \(p_6=0.18\) | \(P_2=0.60\) |
Reward function I | Reward function II | Reward function III | |||
---|---|---|---|---|---|
c | \(\hat{I}\) | c | \(\hat{I}\) | c | \(\hat{I}\) |
\((0.220,\infty )\) | \(\{1\}\) | \((0.550,\infty )\) | \(\emptyset \) | \((0.550,\infty )\) | \(\emptyset \) |
(0.200, 0.220] | \(\{1,4\}\) | (0.383, 0.550] | \(\{4\}\) | (0.500, 0.550] | \(\{4\}\) |
(0.180, 0.200] | \(\{1,4,5\}\) | (0.200, 0.383] | \(\{1,4\}\) | (0.450, 0.500] | \(\{4,5\}\) |
(0.100, 0.180] | \(\{1,4,5,6\}\) | (0.180, 0.200] | \(\{1,4,5\}\) | (0.383, 0.450] | \(\{4,5,6\}\) |
(0.070, 0.010] | \(\{1,2,4,5,6\}\) | (0.100, 0.180] | \(\{1,4,5,6\}\) | (0.167, 0.383] | \(\{1,4,5,6\}\) |
[0.000, 0.070] | \(\{1,2,3,4,5,6\}\) | (0.070, 0.010] | \(\{1,2,4,5,6\}\) | (0.117, 0.167] | \(\{1,2,4,5,6\}\) |
[0.000, 0.070] | \(\{1,2,3,4,5,6\}\) | [0.000, 0.117] | \(\{1,2,3,4,5,6\}\) |
4 Partial Rejection, One Block of Categories
4.1 Additive Penalties for the Sizes of Classified Sets
4.2 Multiplicative Penalties for the Sizes of Classified Sets
5 Several Blocks of Categories
5.1 Block-Dependent Rewards of Including the True Category
5.2 Additive Penalties of Sizes of Classified Sets
5.3 Multiplicative Penalties of Sizes of Classified Sets
6 Illustration for An Ornithological Data Set
Species | \(n_i\) |
---|---|
Reed warbler | 409 |
Blyth’s reed warbler | 41 |
Paddyfield warbler | 18 |
Marsh warbler | 414 |
6.1 Data Set
6.2 Model
mvtnorm
(Genz et al., 2021).6.3 Classifiers Based on Composite Proportion-Based Re-ward Functions
Binary reward function | Reward criteria |
---|---|
\(\tilde{R}_1(\mathcal{I},i) = 1(\mathcal{I}=\{i\})\) | Correct (point) classification |
\(\tilde{R}_2(\mathcal{I},i) = 1(i\in \mathcal{I}\wedge \mathcal{I}\subseteq \mathcal{N}_{k(i)})\) | Correct category is in the classifier, and no category from an incorrect block is in the classifier. The analogy of \(\tilde{R}_1\) for block prediction. |
\(\tilde{R}_3(\mathcal{I},i) = 1(i\in \mathcal{I})\) | Correct category in classifier |
\(\tilde{R}_4(\mathcal{I},i) = 1(\mathcal{I}\cap \mathcal{N}_{k(i)}\ne \emptyset )\) | Some category from the correct block in the classifier. The analogy of \(\tilde{R}_3\) for block prediction. |
\(\varepsilon \) | Prior | \(\tilde{R}\) | \(b_{\varepsilon ,0.05}\) | ||
---|---|---|---|---|---|
\({w^{(\text {bird})}}\) | \({w^{(\text {spec})}}\) | \({w^{(\text {rare})}}\) | |||
1/2 | \({\pi ^{(\text {flat})}}\) | \(\tilde{R}_3\) | \(\ge 20.00\) | 2.29 | 2.11 |
\(\tilde{R}_4\) | \(\ge 20.00\) | 3.05 | 2.11 | ||
\({\pi ^{(\text {prop})}}\) | \(\tilde{R}_3\) | \(\ge 20.00\) | 5.23 | 4.81 | |
\(\tilde{R}_4\) | \(\ge 20.00\) | 6.96 | 4.81 | ||
\({\pi ^{(\text {real})}}\) | \(\tilde{R}_3\) | 1.07 | 0.43 | 0.18 | |
\(\tilde{R}_4\) | \(\ge 20.00\) | 0.52 | 0.18 | ||
2 | \({\pi ^{(\text {flat})}}\) | \(\tilde{R}_3\) | \(\ge 20.00\) | 2.29 | 2.11 |
\(\tilde{R}_4\) | \(\ge 20.00\) | 3.05 | 2.11 | ||
\({\pi ^{(\text {prop})}}\) | \(\tilde{R}_3\) | \(\ge 20.00\) | 5.23 | 4.81 | |
\(\tilde{R}_4\) | \(\ge 20.00\) | 6.96 | 4.81 | ||
\({\pi ^{(\text {real})}}\) | \(\tilde{R}_3\) | 1.07 | 0.21 | 0.18 | |
\(\tilde{R}_4\) | \(\ge 20.00\) | 0.52 | 0.18 |
6.4 Results
optimise
-function in R\(\varepsilon \) | Prior | \(\tilde{R}\) | \({w^{(\text {bird})}}\) | \({w^{(\text {spec})}}\) | \({w^{(\text {rare})}}\) | |||
---|---|---|---|---|---|---|---|---|
\(b_\varepsilon \) | non-reward rate | \(b_\varepsilon \) | non-reward rate | \(b_\varepsilon \) | non-reward rate | |||
\(\frac{1}{2}\) | \({\pi ^{(\text {flat})}}\) | \(\tilde{R}_1\) | 1.24 | 1.59% | 17.16 | 9.12% | 1.16 | 4.06 % |
\(\tilde{R}_2\) | 1.24 | 1.59% | 17.16 | 9.12% | 1.16 | 4.06% | ||
\({\pi ^{(\text {prop})}}\) | \(\tilde{R}_1\) | 2.63 | 1.59% | 2.24 | 4.06% | 2.63 | 4.06% | |
\(\tilde{R}_2\) | 2.63 | 1.59% | 2.24 | 4.06% | 2.63 | 4.06% | ||
\({\pi ^{(\text {real})}}\) | \(\tilde{R}_1\) | 10.28 | 6.69% | 10.28 | 19.41% | 10.28 | 49.62% | |
\(\tilde{R}_2\) | 10.28 | 6.69% | 10.28 | 19.41% | 10.28 | 49.62% | ||
2 | \({\pi ^{(\text {flat})}}\) | \(\tilde{R}_1\) | 1.24 | 1.59% | 17.16 | 9.12% | 1.16 | 4.06% |
\(\tilde{R}_2\) | 1.24 | 1.59% | 17.16 | 9.12% | 1.16 | 4.06% | ||
\({\pi ^{(\text {prop})}}\) | \(\tilde{R}_1\) | 2.63 | 1.59% | 2.24 | 4.06% | 2.63 | 4.06% | |
\(\tilde{R}_2\) | 2.63 | 1.59% | 2.24 | 4.06% | 2.63 | 4.06% | ||
\({\pi ^{(\text {real})}}\) | \(\tilde{R}_1\) | 10.28 | 6.69% | 10.28 | 19.41% | 10.28 | 49.62% | |
\(\tilde{R}_2\) | 10.28 | 6.69% | 10.28 | 19.41% | 10.28 | 49.62% |