1 Introduction
1.1 Notation
\(\mathcal {T}\)
| set of terms in the collection |
\(\mathcal {D}\)
| set of documents in the collection |
t
| a term \(t\in \mathcal {T}\) |
d
| a document \(d\in \mathcal {D}\) |
\(|\mathcal {T}|\)
| number of terms |
\(|\mathcal {D}|\)
| number of documents |
\(l_c\)
| length of collection (number of term occurrences) |
\(l_t\)
| number of occurrences of the term t in the collection, here also called term length (aka collection frequency) |
\(\mathcal {D}_t\)
| set of documents where t occurs |
\(\mathcal {T}_d\)
| set of terms in d |
\(|\mathcal {D}_t|\)
| number of documents where t occurs (aka document frequency, \({\text {df}}(t)\)) |
\(|\mathcal {T}_d|\)
| number of distinct terms in d |
\(l_d\)
| length of document d (number of term occurrences, note \(l_d \ge |\mathcal {T}_d|\)) |
\(\mathrm {E}_{\mathcal {D}_t}[\textit{tf}_d] = l_t/|\mathcal {D}_t|\)
| average frequency of term t in the documents in which the term occurs |
\(\mathrm {E}_{\mathcal {T}_d}[\textit{tf}_d]=l_d/|\mathcal {T}_d|\)
| average term frequency of terms that occur in document d |
\(\bar{l}_d := \mathrm {E}_{\mathcal {D}}[l_d] = l_c/|\mathcal {D}|\)
| average document length |
\(\bar{l}_t := \mathrm {E}_{\mathcal {T}}[l_t] = l_c/|\mathcal {T}|\)
| average term length |
\(P(t)=P_L(t)=l_t/l_c\)
| location based probability of \(t\in \mathcal {T}\) |
\(P(d)=P_L(d)=l_d/l_c\)
| location based probability of \(d\in \mathcal {D}\) |
\(P_D(t)=|\mathcal {D}_t|/|\mathcal {D}|\)
| document based probability of \(t\in \mathcal {T}\) |
\(P_T(d)=|\mathcal {T}_d|/|\mathcal {T}|\)
| term based probability of \(d\in \mathcal {D}\) |
1.2 Motivations
A document is verbose if few terms are repeated many times; its domain is \([1, l_d]\), 1 for non-verbose (no term occurs more then once), and \(l_d\) for maximally verbose (one term is repeated \(l_d\) times).
A term is bursty if it occurs in few documents many times; its domain is \([1, l_t]\), 1 for a non-bursty term (it occurs only once in each document where it is present), \(l_t\) for maximally bursty (all the occurrences are only in one document).
1.3 Contributions and structure
2 Background
3 TF normalisations
3.1 Duality: document verboseness and length
-
\(\ddot{K}_d\): the non-elite normalization comprises the non-elite pivots \(\ddot{l}_d\) and \(\ddot{v}_d\).
-
\(\hat{K}_d\): the elite normalization comprises the elite pivots \(\hat{l}_d\) and \(\hat{v}_d\).
-
The expression \({\text {pivdl}}\), pivoted document length, denotes one of the two:
3.2 Example of calculation of the pivotizations
3.3 Other dualities
3.4 Summary
Document verboseness |
\(v_d := l_d/|\mathcal {T}_d|\)
|
Document length | \(l_d := l_d/|\mathcal {D}_d|\) (noting that \(|\mathcal {D}_d|=1\)) |
Term burstiness |
\(b_t := l_t/|\mathcal {D}_t|\)
|
Term length | \(l_t := l_t/|\mathcal {T}_t|\) (noting that \(|\mathcal {T}_t|=1\)) |
4 Probabilistic derivation of IR models
4.1 Observations about the \({\text {TF}}\) component
The following equation indicates the difference between the standard \(K_d\) as known for BM25 [as shown in Eq. (26)], and the systematic extension proposed and investigated in this paper:the pivoted document length (\({\text {pivdl}}\)) andthe pivoted document verboseness (\(\text {pivdv}\)).
4.2 Observations about the \({\text {IDF}}\) component
4.3 LM and TF-IDF
5 Experiments
Corpus | EC | Challenge |
\(|\mathcal {D}|\)
|
\(|\mathcal {T}|\)
|
\(l_c\)
|
---|---|---|---|---|---|
\(\bar{l}_d\)
|
\(\bar{v}_d\)
|
\(\breve{v}_d\downarrow\)
| |||
\(\bar{l}_t\)
|
\(\bar{b}_t\)
|
\(\breve{b}_t\)
| |||
Aquaint | TREC | HARD’05 | 1,033,461 | 647,280 | 282,858,247 |
273.700 | 436.995 | 1.519 | |||
436.995 | 273.700 | 1.384 | |||
Disks 4&5 | TREC | Ad Hoc 8 | 528,106 | 737,963 | 156,226,039 |
295.823 | 211.699 | 1.575 | |||
211.699 | 295.823 | 1.377 | |||
eHealth’14 | CLEF | eHealth’14 | 1,104,298 | 1,103,947 | 685,458,908 |
620.917 | 308.294 | 1.900 | |||
308.294 | 620.917 | 1.349 | |||
.GOV | TREC | Web’02 | 1,214,592 | 2,937,251 | 1,770,120,644 |
1,457.379 | 602.645 | 4.830 | |||
602.645 | 1,457.379 | 3.012 |
5.1 Setup and materials
-
16 models based on TF-IDF variants: 4 \({\text {TF}}\) normalizations for each of the 4 \({\text {TF}}\) quantifications defined in Definition 2. Each model is identified by its \({\text {TF}}\) quantification, \(\text {TF}_{\text {total}}\), \(\text {TF}_{\text {log}}\), \(\text {TF}_{\text {BM25}}\), and \(\text {TF}_{\text {constant}}\) and kind of \({\text {TF}}\) normalization applied: non-elite disjunctive \(\ddot{K}_{\vee ,d}\), non-elite conjunctive \(\ddot{K}_{\wedge ,d}\), elite disjunctive \(\hat{K}_{\vee ,d}\) and elite conjunctive \(\hat{K}_{\wedge ,d}\).
-
4 models based on D-LM: Each Dirichlet-based mixture is identified by its kind of \(\lambda _{d}\) normalization applied: non-elite disjunctive \(\ddot{\lambda }_{\vee ,d}\), non-elite conjunctive \(\ddot{\lambda }_{\wedge ,d}\), elite disjunctive \(\hat{\lambda }_{\vee ,d}\) and elite conjunctive \(\hat{\lambda }_{\wedge ,d}\).
-
4 models based on the TF-\(\text {IDF}_\text {L}\): Each Dirichlet-based mixture is identified by its kind of \(\lambda _{q}\) normalization applied: non-elite disjunctive \(\ddot{\lambda }_{\vee ,q}\), non-elite conjunctive \(\ddot{\lambda }_{\wedge ,q}\), elite disjunctive \(\hat{\lambda }_{\vee ,q}\) and elite conjunctive \(\hat{\lambda }_{\wedge ,q}\). As \({\text {TF}}\) component, we select the non-normalized \(\text {TF}_{\text {total}}\).
5.2 Model candidates/structure
5.3 Results
P | Q | K | C | k1 | b | a |
\(\text {AP}\)
|
\(\text {NDCG}\)
|
\(\text {P@10}\)
|
---|---|---|---|---|---|---|---|---|---|
Non-elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0721 | 0.2936 | 0.1920 |
T | – |
\(>0\)
| 0.5 | – | 0.0900 \(\dagger\) | 0.3201 \(\dagger\) | 0.2160 | ||
\(\vee\)
|
\(>0\)
| 0.9 | 0.9 | 0.0904 \(\dagger\) | 0.3223 \(\dagger \, \ddagger\) | 0.2200 | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.6 | 0.0942 \(\dagger \, \ddagger\) | 0.3277 \(\dagger \, \ddagger\) | 0.2380 \(\ddagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.1614 | 0.4424 | 0.4160 | |
T | – | 0.2 | 0.3 | – | 0.2005 \(\dagger\) | 0.4799 \(\dagger\) | 0.4360 | ||
\(\vee\)
| 0.2 | 0.4 | 0.2 | 0.2010 \(\dagger\) | 0.4801 \(\dagger\) | 0.4320 | |||
\(\wedge\)
| 5.0 | 0.8 | 0.7 | 0.2003 \(\dagger\) | 0.4813 \(\dagger\) | 0.4400 | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.1848 | 0.4563 | 0.3660 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.6 | 0.1898 | 0.4584 | 0.4280 \(\dagger\) | ||
– | 1.5 | 0.3 | – | 0.2023 \(\dagger\) | 0.4797 \(\dagger\) | 0.4440 \(\dagger\) | |||
\(\vee\)
| 1.9 | 0.4 | 0.5 | 0.2030 \(\dagger\) | 0.4802 \(\dagger\) | 0.4480 \(\dagger\) | |||
\(\wedge\)
| 3.2 | 0.4 | 0.3 |
0.2032
\(\dagger\)
|
0.4812
\(\dagger\)
|
0.4540
\(\dagger\)
| |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0613 | 0.2436 | 0.1500 | |
T | – |
\(>0\)
| 0.1 | – | 0.0735 \(\dagger\) | 0.2744 \(\dagger\) | 0.1620 | ||
\(\vee\)
|
\(>0\)
| 0.2 | 0.7 | 0.0742 \(\dagger\) | 0.2756 \(\dagger\) | 0.1620 | |||
\(\wedge\)
|
\(>0\)
| 0.1 | 0.0 | 0.0740 \(\dagger\) | 0.2745 \(\dagger\) | 0.1660 | |||
Elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0721 | 0.2936 | 0.1920 |
T | – |
\(>0\)
| 0.5 | – | 0.0900 \(\dagger\) | 0.3201 \(\dagger\) | 0.2160 | ||
\(\vee\)
|
\(>0\)
| 1.0 | 0.6 | 0.0946 \(\dagger \, \ddagger\) | 0.3283 \(\dagger \, \ddagger\) | 0.2380 \(\ddagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.6 | 0.0942 \(\dagger \, \ddagger\) | 0.3277 \(\dagger \, \ddagger\) | 0.2380 \(\ddagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.1614 | 0.4424 | 0.4160 | |
T | – | 0.2 | 0.3 | – | 0.2005 \(\dagger\) | 0.4799 \(\dagger\) | 0.4360 | ||
\(\vee\)
| 0.2 | 0.6 | 0.5 | 0.2013 \(\dagger\) | 0.4798 \(\dagger\) | 0.4300 | |||
\(\wedge\)
| 0.2 | 0.8 | 0.7 | 0.2003 \(\dagger\) | 0.4810 \(\dagger\) | 0.4400 | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.1848 | 0.4563 | 0.3660 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.6 | 0.2012 \(\dagger\) | 0.4759 \(\dagger\) |
0.4480
\(\dagger\)
| ||
– | 1.5 | 0.3 | – | 0.2023 \(\dagger\) | 0.4797 \(\dagger\) | 0.4440 \(\dagger\) | |||
\(\vee\)
| 1.5 | 0.5 | 0.5 | 0.2034 \(\dagger\) | 0.4807 \(\dagger\) | 0.4420 \(\dagger\) | |||
\(\wedge\)
| 1.9 | 0.8 | 0.7 |
0.2037
\(\dagger\)
|
0.4833
\(\dagger\)
| 0.4400 \(\dagger\) | |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0613 | 0.2436 | 0.1500 | |
T | – |
\(>0\)
| 0.1 | – | 0.0735 \(\dagger\) | 0.2744 \(\dagger\) | 0.1620 | ||
\(\vee\)
|
\(>0\)
| 0.1 | 0.0 | 0.0735 \(\dagger\) | 0.2744 \(\dagger\) | 0.1620 | |||
\(\wedge\)
|
\(>0\)
| 0.1 | 0.0 | 0.0740 \(\dagger\) | 0.2745 \(\dagger\) | 0.1660 |
P | Q | K | C | k1 | b | a |
\(\text {AP}\)
|
\(\text {NDCG}\)
|
\(\text {P@10}\)
|
---|---|---|---|---|---|---|---|---|---|
Non-elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0635 | 0.2762 | 0.1360 |
T | – |
\(>0\)
| 0.5 | – | 0.0977 \(\dagger\) | 0.3306 \(\dagger\) | 0.2240 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.5 | 0.0 | 0.0977 \(\dagger\) | 0.3306 \(\dagger\) | 0.2240 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.5 | 0.1076 \(\dagger \, \ddagger\) | 0.3491 \(\dagger \, \ddagger\) | 0.2400 \(\dagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.1753 | 0.4568 | 0.3360 | |
T | – | 0.1 | 0.3 | – | 0.2478 \(\dagger\) | 0.5381 \(\dagger\) | 0.4280 \(\dagger\) | ||
\(\vee\)
| 0.1 | 0.9 | 0.9 | 0.2563 | 0.5415 | 0.4560 | |||
\(\wedge\)
| 0.1 | 0.9 | 0.5 | 0.2625 \(\dagger \, \ddagger\) | 0.5475 \(\dagger\) | 0.4620 \(\dagger \, \ddagger\) | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.2433 | 0.5193 | 0.4680 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.8 | 0.2614 \(\dagger\) | 0.5438 \(\dagger\) | 0.4480 | ||
– | 0.6 | 0.3 | – | 0.2614 \(\dagger\) | 0.5447 \(\dagger\) | 0.4520 | |||
\(\vee\)
| 0.6 | 0.3 | 0.1 | 0.2616 \(\dagger\) | 0.5441 \(\dagger\) | 0.4620 \(\ddagger\) | |||
\(\wedge\)
| 2.7 | 0.6 | 0.5 |
0.2681
\(\dagger \, \ddagger\)
|
0.5523
\(\dagger \, \ddagger\)
|
0.4660
| |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.1550 | 0.4071 | 0.2060 | |
T | – |
\(>0\)
| 0.1 | – | 0.1868 \(\dagger\) | 0.4387 \(\dagger\) | 0.3260 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.1 | 0.9 | 0.1880 \(\dagger\) | 0.4452 \(\dagger \, \ddagger\) | 0.3240 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 0.2 | 0.4 | 0.1922 \(\dagger\) | 0.4462 \(\dagger \, \ddagger\) | 0.3260 \(\dagger\) | |||
Elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0635 | 0.2762 | 0.1360 |
T | – |
\(>0\)
| 0.5 | – | 0.0977 \(\dagger\) | 0.3306 \(\dagger\) | 0.2240 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 1.0 | 0.7 | 0.1056 \(\dagger \, \ddagger\) | 0.3469 \(\dagger \, \ddagger\) | 0.2380 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.5 | 0.1076 \(\dagger \, \ddagger\) | 0.3491 \(\dagger \, \ddagger\) | 0.2400 \(\dagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.1753 | 0.4568 | 0.3360 | |
T | – | 0.1 | 0.3 | – | 0.2478 \(\dagger\) | 0.5381 \(\dagger\) | 0.4280 \(\dagger\) | ||
\(\vee\)
| 0.1 | 1.0 | 0.7 | 0.2521 \(\dagger\) | 0.5435 \(\dagger\) | 0.4500 \(\dagger \, \ddagger\) | |||
\(\wedge\)
| 0.1 | 0.8 | 0.6 | 0.2562 \(\dagger \, \ddagger\) | 0.5474 \(\dagger \, \ddagger\) | 0.4540 \(\dagger \, \ddagger\) | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.2433 | 0.5193 | 0.4680 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.6 | 0.2535 \(\dagger\) | 0.5399 \(\dagger\) |
0.4700
| ||
– | 0.6 | 0.3 | – | 0.2614 \(\dagger\) | 0.5447 \(\dagger\) | 0.4520 | |||
\(\vee\)
| 0.5 | 1.0 | 0.7 | 0.2638 \(\dagger\) | 0.5463 \(\dagger\) |
0.4700
| |||
\(\wedge\)
| 0.6 | 0.6 | 0.5 |
0.2681
\(\dagger \, \ddagger\)
|
0.5524
\(\dagger \, \ddagger\)
| 0.4680 \(\ddagger\) | |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.1550 | 0.4071 | 0.2060 | |
T | – |
\(>0\)
| 0.1 | – | 0.1868 \(\dagger\) | 0.4387 \(\dagger\) | 0.3260 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.1 | 0.4 | 0.1878 \(\dagger\) | 0.4418 \(\dagger \, \ddagger\) | 0.3320 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 0.2 | 0.4 | 0.1922 \(\dagger\) | 0.4462 \(\dagger \, \ddagger\) | 0.3260 \(\dagger\) |
P | Q | K | C | k1 | b | a |
\(\text {AP}\)
|
\(\text {NDCG}\)
|
\(\text {P@10}\)
|
---|---|---|---|---|---|---|---|---|---|
Non-elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.1166 | 0.3361 | 0.2640 |
T | – |
\(>0\)
| 0.7 | – | 0.2594 \(\dagger\) | 0.5206 \(\dagger\) | 0.5580 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.8 | 0.4 | 0.2610 \(\dagger\) | 0.5209 \(\dagger\) | 0.5540 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.4 | 0.2699 \(\dagger\) | 0.5322 \(\dagger\) | 0.5580 \(\dagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.2106 | 0.4637 | 0.4280 | |
T | – | 0.2 | 0.7 | – | 0.4222 | 0.6701 \(\dagger\) | 0.7960 \(\dagger\) | ||
\(\vee\)
| 0.4 | 0.8 | 0.5 | 0.4242 |
0.6729
\(\dagger \, \ddagger\)
| 0.8000 \(\dagger\) | |||
\(\wedge\)
| 1.9 | 1.0 | 0.4 |
0.4260
|
0.6729
\(\dagger\)
|
0.8040
\(\dagger\)
| |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.3729 | 0.6310 | 0.7640 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.0 | 0.3729 | 0.6310 | 0.7640 | ||
– | 4.5 | 0.6 | – | 0.4022 \(\dagger\) | 0.6595 \(\dagger\) | 0.7840 | |||
\(\vee\)
| 4.5 | 0.6 | 0.0 | 0.4022 \(\dagger\) | 0.6595 \(\dagger\) | 0.7840 | |||
\(\wedge\)
| 4.5 | 0.7 | 0.0 | 0.4018 \(\dagger\) | 0.6542 \(\dagger\) | 0.7880 | |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0474 | 0.2021 | 0.1140 | |
T | – |
\(>0\)
| 0.2 | – | 0.0755 \(\dagger\) | 0.2552 \(\dagger\) | 0.2280 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.0 | 0.0 | 0.0840 \(\dagger\) | 0.3523 \(\dagger \, \ddagger\) | 0.1760 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 0.2 | 0.2 | 0.0745 \(\dagger\) | 0.2551 \(\dagger\) | 0.2260 \(\dagger\) | |||
Elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.1166 | 0.3361 | 0.2640 |
T | – |
\(>0\)
| 0.7 | – | 0.2594 \(\dagger\) | 0.5206 \(\dagger\) | 0.5580 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 1.0 | 0.5 | 0.2697 \(\dagger\) | 0.5316 \(\dagger \, \ddagger\) | 0.5820 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.4 | 0.2699 \(\dagger\) | 0.5322 \(\dagger\) | 0.5580 \(\dagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.2106 | 0.4637 | 0.4280 | |
T | – | 0.2 | 0.7 | – | 0.4222 | 0.6701 \(\dagger\) | 0.7960 \(\dagger\) | ||
\(\vee\)
| 0.2 | 1.0 | 0.4 |
0.4239
| 0.6713 \(\dagger\) |
0.8080
\(\dagger\)
| |||
\(\wedge\)
| 0.2 | 1.0 | 0.4 |
0.4239
|
0.6715
\(\dagger\)
| 0.8060 \(\dagger\) | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.3729 | 0.6310 | 0.7640 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.1 | 0.3742 | 0.6320 | 0.7640 | ||
– | 4.5 | 0.6 | – | 0.4022 \(\dagger\) | 0.6595 \(\dagger\) | 0.7840 | |||
\(\vee\)
| 5.0 | 1.0 | 0.5 | 0.4079 \(\dagger \, \ddagger\) | 0.6635 \(\dagger \, \ddagger\) | 0.7900 | |||
\(\wedge\)
| 5.0 | 1.0 | 0.4 | 0.4092 \(\dagger \, \ddagger\) | 0.6607 \(\dagger\) | 0.8000 | |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0474 | 0.2021 | 0.1140 | |
T | – |
\(>0\)
| 0.2 | – | 0.0755 \(\dagger\) | 0.2552 \(\dagger\) | 0.2280 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.2 | 0.0 | 0.0755 \(\dagger\) | 0.2552 \(\dagger\) | 0.2280 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 0.2 | 0.2 | 0.0745 \(\dagger\) | 0.2551 \(\dagger\) | 0.2260 \(\dagger\) |
P | Q | K | C | k1 | b | a |
\(\text {AP}\)
|
\(\text {NDCG}\)
|
\(\text {P@10}\)
|
---|---|---|---|---|---|---|---|---|---|
Non-elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0171 | 0.1387 | 0.0260 |
T | – |
\(>0\)
| 0.9 | – | 0.0568 \(\dagger\) | 0.2642 \(\dagger\) | 0.0880 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.9 | 0.4 | 0.0577 \(\dagger\) | 0.2713 \(\dagger \, \ddagger\) | 0.0820 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.4 | 0.0563 \(\dagger\) | 0.2732 \(\dagger\) | 0.0800 \(\dagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.0603 | 0.2719 | 0.1100 | |
T | – | 0.2 | 0.8 | - | 0.1951 \(\dagger\) | 0.4799 \(\dagger\) | 0.2420 \(\dagger\) | ||
\(\vee\)
| 0.2 | 0.9 | 0.6 | 0.1991 \(\dagger\) | 0.4803 \(\dagger\) | 0.2360 \(\dagger\) | |||
\(\wedge\)
| 0.2 | 0.9 | 0.2 | 0.1974 \(\dagger\) | 0.4812 \(\dagger\) | 0.2360 \(\dagger\) | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.1948 | 0.4696 | 0.2380 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.0 | 0.1948 | 0.4696 | 0.2380 | ||
– | 4.1 | 0.7 | – | 0.2010 | 0.4777 |
0.2520
| |||
\(\vee\)
| 3.1 | 0.7 | 0.1 |
0.2016
|
0.4816
| 0.2420 | |||
\(\wedge\)
| 5.0 | 0.8 | 0.2 | 0.1923 | 0.4722 | 0.2520 | |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0140 | 0.1514 | 0.0140 | |
T | – |
\(>0\)
| 0.1 | – | 0.0310 \(\dagger\) | 0.2041 \(\dagger\) | 0.0500 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.2 | 0.3 | 0.0310 \(\dagger\) | 0.2008 \(\dagger\) | 0.0500 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 0.1 | 0.5 | 0.0311 \(\dagger\) | 0.1979 \(\dagger\) | 0.0480 \(\dagger\) | |||
Elite |
\(\text {TF}_{\text {total}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0171 | 0.1387 | 0.0260 |
T | – |
\(>0\)
| 0.9 | – | 0.0568 \(\dagger\) | 0.2642 \(\dagger\) | 0.0880 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 1.0 | 0.4 | 0.0635 \(\dagger\) | 0.2860 \(\dagger \, \ddagger\) | 0.0940 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 1.0 | 0.4 | 0.0563 \(\dagger\) | 0.2732 \(\dagger\) | 0.0800 \(\dagger\) | |||
\(\text {TF}_{\text {log}}\)
| S | – | 1.0 | 0.0 | – | 0.0603 | 0.2719 | 0.1100 | |
T | – | 0.2 | 0.8 | - | 0.1951 \(\dagger\) | 0.4799 \(\dagger\) | 0.2420 \(\dagger\) | ||
\(\vee\)
| 0.1 | 0.9 | 0.2 | 0.1989 |
0.4817
| 0.2360 | |||
\(\wedge\)
| 0.1 | 0.9 | 0.2 | 0.1975 \(\dagger\) | 0.4816 \(\dagger\) | 0.2380 \(\dagger\) | |||
\(\text {TF}_{\text {BM25}}\)
| S | – | 1.2 | 0.7 | – | 0.1948 | 0.4696 | 0.2380 | |
T |
\(\vee\)
| 1.2 | 0.7 | 0.0 | 0.1948 | 0.4696 | 0.2380 | ||
– | 4.1 | 0.7 | – | 0.2010 | 0.4777 |
0.2520
| |||
\(\vee\)
| 3.6 | 0.8 | 0.2 |
0.2016
| 0.4808 | 0.2460 | |||
\(\wedge\)
| 3.3 | 1.0 | 0.4 | 0.1966 | 0.4770 | 0.2500 | |||
\(\text {TF}_{\text {constant}}\)
| S | – |
\(>0\)
| 0.0 | – | 0.0140 | 0.1514 | 0.0140 | |
T | – |
\(>0\)
| 0.1 | – | 0.0310 \(\dagger\) | 0.2041 \(\dagger\) | 0.0500 \(\dagger\) | ||
\(\vee\)
|
\(>0\)
| 0.2 | 0.3 | 0.0319 \(\dagger\) | 0.1988 \(\dagger\) | 0.0520 \(\dagger\) | |||
\(\wedge\)
|
\(>0\)
| 0.1 | 0.5 | 0.0311 \(\dagger\) | 0.1979 \(\dagger\) | 0.0480 \(\dagger\) |
Challenge | P | K | C | b | a |
\(\text {AP}\)
|
\(\text {NDCG}\)
|
\(\text {P@10}\)
|
---|---|---|---|---|---|---|---|---|
HARD’05 | S | – | 1.0 | – | 0.1912 | 0.4680 | 0.4220 | |
Non-elite | T |
\(\vee\)
| 1.0 | 0.8 | 0.1970 | 0.4801 \(\dagger\) |
0.4580
\(\dagger\)
| |
\(\wedge\)
| 1.0 | 0.3 |
0.1998
\(\dagger\)
|
0.4806
\(\dagger\)
| 0.4380 | |||
Elite | T |
\(\vee\)
| 1.0 | 0.0 | 0.1912 | 0.4680 | 0.4220 | |
\(\wedge\)
| 1.0 | 0.0 | 0.1912 | 0.4680 | 0.4220 | |||
Ad Hoc 8 | S | – | 1.0 | – | 0.2583 | 0.5420 | 0.4560 | |
Non-elite | T |
\(\vee\)
| 0.9 | 0.7 |
0.2625
\(\dagger\)
|
0.5481
\(\dagger\)
| 0.4600 | |
\(\wedge\)
| 0.8 | 0.3 | 0.2606 | 0.5448 | 0.4480 | |||
Elite | T |
\(\vee\)
| 0.9 | 0.0 | 0.2589 | 0.5410 |
0.4680
| |
\(\wedge\)
| 0.9 | 0.0 | 0.2587 | 0.5415 | 0.4600 | |||
eHealth’14 | S | – | 1.0 | – | 0.3863 | 0.6444 | 0.7980 | |
Non-elite | T |
\(\vee\)
| 0.8 | 0.5 | 0.3965 \(\dagger\) | 0.6468 | 0.7900 | |
\(\wedge\)
| 0.7 | 0.7 |
0.4082
\(\dagger\)
|
0.6616
\(\dagger\)
|
0.7920
| |||
Elite | T |
\(\vee\)
| 0.8 | 0.0 | 0.3939 \(\dagger\) | 0.6467 | 0.7820 \(\dagger\) | |
\(\wedge\)
| 0.7 | 0.0 | 0.3927 \(\dagger\) | 0.6468 | 0.7900 | |||
Web’02 | S | – | 1.0 | – | 0.1877 | 0.4617 | 0.2380 | |
Non-elite | T |
\(\vee\)
| 0.8 | 0.0 | 0.1984 \(\dagger\) | 0.4767 \(\dagger\) | 0.2580 | |
\(\wedge\)
| 0.5 | 0.1 |
0.2039
\(\dagger\)
|
0.4844
\(\dagger\)
| 0.2600 | |||
Elite | T |
\(\vee\)
| 0.9 | 0.3 | 0.2002 \(\dagger\) | 0.4785 \(\dagger\) | 0.2620 | |
\(\wedge\)
| 0.5 | 0.0 | 0.2037 \(\dagger\) | 0.4836 \(\dagger\) |
0.2660
|
Challenge | P | K | C | b | a |
\(\text {AP}\)
|
\(\text {NDCG}\)
|
\(\text {P@10}\)
|
---|---|---|---|---|---|---|---|---|
HARD’05 | S | – | – | – | 0.0721 | 0.2936 | 0.1920 | |
Non-elite | T |
\(\vee\)
| 1.0 | 1.0 |
0.0967
\(\dagger\)
|
0.3329
\(\dagger\)
|
0.2120
| |
\(\wedge\)
| 1.0 | 1.0 |
0.0967
\(\dagger\)
|
0.3329
\(\dagger\)
|
0.2120
| |||
Elite | T |
\(\vee\)
| 1.0 | 1.0 | 0.0753 \(\dagger\) | 0.2994 \(\dagger\) | 0.1960 | |
\(\wedge\)
| 1.0 | 1.0 | 0.0753 \(\dagger\) | 0.2994 \(\dagger\) | 0.1960 | |||
Ad Hoc 8 | S | – | – | – | 0.0635 | 0.2762 | 0.1360 | |
Non-elite | T |
\(\vee\)
| 1.0 | 1.0 |
0.1500
\(\dagger\)
|
0.4135
\(\dagger\)
|
0.2440
\(\dagger\)
| |
\(\wedge\)
| 1.0 | 1.0 |
0.1500
\(\dagger\)
|
0.4135
\(\dagger\)
|
0.2440
\(\dagger\)
| |||
Elite | T |
\(\vee\)
| 1.0 | 1.0 | 0.0688 \(\dagger\) | 0.2914 \(\dagger\) | 0.1480 \(\dagger\) | |
\(\wedge\)
| 1.0 | 1.0 | 0.0688 \(\dagger\) | 0.2914 \(\dagger\) | 0.1480 \(\dagger\) | |||
eHealth’14 | S | – | – | – | 0.1166 | 0.3361 | 0.2640 | |
Non-elite | T |
\(\vee\)
| 1.0 | 1.0 |
0.1623
\(\dagger\)
|
0.4177
\(\dagger\)
|
0.3220
| |
\(\wedge\)
| 1.0 | 1.0 |
0.1623
\(\dagger\)
|
0.4177
\(\dagger\)
|
0.3220
| |||
Elite | T |
\(\vee\)
| 1.0 | 1.0 | 0.1231 \(\dagger\) | 0.3502 \(\dagger\) | 0.2780 | |
\(\wedge\)
| 1.0 | 1.0 | 0.1231 \(\dagger\) | 0.3502 \(\dagger\) | 0.2780 | |||
Web’02 | S | – | – | – | 0.0171 | 0.1387 | 0.0260 | |
Non-elite | T |
\(\vee\)
| 1.0 | 1.0 |
0.0249
\(\dagger\)
|
0.1865
\(\dagger\)
|
0.0460
\(\dagger\)
| |
\(\wedge\)
| 1.0 | 1.0 |
0.0249
\(\dagger\)
|
0.1865
\(\dagger\)
|
0.0460
\(\dagger\)
| |||
Elite | T |
\(\vee\)
| 1.0 | 1.0 | 0.0183 \(\dagger\) | 0.1456 \(\dagger\) | 0.0280 | |
\(\wedge\)
| 1.0 | 1.0 | 0.0183 \(\dagger\) | 0.1456 \(\dagger\) | 0.0280 |
P | Q | C | k1 | b | a | HARD’05 | Ad Hoc 8 | eHealth’14 | Web’02 |
---|---|---|---|---|---|---|---|---|---|
Non-elite |
\(\text {TF}_{\text {total}}\)
| – |
\(>0\)
|
\(*\)
| – | 0.0873 | 0.0927 | 0.2594 | 0.0543 |
\(\vee\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.0873 | 0.0927 | 0.2594 | 0.0543 | ||
\(\wedge\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.0942 | 0.1058 | 0.2699 | 0.0523 | ||
\(\text {TF}_{\text {log}}\)
| – |
\(*\)
|
\(*\)
| – | 0.2005 | 0.2436 | 0.4136 | 0.1911 | |
\(\vee\)
|
\(*\)
|
\(*\)
|
\(*\)
| 0.2293 | 0.2591 |
0.6081
|
0.2058
| ||
\(\wedge\)
|
\(*\)
|
\(*\)
|
\(*\)
| 0.2257 | 0.2679 | 0.5985 | 0.2048 | ||
\(\text {TF}_{\text {BM25}}\)
|
\(\vee\)
| 1.2 | 0.7 |
\(*\)
| 0.2228 |
0.2718
| 0.5679 | 0.2033 | |
– |
\(*\)
|
\(*\)
| – | 0.1983 | 0.2597 | 0.3987 | 0.1937 | ||
\(\vee\)
|
\(*\)
|
\(*\)
|
\(*\)
|
0.2316
| 0.2671 | 0.6050 | 0.2042 | ||
\(\wedge\)
|
\(*\)
|
\(*\)
|
\(*\)
| 0.2006 | 0.2634 | 0.3990 | 0.1892 | ||
\(\text {TF}_{\text {constant}}\)
| – |
\(>0\)
|
\(*\)
| – | 0.0735 | 0.1868 | 0.0727 | 0.0309 | |
\(\vee\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.1215 | 0.2087 | 0.2647 | 0.0559 | ||
\(\wedge\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.0740 | 0.1881 | 0.0735 | 0.0291 | ||
Elite |
\(\text {TF}_{\text {total}}\)
| – |
\(>0\)
|
\(*\)
| – | 0.0873 | 0.0927 | 0.2594 | 0.0543 |
\(\vee\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.1495 | 0.1206 | 0.5188 | 0.0965 | ||
\(\wedge\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.0942 | 0.1058 | 0.2699 | 0.0523 | ||
\(\text {TF}_{\text {log}}\)
| – |
\(*\)
|
\(*\)
| – | 0.2005 | 0.2436 | 0.4136 | 0.1911 | |
\(\vee\)
|
\(*\)
|
\(*\)
|
\(*\)
| 0.2268 | 0.2591 | 0.6070 | 0.2060 | ||
\(\wedge\)
|
\(*\)
|
\(*\)
|
\(*\)
| 0.2265 | 0.2593 |
0.6131
|
0.2062
| ||
\(\text {TF}_{\text {BM25}}\)
|
\(\vee\)
| 1.2 | 0.7 |
\(*\)
| 0.2301 | 0.2573 | 0.5631 | 0.2033 | |
– |
\(*\)
|
\(*\)
| – | 0.1983 | 0.2597 | 0.3987 | 0.1937 | ||
\(\vee\)
|
\(*\)
|
\(*\)
|
\(*\)
|
0.2339
|
0.2718
| 0.6028 | 0.2023 | ||
\(\wedge\)
|
\(*\)
|
\(*\)
|
\(*\)
| 0.2010 | 0.2636 | 0.4089 | 0.1926 | ||
\(\text {TF}_{\text {constant}}\)
| – |
\(>0\)
|
\(*\)
| – | 0.0735 | 0.1868 | 0.0727 | 0.0309 | |
\(\vee\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.1198 | 0.2075 | 0.2645 | 0.0553 | ||
\(\wedge\)
|
\(>0\)
|
\(*\)
|
\(*\)
| 0.0740 | 0.1881 | 0.0735 | 0.0291 |
Challenge | P | C | D-LM | TF-\(\text {IDF}_{\text {L}}\) |
---|---|---|---|---|
HARD’05 | Non-elite |
\(\vee\)
|
0.2288
|
0.1523
|
\(\wedge\)
| 0.1998 | 0.0967 | ||
Elite |
\(\vee\)
| 0.2258 | 0.1369 | |
\(\wedge\)
| 0.1912 | 0.0753 | ||
Ad Hoc 8 | Non-elite |
\(\vee\)
|
0.2679
|
0.1600
|
\(\wedge\)
| 0.2539 | 0.1500 | ||
Elite |
\(\vee\)
| 0.2653 | 0.0821 | |
\(\wedge\)
| 0.2556 | 0.0688 | ||
eHealth’14 | Non-elite |
\(\vee\)
| 0.5740 |
0.4545
|
\(\wedge\)
| 0.4060 | 0.1623 | ||
Elite |
\(\vee\)
|
0.5769
| 0.4116 | |
\(\wedge\)
| 0.3927 | 0.1231 | ||
Web’02 | Non-elite |
\(\vee\)
| 0.2051 |
0.0450
|
\(\wedge\)
| 0.2011 | 0.0250 | ||
Elite |
\(\vee\)
|
0.2092
| 0.0393 | |
\(\wedge\)
| 0.2010 | 0.0183 |