1 Introduction
Round-robin tournaments are abundantly used in all kinds of sport competitions, both in professional leagues, and in amateur leagues. The setting where each pair of teams meets once (single round robin (SRR)), or twice (double round robin (DRR)) has proven to be a very popular format to arrive at a ranking of the participating teams Goossens and Spieksma (
2012). Very often, each team has a venue, and a match between two teams takes place at the venue of one of the two teams, meaning that one team plays home (H), while the other team plays away (A). Deciding upon the matches to be played in each round of a single or double round-robin tournament is known to be of great practical importance in scheduling professional leagues such as soccer competitions. Indeed, scheduling a round-robin tournament has attracted a lot of attention in the scientific literature Kendall et al. (
2010), Van Bulck et al. (
2020); a popular approach to do so is known as a first-break, then-schedule approach Nemhauser and Trick (
1998). The relevance of the first-break, then-schedule approach is strengthened by the observation that, when scheduling multiple leagues consecutively, first-break then-schedule is able to take capacity considerations of clubs that have teams playing in different leagues, into account Davari et al. (
2020). We now informally describe this approach (see Sect.
2 for more precise terminology).
First-break, then-schedule is a hierarchical approach consisting of two phases. In the first phase, each team receives a home-away pattern (HAP), i.e., it is specified for each round whether the team plays at its home venue, or not. Next, in the second phase, each match that needs to be played is assigned to a round; of course, for such an assignment of matches to be feasible, it must hold that each team plays at most one match in each round and that the assignment is compatible with the patterns from phase 1. In such a hierarchical approach, it is clear that the scheduling decision (which matches to play in which round) crucially depends on the HAP-set that is chosen in the first phase. For instance, it is conceivable that in the second phase, a set of constraints is revealed that are incompatible with the given HAP-set. This would need to be solved by either changing the HAP-set, or putting energy into mitigating the effects of violating that specific set of constraints.
The theme of this contribution is that not all HAP-sets have the same risk of leading to incompatible constraints. Indeed, some HAP-sets are more flexible than others. In Sect.
2, we give a number of definitions, and in Sect.
3 we introduce our measures for the flexibility of a HAP-set. We also show how single-break HAP-sets, in particular the canonical pattern set, behave with respect to these measures. In Sect.
4, we describe how to compute these measures using integer programming; we conclude in Sect.
5.
2 Preliminaries
We consider a time-constrained SRR with an even number of teams, denoted by
\(2n, n \in {\mathbb {N}}\). To avoid trivialities, we assume
\(2n \ge 4\). A
match between two distinct teams
i and
j is denoted by an unordered pair
\(\{i,j\}\),
\(1 \le i \ne j \le 2n\). Every team plays every other team exactly once. A
schedule is a specification of all
\(\left( {\begin{array}{c}2n\\ 2\end{array}}\right) \) matches in
\(2n-1\) rounds, including the venues. Thus, in a schedule every match
\(\{i,j\}\) is assigned to a specific round, the venue of the match is specified, and every team must play exactly one match in every round. For a survey of round-robin scheduling, we refer to Rasmussen and Trick (
2008) and Drexl and Knust (
2007).
Although we restrict our analysis to SRRs, we claim that many of the ideas can be generalized to DRRs as well (or
k-round-robin settings with
\(k \ge 2\)). In particular, the definitions of the measures (see Sect.
3) can be generalized to DRRs. However, we want to point out that a match in a DRR tournament is generally seen as an ordered pair (
i,
j), rather than an unordered pair of teams. Indeed, whereas in a single round-robin tournament, the home advantage is to be decided in the schedule, a double round-robin tournament typically assumes that each team plays at home against each other team exactly once. Moreover, very often it is required that one encounter between a pair of teams occurs in the first half, whereas the other encounter occurs in the second half of the schedule; these two encounters should be separated by a given number of rounds. Taking such issues into account would impact the corresponding definitions for DRR tournaments. We now proceed with defining our terminology.
An important property of a HAP is formed by the occurrence of so-called
breaks: the presence of two consecutive symbols that are identical (see de Werra
1981 and Goossens and Spieksma
2011). To simplify notation, we take a circular view of a HAP, i.e., we define
\(h_0 \equiv h_{2n-1}\).
Notice that, since \(2n-1\) is odd, and given our circular view of a HAP, any HAP has at least one break. Indeed, observe that a HAP that consists of alternating H’s and A’s has a single break in round 1. In fact, motivated by practice, we are interested in the number of breaks present in a HAP, and we define the break number of a HAP, denoted by bn(h), to capture this number.
Notice that an entire HAP h is determined by the rounds in which its breaks occur and by its value in the final round. Thus, we can write a HAP h as follows: \(P^H(r_1,r_2, \ldots , r_{bn(h)})\), where h denotes a HAP that has (i) bn(h) breaks occurring in rounds \(r_1, r_2, \ldots , r_{bn(h)}\), and (ii) an H in the final round. In case HAP h has an A in its final round and has breaks in rounds \(r_1, r_2, \ldots , r_{bn(h)}\), we write \(P^A(r_1, r_2, \ldots , r_{bn(h)})\). Clearly, \(P^H(r_1,r_2, \ldots ,r_{bn(h)})\) and \(P^A(r_1,r_2, \ldots ,r_{bn(h)})\) are complementary patterns.
We sometimes omit the distinction between
\(P^H,P^A\) and simply use
P. When discussing complementary pairs of patterns, we denote by
\(P^c\) the complement of HAP
P.
Table 1
HAPs for the 2019–2020 top Dutch male tennis league
Team 1 (Lewabo) | A | H | A | H | A | H | H | \(P^H(7)\) |
Team 2 (Spijkenisse) | A | H | A | H | H | A | H | \(P^H(5)\) |
Team 3 (Suthwalda) | H | A | H | A | H | A | H | \(P^H(1)\) |
Team 4 (Nieuwekerk) | H | A | H | A | A | H | A | \(P^A(5)\) |
Team 5 (Arnolduspark) | H | A | H | A | H | A | A | \(P^A(7)\) |
Team 6 (Leimonias) | A | H | A | H | H | A | A | \(P^A(5,7)\) |
Team 7 (Naaldwijk) | A | H | A | H | A | H | A | \(P^A(1)\) |
Team 8 (Kimbria) | H | A | H | A | A | H | H | \(P^H(5,7)\) |
As a real-life example illustrating this terminology, we consider the HAPs used by the professional Dutch tennis competition for teams in 2019, organized by the Royal Dutch Tennis Association. This is a competition between 8 teams who play a single round robin, according to the HAPs given in Table
1 (for the precise schedule, we refer to the KNLTB
2019). The first column shows the team’s names, and the next 7 columns describe the home-away patterns. The final column gives the corresponding description of the particular pattern.
We remark that many other real-life examples of SRR tournaments exist. For instance, the TATA Steel Chess tournament is organized as a single round robin, where playing with white (black) can be regarded as playing home (away) Lambers et al. (
2020).
We now turn our attention to sets of HAPs; we define a HAP-set as follows:
A schedule is called compatible with a HAP-set if the set of HAPs implied by the schedule coincides with the given HAP-set. We formally define the following notions:
The HAP-set given in Table
1 is an example of a complementary HAP-set; 6 of the 8 HAPs are single-break, the remaining two patterns each have two breaks.
Identifying whether a given HAP-set is feasible is known as the pattern set feasibility problem. As far as we are aware, the complexity status of this problem is not settled; Miyashiro et al. (
2003) describe a necessary condition for feasibility, see also Briskorn (
2008), Horbach (
2010). In addition, observe that when given a schedule compatible with a certain HAP-set for a SRR, one can (i) interchange any two rounds and (ii) change in a single round each H to an A, and vice versa, and arrive at another HAP-set that must be feasible.
We are primarily interested in feasible HAP-sets that consist of patterns with a limited number of breaks, as witnessed by the following definition.
In particular, \({\mathcal {H}}_{2n,1}\) is the collection of single-break HAP-sets of order 2n.
Finally, we point out that Definitions
2.1-
2.8 can be generalized to double round-robin tournaments by using
\(4n-2\) (instead of
\(2n-1\)) rounds.
2.1 About single-break HAP-sets
Single-break HAP-sets are of special interest. de Werra (
1981) showed that a feasible single-break HAP-set exists for any 2
n teams. Furthermore, for single-break HAP-sets, the conditions of Miyashiro et al. (
2003) have been (experimentally) shown to be sufficient for
\(2n \le 26\) Miyashiro et al. (
2003). de Werra (
1981) showed that each feasible single-break HAP-set for 2
n teams is complementary; as a consequence, in each round either 0 or 2 HAPs will feature a break. Notice that two teams with identical HAPs
\(h^1,h^2 \in F_{2n}\) are never able to play against each other.
We can describe a feasible, single-break HAP-set
\(F_{2n}\) quite compactly as follows. It follows from de Werra (
1981) that out of the
\(2n-1\) rounds, there are
n pairwise distinct rounds in which two patterns have a break, one team having an away break, and the other having a home break. Let us denote these
n rounds by
\(r_i\),
\(i=1, \ldots , n\).
We sort these
\(r_i\)’s in increasing order. Next, we define the difference between consecutive rounds as follows:
\(d_i = r_{i+1} - r_{i}\),
\(i=1, \ldots , n\), where we use
\(r_{n+1} \equiv r_1+2n-1\). So the sequence
\(D=(d_1, d_2, \ldots , d_n)\) gives the
break-gaps, i.e., the number of rounds between two consecutive breaks (notice that
\(d_i \geqslant 1\),
\(i=1, \ldots , n\)). We refer to this representation of a feasible, single-break HAP-set as the
break-gap representation or D-notation (see also de Werra
1981 and Knust and Lücking
2009).
Not only can a feasible, single-break HAP-set \(F_{2n}\) be represented by a sequence \((d_1, d_2, \ldots , d_{n})\), it is also true that any set of n positive integers that sum to \(2n-1\) corresponds to a (set of) single-break HAP-sets (which is not necessarily feasible). Although a HAP-set, when specified in terms of \(r_i\) values, uniquely determines a corresponding D sequence, a D sequence can correspond to multiple HAP-sets with different \(r_i\)’s. Consider, for instance, \(n=2\) and the following two distinct HAP-sets: \(\{HAA, HHA, AHH, AAH\}\) and \(\{HAH, HHA, AHA, AAH\}\). The first one has \(r_1=2\) and \(r_2=3\) leading to \(D=(1,2)\), while the second one has \(r_1=1\) and \(r_2=2\), also leading to \(D=(1,2)\).
Furthermore, de Werra (
1981) shows that a sequence
D is feasible if and only if all cyclic permutations of
D are feasible. Hence, without loss of generality, when dealing with single-break HAP-sets
\(F_{2n}\), we will assume that the two (complementary) patterns with a break in round 1 are always present in
\(F_{2n}\), and we restrict ourselves to the lexicographically largest sequence among this set of equivalent sequences.
A number of observations follow, where we identify a pattern with a team:
This is explained in the following way. As \(h^1,h^2\) start with the same symbol, until round \(r-1\), we have \(h^1_j=h^2_j\), \(j=1, \ldots , r-1\). Then, since \(h^1\) has a break in round r, we get \(h^1_r\ne h^2_r\). However, as \(h^2\) has a break in round \(r+1\), we see that \(h^1_{r+1}=h^2_{r+1}\) and as both patterns are single break, \(h^1_{j}=h^2_{j}\) for \(r+1 \le j \le 2n-1\). Ergo, \(h^1,h^2\) can only play each other in round r.
One of the most popular single-break HAP-sets in practice is the canonical pattern set (CPS) of order 2
n (see de Werra
1980); it is defined as follows.
In other words, we can order the teams whose HAPs start with an H (A) such that the break-gap is 2 for any two consecutive teams, except for one pair of teams, for which the break-gap equals 1. This is indeed a possible representation, since \(\sum _i d_i = 2n-1\).
3 Measuring the flexibility of a HAP-set
Given a HAP-set
\(F_{2n}\), we are interested in the diversity of the schedules that are compatible with
\(F_{2n}\) (
\(2n \ge 4\)). To make this concrete, we introduce three distinct measures called the width (Sect.
3.1), the fixed part (Sect.
3.2), and the spread (Sect.
3.3), and we show how the CPS fares on these measures.
3.1 Measure 1: the width
We call two schedules, each compatible with a given HAP-set \(F_{2n}\), distinct when there exists a match that is played in one round in schedule 1 and in another round in schedule 2. Let us define here a stronger notion of distinctness.
Using Definition
3.1, we define the width of a HAP-set:
For instance, if
\(F_{2n}\) is infeasible,
\(width(F_{2n})=0\), and if a HAP-set has width 2 (or higher), then there exist two schedules such that each match occurs in different rounds in the two schedules. As an example, the width of the HAP-set given in Table
1 equals 1, implying that no pair of match-distinct schedules compatible with that HAP-set exists. Further, we refer to Table
2 (right) for an example of a HAP-set with width 2.
We remark here that, in order to generalize the definition of width to DRR tournaments, it suffices to make a distinction between matches (i, j) and (j, i).
Single-break HAP-sets do not allow large width; in fact, there are always matches that must be played in a particular round, as witnessed by the following theorem.
In fact, the argument above implies that any feasible HAP-set \(F_{2n}\) which contains two single-break patterns of the form \(P^H(r)\) and \(P^H(s)\) (or \(P^A(r)\) and \(P^A(s)\)) with \(|r-s|=1\) has width 1.
Theorem
3.1 is tight in the following sense: even when all patterns except one are single-break patterns, HAP-sets with width 2 exist. In fact, even for
\(2n=4\), it is possible to find a HAP-set with width equal to 2 containing only one pattern that is not a single-break pattern. Such a HAP-set consisting of 4 teams is given by
Q in the following example.
Table 2
On the left CPS\(_4\), on the right HAP-set Q
1 | H | A | H | 1 | H | H | H |
2 | H | A | A | 2 | H | A | A |
3 | A | H | A | 3 | A | H | A |
4 | A | H | H | 4 | A | A | H |
The HAP-set on the left is
\(CPS_4\); the HAP-set on the right differs from
\(CPS_4\) in two entries and is given by
\(Q=\{P^H(1,2,3), P^A(3), P^A(1), P^H(2)\}\). Both HAP-sets are compatible with two distinct schedules; however, there are two schedules compatible with HAP-set
Q that are match-distinct, while this is not the case for
\(CPS_4\). Table
3 displays two schedules compatible with
\(CPS_4\), and two that are compatible with
Q.
Table 3
S1, S2 are compatible with \(CPS_4\); S3, S4 are compatible with Q
R1 | (1,3),(2,4) | (1,4),(2,3) | 1 | (1,3),(2,4) | (1,4),(2,3) |
R2 | (4,1),(3,2) | (3,1),(4,2) | 2 | (1,4),(3,2) | (1,2),(3,4) |
R3 | (1,2),(4,3) | (1,2),(4,3) | 3 | (1,2),(4,3) | (1,3),(4,2) |
Thus, while \(CPS_4\) has width 1, the HAP-set Q has width 2.
3.2 Measure 2: the fixed part
As can be deduced from Theorem
3.1, the width is a measure that does not differentiate between feasible single-break HAP-sets. Still, different feasible, single-break HAP-sets are not necessarily equal when it comes to the
number of matches that must take place in the same round. Therefore, we introduce here a more refined measure that is able to do so.
Notice that if there is just a single schedule compatible with
\(F_{2n}\), the value of this measure equals
\(\left( {\begin{array}{c}2n\\ 2\end{array}}\right) \). Notice also that if the width of a HAP-set
\(F_{2n}\) equals 2, it follows that
\(FP(F_{2n}) = 0\). For example, the fixed part of the HAP-set given in Table
1 is 4. More in particular, it consists of the matches Team 1 - Team 7, Team 2 - Team 6, Team 3 - Team 5, and Team 4 - Team 8, which all have to be played in round 7.
The following lower and upper bounds for \(\text{ FP }(F_{2n})\) are readily obtained. Given a feasible, single-break HAP-set, let \((d_1, d_2, \ldots , d_n)\) be its break-gap representation. We use \(I = |i:d_i=1|\).
3.2.1 The FP of the CPS
We now proceed by investigating the FP of the CPS. We start by recalling the following definition and insights that come from Miyashiro et al. (
2003).
Notice that the teams of a set
U all have to play each other once in a feasible schedule, thus resulting in a total amount of
\(\frac{|U|(|U|-1)}{2}\) matches that have to be scheduled between the teams of
U. We introduce
\(A(U)=\sum _r m^-_r(U)\), summing the maximum number of the matches within
U that can be scheduled per round. Next, we define
\(\alpha (U) = A(U)-\frac{|U|(|U|-1)}{2}\) and as observed in Miyashiro et al. (
2003), if
\(\alpha (U)<0\), the HAP-set is infeasible, since not all matches within
U can be scheduled.
If \(\alpha (U)=0\), we say that U is tight, meaning that in every round r, exactly \(m^-_r(U)\) matches between teams in U have to be scheduled and that all teams \(t \in M^-_r(U)\) have to be scheduled against other teams of \(U\setminus M^-_r(U)\).
Now, to facilitate our analysis, we first identify subsets of patterns (teams) from the CPS that are tight.
Notice that \(\{P^A(2n-1), P^H(1)\}=U_1 \subset U_2 \subset \ldots \subset U_{\lfloor \frac{n}{2} \rfloor }\) and that \(|U_i| = |U^c_i| = 2i\), \(1 \le i \le \lfloor \frac{n}{2} \rfloor \). We also identify a set of rounds as follows:
Observe that \(\{2n-1\}=R_1 \subset R_2 \subset \ldots \subset R_{\lfloor \frac{n}{2} \rfloor }\) and that \(|R_i| = 4i-3\), \(1 \le i \le \lfloor \frac{n}{2} \rfloor \). Indeed, notice that round \(2n-1 \in R_i\) for all \(i=1, \ldots , \lfloor \frac{n}{2} \rfloor \), and that with each unit increase of index i four rounds are added, two to the “left” of round \(2n-1\) and two to the “right” of round \(2n-1\).
We are now in a position to prove that, for schedules that are compatible with \(CPS_{2n}\), exactly n matches have precisely one round in which they can be played.
By symmetry, it follows that a similar analysis for the teams from the sets \(U^c_i\), \(1 \le i\le \lfloor \frac{n}{2} \rfloor \) implies that all teams from these sets play a match that can only be played in round \(2n-1\). Together with the claim, this means that we have shown that \(\lfloor \frac{n}{2} \rfloor + \lfloor \frac{n}{2} \rfloor \) matches can only be played in round \(2n-1\). If n is odd, this leaves out two teams: \(P^A(n)\) and \(P^H(n)\). However, since these two teams have to play some match in round \(2n-1\), and as all other teams are already paired, they have to play each other. Thus, in any feasible schedule compatible with the CPS, the entire round \(2n-1\) consists of matches that can only be played in this round.
Finally, we point out that, due to Observation
2.3, in any feasible schedule compatible with
\(CPS_{2n}\), the matches in round
r can be interchanged with the matches in round
\(r+1\) (
\(r-1\)) if
r is odd (even), for
\(r=1, \ldots , 2n-2\). This proves the theorem.
\(\square \)
Observe that the proof of Theorem
3.2 implies that all matches in the last round of a schedule compatible with a CPS are
fixed, i.e., these matches cannot be played in any other round.
3.3 Measure 3: the spread
Given a HAP-set, we call the spread of a match \(\{i,j\}\) the number of distinct rounds in which it can be played, i.e., for which there is a schedule compatible with the given HAP-set that has this match in this round. Notice that this is not necessarily equal to the number of rounds r where \(h^i_r\) and \(h^j_r\) differ. Next, by simply summing the spread of a match over all matches, we arrive at the spread of a HAP-set. Formally:
A higher value of the spread for a feasible HAP-set indicates more flexibility in scheduling individual matches. As an example, the spread of the HAP-set given in Table
1 is 84 (16 matches can be played in 4 rounds, 8 matches have 2 possible rounds, and the fixed part consists of 4 matches). We now state an upper bound on the spread of CPS
\(_{2n}\).
We can calculate the actual value of the spread of CPS
\(_{2n}\) using integer programming (see Sect.
4). In Table
4, we explicitly list the value of the spread of CPS
\(_{2n}\) and its upper bound for values of 2
n up to 30, and note that this bound is tight. In fact, we conjecture that this bound is tight for any even number of teams.
Table 4
Values for the spread and the upper bound for CPS with 2n teams
upper bound | 10 | 35 | 88 | 177 | 314 | 507 | 768 | 1105 | 1530 | 2051 | 2680 | 3425 | 4298 | 5307 |
spread(CPS) | 10 | 35 | 88 | 177 | 314 | 507 | 768 | 1105 | 1530 | 2051 | 2680 | 3425 | 4298 | 5307 |
5 Conclusion
In a first-break-then-schedule approach, it is important to have freedom to schedule the individual matches after the home-away patterns have been chosen. We have proposed three measures that indicate the amount of freedom associated with a particular HAP-set. We have theoretically established how the most popular HAP-set, the canonical pattern set (CPS), fares on these measures. Also, we have computed, using integer programming, for all feasible single-break HAP-sets up to order 16, the numerical values of the proposed measures. It is interesting to see that, when the number of teams exceeds 10, the CPS is not the most flexible HAP-set.
More generally, while we have focused here on single break HAP-sets, we expect that, when allowing HAPs with break-numbers of value 2 and larger, one can construct HAP-sets that score better on our flexibility measures, i.e., achieve a better value for the width, fixed part, and spread, respectively. We leave this as an interesting direction to explore.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.