40

The Power of Double Elimination

Comments:
Threaded Linear
#1
FerahgoTheGreat

TLDR Double elimination is the best tournament format for producing results that reflect the strength of the teams. A stochastic MATLAB code was written to provide some stats to support this claim.

Double elimination has long been one of the most popular tournament formats because it provides a cushion for when a team has a bad day or gets a bad seeding. Most people intuitively realize that single elimination leads to higher statistical fluctuations. There are downsides to double elimination including the fact that more games need to be played (this could be good or bad), and the finals problem rears its ugly head. For double elimination finals, the team that wins out to reach the finals must either be eliminated after only one series loss or be given an advantage in the finals which makes for a much less exciting finale.

There are four tournament formats with eight teams that are under consideration in this analysis.

  1. DES: Double elimination standard. DES example
  2. DENS: Double elimination non-standard. Notice in this DENS example how Gamelanders are not swapped to the other side of the bracket when they drop to lower round 2.
  3. SE: Single elimination. SE example
  4. GSS: Two groups of four with double elimination that leads to a four team single elimination playoffs. GSS example

METHODOLOGY
Every team is given a rank from 1-8. This analysis is primarily concerned with two systems. When the seeding is perfect (seed 1 vs seed 8, seed 4 vs seed 5, etc) and when the seeding is random. Random seeding imitates when a better team gets a lower seed than a worse team. Obviously most real tournaments happen somewhere between these two outcomes, but by looking at the extremes, the middle ground can be interpolated.

Each tournament is simulated with a MATLAB code at least ten thousand times to ensure the results converge on an answer. The winner of each matchup is determined by a random number. Two approaches were taken in this analysis. The first consists of assuming that every higher seed wins. The second is to assign each team (1-8) a value along a Gaussian distribution (bell curve) and the probability that the higher rank wins is determined by these values. The Gaussian distribution used in this analysis gives a 90% chance that the rank 1 team beats the rank 8 team 1(90)8, a 57.3% chance that the rank 3 team beats the rank 6 team 3(57.3)6, 2(71.1)7, and so on. Most tournaments don’t follow an ideal bell curve, but the exact distribution does not matter as long as the higher ranked teams have a higher chance of beating lower ranked teams. The exact numerical values of the results that are achieved will be different, but the ranking of the different tournament formats will be the same.

What result qualifies as a good result? Upsets are good for specific narratives and overall excitement, but if there are too many upsets, the best teams don’t reach the end, and the tournament winds up with very anticlimactic matches. (For example, most of Astralis’ opponents in the CSGO major finals had previous great upsets but were completely outmatched in the finals.) A tournament format that allows the best teams to more consistently reach the later matches creates better narratives, better viewing experiences, and a better quality of tournament. There are other factors that go into making a tournament great, but this is very important for the format.

For this code, after every single simulation of a tournament, the placement of each team is compared to their rank. For instance, if the rank 1 team actually placed third, they would add +2 to a “variance” tally that is averaged over all the teams in the tournament. If the tournament variance tally is averaged to be 1.5 that means that each team placed an average of 1.5 spots away from where they were expected to place based on their rank. The values of these variance tallies are averaged over the thousands of times the tournament format is simulated. A smaller number is better for the overall health of the tournament. (A very tiny number is bad because that would mean only the best teams would have a chance, but all of these formats have reasonable numbers).

RESULTS

The easiest question to answer is “what format struggles with bad seeding?”. Assume that the higher rank wins every single time and that the seedings are random. (Results from where there is upset potential will be dealt with later). The numbers shown represent the average “variance” tallies described in the previous section.

DES    0.4744
DENS   0.5572
GSS    0.6624
SE     1.1429

If you randomly seed single elimination (SE), a lot of high seeds will face each other in the early rounds. That leads to the teams being on average 1.1429 placements away from their rank. Standard double elimination (DES) does the best with only 0.4744 placements on average per team. If a tournament does not have the capacity to seed properly through qualifiers or rankings they should never use single elimination. If there are simply too many games for double elimination such as large open qualifiers with tons of teams, single elim is fine, but playoffs should use double elimination.

So double elimination is better at dealing with poor seeding, but how does it deal with upsets? As previously stated, upsets are not a bad thing, but the best teams should still be able to reach the later matches more consistently. Using the Gaussian distribution to create a % chance of an upset depending on the difference in rank between the teams, the four tournaments were simulated again. UV stands for “upset variance” and is the variance when the seeding is perfect. SV stands for “seeding variance” and is the variance when the seeding is random. Most real tournaments will fall somewhere in between.

       UV       SV
DES    1.7063     1.8892
DENS   1.6796     1.8913
GSS    1.7516     1.9249
SE     1.7724     2.0727

Interestingly enough, non standard double elimination fares better than the standard version with perfect seeding because it makes upsets less common in the lower bracket round 2. This is the only result which is close enough to make the method used to determine the % chance of an upset matter. The SV is usually worse for the non standard version.

Another stat that might be of interest is the quality of the grand finals which is easily the most important match of the tournament. The finals quality FQ is calculated as the average rank of the two teams that make the grand finals. (For instance, if the rank 1 team and the rank 3 team made the finals, the FQ would be 2). FQR is the same stat but for random seeding instead of perfect seeding.

       FQ       FQR
DES    2.8035     3.0098
DENS   2.7838     3.0118
GSS    3.0284     3.1399
SE     2.9785     3.3630

Pretty much everything that can be said about the UV and SV results can be said about the finals quality. GSS finally dethroned single elimination as the worst format with perfect seeding although it still fares better with random seeding.

CONCLUSIONS
If a tournament organizer is happy with having more games and dealing with the finals problem, double elimination is clearly the best tournament to run. A lot of tournaments run single elimination simply because that format is the simplest for human brains to understand and build hype around. Throwing in some double elimination in order to reach a single elimination playoffs can help alleviate the problems of single elimination particularly when there are more than eight teams.

#2
Vexlic
-13
Frags
+

Nerdge

#3
YalohA
15
Frags
+

WAYTOODANK

#4
Fyujin
6
Frags
+

vouch + didn't read

#5
ValdemieGC
19
Frags
+

Damn! Maths and valorant, two of my favourite things.

Saving to read it later carefully

#6
ItsMeDio
-5
Frags
+

nice to know
nerd

#7
arecyc
-7
Frags
+

didnt read

#8
valvenq
4
Frags
+

I'm gonna save this for when I'm gonna read it. Love this type of stuff

#9
Vortexy
13
Frags
+
arecyc [#7]

didnt read

2nd easiest ratio of my life

#10
Knightsofdoom
2
Frags
+

Respect for writing this

#11
Vortexy
7
Frags
+

W
quality posts like this from the community is what make this site good

#12
Dolphin
-3
Frags
+

sorry, will be bookmarking this (ps: i will not) so i can read it later because i can't focus right now. CHeers!

#13
taker
2
Frags
+

good read, I heckin love math

#14
FerahgoTheGreat
23
Frags
+
Vexlic [#2]

Nerdge

I am a very big nerd indeed.

#15
FerahgoTheGreat
2
Frags
+

I might do more than 8 team formats especially with Reykjavik around the corner.

#16
1234567890
2
Frags
+

W post! thanks for taking the time to explain and post this

#17
RedHood14
1
Frags
+

common Ferahgo W, but i'm too dumb to understand this

#18
Valentheon
1
Frags
+

is ur name Ferahgo from Redwall? surely right?

#19
looklook
-1
Frags
+

whoa thats alota words i didnt read! thanks!

#20
FerahgoTheGreat
1
Frags
+
Valentheon [#18]

is ur name Ferahgo from Redwall? surely right?

Yep, Ferahgo The Assassin

#21
inlucid
0
Frags
+

bump

#22
FerahgoTheGreat
0
Frags
+
inlucid [#21]

bump

^

#23
FerahgoTheGreat
6
Frags
+
ValdemieGC [#5]

Damn! Maths and valorant, two of my favourite things.

Saving to read it later carefully

Maths and Valorant. What a strange combo. I approve.

#24
deathlyclaws
0
Frags
+

W post

#25
queueK
0
Frags
+

stats post? POGGERCHAMPION

100% will read this later

#26
GlaZeD
0
Frags
+
Vortexy [#9]

2nd easiest ratio of my life

4th easiest assist of my life

#27
gtngodyorukingdestruction
0
Frags
+

Big giant Zombs/0 great W

#28
kusoboshi
0
Frags
+

w ya love to see it

#29
queueK
1
Frags
+

The DENS format performing better than the normal DES format in any situation was surprising.

I wonder what the results end up being if the winner is chosen completely at random, independent of their seeding and rank.

About the finals issue with double elim, it'd be interesting to see the number or percentage of times the upper finals and grand finals ended up being between different teams.

Additionally, I wonder how, or if, these results would change given an abnormal amount of teams (Iceland 2021, EMEA Challengers 1 Playoffs 2022).

This is an excellent write-up! I hope there are more of these in the future

#30
anticemyt
-1
Frags
+

ratio

#31
fatpoipoiiii
0
Frags
+

can u make a more simple conclusion so people like me can understand

#32
Fluffyy
-1
Frags
+

didnt read

#33
fatpoipoiiii
0
Frags
+
Fluffyy [#32]

didnt read

lmao liquid fan cant read

#34
Fluffyy
0
Frags
+
fatpoipoiiii [#33]

lmao liquid fan cant read

Bonkers bandwagon cant even understand even after reading #31

#35
FerahgoTheGreat
2
Frags
+
fatpoipoiiii [#31]

can u make a more simple conclusion so people like me can understand

There are still some aesthetic problems with double elimination, but it produces better results where the better teams make it farther on average. Particularly if the seeding isn't perfect.

#36
fictional
0
Frags
+

i bet u watch at least one indian guy videos on YouTube to do this

#37
guNKiller
0
Frags
+
fictional [#36]

i bet u watch at least one indian guy videos on YouTube to do this

what?

#38
LouBag
0
Frags
+
Fluffyy [#32]

didnt read

I wonder when ignorance became something people intentionally flaunt to somehow seem cool.

Like bro, we get it… You’re to dumb to care and to lazy to read.

#39
FerahgoTheGreat
1
Frags
+
fictional [#36]

i bet u watch at least one indian guy videos on YouTube to do this

No, I just coded it from scratch with a lot of trial and error. There are probably some other technical ways to collect data, but I just did this blind.

#40
MilanTheMyth
1
Frags
+

Holy shit, have my babies

#41
FerahgoTheGreat
1
Frags
+
queueK [#29]

The DENS format performing better than the normal DES format in any situation was surprising.

I wonder what the results end up being if the winner is chosen completely at random, independent of their seeding and rank.

About the finals issue with double elim, it'd be interesting to see the number or percentage of times the upper finals and grand finals ended up being between different teams.

Additionally, I wonder how, or if, these results would change given an abnormal amount of teams (Iceland 2021, EMEA Challengers 1 Playoffs 2022).

This is an excellent write-up! I hope there are more of these in the future

The difference between the double elimination formats for proper seeding is the only result that changes if I modify the % chance of teams beating each other. If I use a linear distribution which means that each team is evenly spaced in skill, the UV values are pretty much the same. I would still recommend the standard version since it prevents early rematches.

If the winner is chosen at random, every team has an equal chance of finishing anywhere and the results don't really mean anything.

I am still interested in tournament formats with more teams and other stats that would provide useful information. I will probably try to do something with 12 team formats before Masters 1 since that appears to be the norm now.

#42
FerahgoTheGreat
0
Frags
+
gtngodyorukingdestruction [#27]

Big giant Zombs/0 great W

It is still bewildering to me how much drama that caused lol.

#43
fatpoipoiiii
0
Frags
+
FerahgoTheGreat [#35]

There are still some aesthetic problems with double elimination, but it produces better results where the better teams make it farther on average. Particularly if the seeding isn't perfect.

Hoooo okay, good job! U r making the community more alive

#44
ayanzx
1
Frags
+

I HECKIN LOVE MATLAB BATCHEST

#45
FerahgoTheGreat
0
Frags
+
ayanzx [#44]

I HECKIN LOVE MATLAB BATCHEST

The ease of use really got me into coding.

#46
S0v4k
0
Frags
+

Common genius W

#47
ayanzx
0
Frags
+
FerahgoTheGreat [#45]

The ease of use really got me into coding.

yeah i really enjoy the simplicity of it and yeah matlab and octave really got me into coding as well

#48
Valentheon
0
Frags
+
FerahgoTheGreat [#20]

Yep, Ferahgo The Assassin

That's awesome. Redwall was one of my fav childhood book series, need to reread them sometime.

#49
shantanu
0
Frags
+

W

  • Preview
  • Edit
› check that that your post follows the forum rules and guidelines or get formatting help
Sign up or log in to post a comment