By Daniel Takata Gomes
In this article, we present each swimmer’s chances of winning the gold medal at the Tokyo Olympic Games.
Statistical methods based on computational simulations were used in order to estimate the probability of winning the gold medal for each swimmer. Results from 2018 were considered in the calculations. The more recent the result is, more impact it has in the probabilities.
The method was determined by observing results from 2014 to 2016 and comparing them to what in fact occurred at the 2016 Olympics, in order to find the most suitable probability distribution.
Some remarks: Adam Peaty has a 97.09% chance of winning the 100 breast. Caeleb Dressel, 93.24% in the 100 fly. Regan Smith, 86.43% in the 200 back. Kristof Milak, 83.54% in the 200 fly. Katie Ledecky, 72.3% in the 800 and 1500 free.
The most disputed races are the women’s 100 free (Simone Manuel 35.48%, Cate Campbell 33.34% and Sarah Sjostrom 21.96%) and the men’s 1500 free (Florian Wellbrock 31.31%, Gregorio Paltrinieri 30.81% and Mykhailo Romanchuk 25.99%).
Of course, these probabilities will change, depending on how swimmers will swim until the Olympics. The probabilities will be updated on a regular basis.
METHODOLOGY EXPLAINED
The approach here is to determine the probabilities in an empirical fashion. If you have a coin and you don’t know the chances of tails and heads, you can toss the coin, let’s say, 1,000 times. If you get 512 tails and 488 heads, the estimated probabilities are 51.2% and 48.8%, respectively.
So we don’t know what’s the chance of, let’s say, Simone Manuel winning the 50 freestyle. Manuel could swim the event 1,000 times, as well as her adversaries, and we could count how many times she would win to estimate her winning probability. Obviously this can’t be done. In this situation, we simulate the possible outcomes in a computer program using a statistical method.
How do we do that? Let’s go back to 2016. Manuel arrived in Rio with a 24.33 from the US Olympic Trials. In Rio, she managed to a 24.09. Cate Campbell came in with a 23.84, and in Rio, she went 24.15. And so on. I conducted the calculations and realized that, comparing to the times registered from 2014 until the Olympics, the times of the top swimmers in the Olympics present a specific pattern of variability, well explained by the so-called normal distribution of probability.
Using the very same pattern of variability, it is possible to simulate the possible outcomes for Tokyo. Manuel has a 23.97 from 2019. So, her time for Tokyo is simulated according to that pattern of variability. In 1,000 trials, let’s say that her simulated times are 24.08, 23.90, 24.18, 24.01, 23.79, 23.89, etc. Her times will float around 23.97 with some variability. We do the same for every other swimmer. Let’s say that, in 1,000 trials, Manuel has the best time of all swimmers in 800, so she has an 8% chance of winning.
The same procedure was conducted in all events, according to the respective pattern of variation. In relays, the sum of the times of the fastest swimmers of each country in each round of simulation was considered.
CURRENT PROBABILITIES
Women’s events
50m freestyle
- Sarah Sjostrom (SWE) – 52.31%
- Cate Campbell (AUS) – 25.15%
- Pernille Blume (DEN) – 9.87%
- Simone Manuel (USA) – 8.49%
- Liu Xiang (CHN) – 2.32%
- Bronte Campbell (AUS) – 0.58%
- Emma McKeon (AUS) – 0.48%
- Ranomi Kromowidjojo (NED) – 0.30%
100m freestyle
- Simone Manuel (USA) – 35.48%
- Cate Campbell (AUS) – 33.34%
- Sarah Sjostrom (SWE) – 21.96%
- Emma McKeon (AUS) – 5.58%
- Bronte Campbell (AUS) – 1.86%
- Taylor Ruck (CAN) – 0.34%
- Pernille Blume (DEN) – 0.24%
- Mallory Comerford (USA) – 0.17%
200m freestyle
- Federica Pellegrini (ITA) – 33.65%
- Ariarne Titmus (AUS) – 23.77%
- Sarah Sjostrom (SWE) – 16.55%
- Emma McKeon (AUS) – 10.97%
- Katie Ledecky (USA) – 4.16%
- Siobhan Haughey (HKG) – 3.59%
- Yang Junxuan (CHN) – 2.83%
- Charlotte Bonnet (FRA) – 1.09%
400m freestyle
- Katie Ledecky (USA) – 51.43%
- Ariarne Titmus (AUS) – 34.51%
- Leah Smith (USA) – 10.25%
- Ajna Kesely (HUN) – 2.62%
- Wang Jianjiahe (CHN) – 0.41%
- Li Bingjie (CHN) – 0.39%
- Katinka Hosszu (HUN) – 0.11%
- Simona Quadarella (ITA) – 0.09%
800m freestyle
- Katie Ledecky (USA) – 72.30%
- Simona Quadarella (ITA) – 9.45%
- Ariarne Titmus (AUS) – 7.71%
- Wang Jianjiahe (CHN) – 4.54%
- Leah Smith (USA) – 3.66%
- Sarah Koehler (GER) – 1.95%
- Kiah Melverton (AUS) – 0.11%
- Mireia Belmonte (ESP) – 0.08%
1500m freestyle
- Katie Ledecky (USA) – 72.25%
- Simona Quadarella (ITA) – 16.21%
- Wang Jianjiahe (CHN) – 5.96%
- Sarah Koehler (GER) – 3.37%
- Delfina Pignatiello (ARG) – 0.75%
- Erica Sullivan (USA) – 0.30%
- Ashley Twichell (USA) – 0.29%
- Ajna Kesely (HUN) – 0.27%
100m backstroke
- Regan Smith (USA) – 47.14%
- Kylie Masse (CAN) – 30.49%
- Minna Atherton (AUS) – 5.81%
- Kathleen Baker (USA) – 5.26%
- Olivia Smoliga (USA) – 4.25%
- Taylor Ruck (CAN) – 2.67%
- Phoebe Bacon (USA) – 1.79%
- Emily Seebohm (AUS) – 0.89%
200m backstroke
- Regan Smith (USA) – 86.43%
- Kylie Masse (CAN) – 6.75%
- Kaylee McKeown (AUS) – 2.76%
- Margherita Panziera (ITA) – 2.26%
- Taylor Ruck (CAN) – 0.51%
- Katinka Hosszu (HUN) – 0.41%
- Minna Atherton (AUS) – 0.40%
- Kathleen Baker (USA) – 0.27%
100m breaststroke
- Lilly King (USA) – 60.06%
- Yuliya Efimova (RUS) – 34.23%
- Reona Aoki (JPN) – 1.18%
- Martina Carraro (ITA) – 1.17%
- Annie Lazor (USA) – 1.01%
- Tatjana Schoenmaker (RSA) – 0.43%
- Arianna Castiglioni (ITA) – 0.23%
- Molly Hannis (USA) – 0.22%
200m breaststroke
- Yuliya Efimova (RUS) – 57.62%
- Annie Lazor (USA) – 10.62%
- Lilly King (USA) – 7.54%
- Evgeniia Chikunova (RUS) – 7.52%
- Tatjana Schoenmaker (RSA) – 7.07%
- Bethany Galat (USA) – 2.57%
- Sydney Pickrem (CAN) – 2.27%
- Emily Escobedo (USA) – 1.72%
100m butterfly
- Maggie Macneil (CAN) – 61.83%
- Sarah Sjostrom (SWE) – 25.98%
- Emma McKeon (AUS) – 9.02%
- Kelsi Worrell (USA) – 1.43%
- Marie Wattel (FRA) – 0.39%
- Elena Di Liddo (ITA) – 0.35%
- Brianna Throssell (AUS) – 0.26%
- Louise Hansson (SWE) – 0.22%
200m butterfly
- Hali Flickinger (USA) – 31.15%
- Boglarka Kapas (HUN) – 16.06%
- Katherine Drabot (USA) – 13.79%
- Katinka Hosszu (HUN) – 8.70%
- Alys Margaret Thomas (GBR) – 7.65%
- Franziska Hentke (GER) – 4.06%
- Zhang Yufei (CHN) – 2.70%
- Regan Smith (USA) – 1.88%
200m ind. medley
- Katinka Hosszu (HUN) – 52.63%
- Shiho Matsumoto (JPN) – 26.10%
- Ye Shiwen (CHN) – 7.50%
- Sydney Pickrem (CAN) – 5.33%
- Yui Ohashi (JPN) – 3.00%
- Rika Omoyo (JPN) – 1.77%
- Melanie Margalis (USA) – 1.11%
- Alex Walsh (USA) – 0.93%
400m ind. medley
- Katinka Hosszu (HUN) – 47.47%
- Ye Shiwen (CHN) – 21.10%
- Yui Ohashi (JPN) – 18.76%
- Shiho Matsumoto (JPN) – 8.62%
- Mireia Belmonte (ESP) – 1.60%
- Emma Weyant (USA) – 0.48%
- Sydney Pickrem (CAN) – 0.47%
- Brooke Forde (USA) – 0.29%
4x100m freestyle relay
- Australia – 63.87%
- United States – 35.24%
- Canada – 0.47%
- France – 0.17%
- Netherlands – 0.08%
- Great Britain – 0.05%
- Japan – 0.04%
- China – 0.04%
4x200m freestyle relay
- Australia – 48.16%
- United States – 37.67%
- China – 8.91%
- Canada – 3.02%
- Japan – 1.09%
- Russia – 0.56%
- Italy – 0.40%
- Great Britain – 0.08%
4x100m medley relay
- United States – 80.31%
- Australia – 9.78%
- Canada – 8.48%
- Russia – 0.67%
- Italy – 0.32%
- Japan – 0.32%
- Sweden – 0.06%
- Great Britain – 0.03%
Men’s events
50m freestyle
- Caeleb Dressel (USA) – 68.38%
- Bruno Fratus (BRA) – 11.76%
- Vladimir Morozov (RUS) – 7.41%
- Benjamin Proud (GBR) – 6.02%
- Kristian Gkolomeev (GRE) – 3.63%
- Andrea Vergani (ITA) – 1.22%
- Florent Manaudou (FRA) – 0.68%
- Michael Andrew (USA) – 0.45%
100m freestyle
- Caeleb Dressel (USA) – 41.03%
- Kyle Chalmers (AUS) – 39.59%
- Vladislav Grinev (RUS) – 8.56%
- Ryan Held (USA) – 4.47%
- Maxime Rooney (USA) – 1.53%
- Marcelo Chierighini (BRA) – 1.20%
- Zach Apple (USA) – 1.04%
- Vladimir Morozov (RUS) – 0.42%
200m freestyle
- Danas Rapsys (LTU) – 33.37%
- Duncan Scott (GBR) – 16.10%
- Sun Yang (CHN) – 10.85%
- Katsuhiro Matsumoto (JPN) – 9.47%
- Clyde Lewis (AUS) – 6.81%
- Martin Malyutin (RUS) – 6.18%
- Kyle Chalmers (AUS) – 2.73%
- Ji Xinjie (CHN) – 2.30%
400m freestyle
- Sun Yang (CHN) – 37.10%
- Mack Horton (AUS) – 24.04%
- Gabriele Detti (ITA) – 22.41%
- Danas Rapsys (LTU) – 5.99%
- Jack McLoughlin (AUS) – 2.64%
- Elijah Winnington (AUS) – 1.69%
- Marco De Tullio (ITA) – 1.61%
- Alexander Krasnykh (RUS) – 0.68%
800m freestyle
- Gregorio Paltrinieri (ITA) – 45.87%
- Henrik Christiansen (NOR) – 17.98%
- Gabriele Detti (ITA) – 10.65%
- David Aubry (FRA) – 5.74%
- Mykhailo Romanchuk (UCR) – 4.60%
- Jack McLoughlin (AUS) – 4.46%
- Florian Wellbrock (GER) – 3.62%
- Sun Yang (CHN) – 2.84%
1500m freestyle
- Florian Wellbrock (GER) – 31.31%
- Gregorio Paltrinieri (ITA) – 30.81%
- Mykhaulo Romanchuk (UKR) – 25.99%
- David Aubry (FRA) – 2.27%
- Henrik Christiansen (NOR) – 2.05%
- Franko Grgic (CRO) – 1.45%
- Daniel Jervis (GBR) – 1.39%
- Alexander Norgaard (DEN) – 1.03%
100m backstroke
- Evgeny Rylov (RUS) – 43.43%
- Xu Jiayu (CHN) – 25.59%
- Mitchell Larkin (AUS) – 14.28%
- Ryan Murphy (USA) – 11.86%
- Matt Grevers (USA) – 1.81%
- Ryosuke Irie (JPN) – 0.86%
- Shaine Casas (USA) – 0.79%
- Kliment Kolesnikov (RUS) – 0.73%
200m backstroke
- Evgeny Rylov (RUS) – 54.55%
- Ryan Murphy (USA) – 33.90%
- Xu Jiayu (CHN) – 5.41%
- Mitchell Larkin (AUS) – 2.57%
- Ryosuke Irie (JPN) – 1.03%
- Luke Greenbank (GBR) – 0.98%
- Austin Katz (USA) – 0.35%
- Keita Sunama (JPN) – 0.24%
100m breaststroke
- Adam Peaty (GBR) – 97.09%
- James Wilby (GBR) – 1.02%
- Ilya Shymanovich (BLR) – 0.70%
- Yan Zibei (CHN) – 0.56%
- Arno Kamminga (NED) – 0.22%
- Anton Chupkov (RUS) – 0.11%
- Nicolo Martinenghi (ITA) – 0.08%
- Yasuhiro Koseki (JPN) – 0.05%
200m breaststroke
1. Anton Chupkov (RUS) – 46.71%
2. Matthew Wilson (AUS) – 17.69%
3. Ippei Watanabe (JPN) – 16.62%
4. Zac Stubblety-Cook (AUS) – 3.40%
5. James Wilby (GBR) – 2.04%
6. Marco Koch (GER) – 1.87%
7. Will Licon (USA) – 1.61%
8. Josh Prenot (USA) – 1.39%
100m butterfly
- Caeleb Dressel (USA) – 93.24%
- Chad Le Clos (RSA) – 2.09%
- Andrei Minakov (RUS) – 1.79%
- Maxime Rooney (USA) – 1.33%
- Mehdy Metella (FRA) – 0.54%
- Kristof Milak (HUN) – 0.25%
- Jack Conger (USA) – 0.15%
- Grant Irvine (AUS) – 0.12%
200m butterfly
- Kristof Milak (HUN) – 83.54%
- Daiya Seto (JPN) – 13.54%
- Tamas Kenderesi (HUN) – 1.32%
- Chad Le Clos (RSA) – 1.00%
- Luca Urlando (USA) – 0.41%
- Nao Horomura (JPN) – 0.06%
- Federico Burdisso (ITA) – 0.05%
- Denys Kesyl (UKR) – 0.04%
200m ind. medley
- Daiya Seto (JPN) – 41.24%
- Chase Kalisz (USA) – 19.81%
- Mitchell Larkin (AUS) – 15.77%
- Wang Shun (CHN) – 7.88%
- Jeremy Desplanches (SUI) – 6.61%
- Duncan Scott (GBR) – 2.21%
- Philip Heintz (GER) – 1.88%
- Qin Haiyang (CHN) – 1.73%
400m ind. medley
- Daiya Seto (JPN) – 53.21%
- Jay Litherland (USA) – 19.05%
- Wang Shun (CHN) – 7.04%
- Chase Kalisz (USA) – 5.24%
- Qin Haiyang (CHN) – 3.80%
- Lewis Clareburt (NZL) – 2.55%
- Max Litchfield (GBR) – 2.12%
- David Verraszto (HUN) – 1.59%
4x100m freestyle relay
- United States – 86.00%
- Russia – 7.48%
- Australia – 4.04%
- Brazil – 1.75%
- Italy – 0.24%
- Japan – 0.23%
- Great Britain – 0.13%
- China – 0.05%
4x200m freestyle relay
- Australia – 43.47%
- United States – 27.87%
- Russia – 17.77%
- Great Britain – 5.69%
- China – 4.27%
- Brazil – 0.54%
- Japan – 0.25%
- Italy – 0.09%
4x100m medley relay
- United States – 76.70%
- Russia – 12.77%
- Australia – 3.74%
- Great Britain – 3.50%
- China – 1.26%
- Japan – 1.08%
- Italy – 0.47%
- Brazil – 0.46%
Mixed Events
4x100m medley relay
- United States – 86.61%
- Australia – 8.25%
- Great Britain – 2.71%
- Russia – 1.22%
- China – 0.37%
- Italy – 0.27%
- Japan – 0.27%
- Canada – 0.19%
You can keep up to date with the probabilities here.
About Daniel Takata Gomes
Daniel Takata is editor of Swim Channel Magazine from Brazil, has a PhD in Statistics and is a college professor. He also works as a swimming TV commentator on SporTV and holds a website dedicated to sports analytics, Esportístico (www.esportistico.com.br).
What about Park Tae Hwan?
Sjöström will not lose two 100 fly finals in a row. I’d give her a 70% chance of winning the gold in Tokyo, stats be damned.
56.71 in Luxembourg is Sarah’s second best January time ever. I’m guessing that her training for the 200 free last year was mostly done to increase her endurance, in preparation for the 100 fly at the upcoming Olympics. She’s not swimming the 200 free in Tokyo.
Gold in 2013, 2015, 2016, 2017. It will be business as usual again in 2020. 🙂
I would not leave out Baker from the 200 IM. She had a very unlucky year in 2019. Both in 2018 and this year she had sub 2:09 and has the potential to further improve if she focuses on IM. Go Kathleen 🙂
The statistics deems to take the best and worst times equally important and ignore that swimmers prepare specially for the olympics. In most cases the winners produce close to their lifetime best or better. For instance Hosszu had a few poor times lately in Luxemburg, but IMHO this is totally irrelevant for her chances in Tokyo
A great piece of research and very interesting. It would be good to run a similar analysis on likelihood of a medal (not just take the top 3 out of the Gold medal predictions). I will be watching this with interest as we track through the season and the various country’s Trials Meets.
Let’s say that, in 1,000 trials, Manuel has the best time of all swimmers in 800, so she has an 8% chance of winning.
How does she has an 8% chance of winning if she wins 800 out of 1000?
Poor Cody.
M Mede