Quintessa

How did Quintessa’s Sports Rating Algorithm Perform for the 2022 FIFA World Cup?

Quintessa mathematicians and scientists enjoy analysing numbers. Over the last few months, Simon Rookyard and Jodie Stone have put Quintessa’s Sports Rating Algorithm to the test, providing predictions for the 2022 FIFA World Cup. Now that the competition is complete, it’s time to evaluate the algorithm’s performance.

It's the moment of truth! Having used our "N-Estimates" algorithm to predict the results for all the games of the FIFA World Cup 2022, it’s time to evaluate its performance. Previously, each prediction was accompanied by a plot displaying all the possible scoreline probabilities with the most likely outcome embellished by a cross. These plots have been updated with a green scoreline representing the actual result. The final plots for all the 2022 FIFA World Cup matches can be found at the end of this news story.

When assessing the performance of the algorithm during the competition, we have considered three performance metrics:

  • the percentage of correct outcome (win/draw/loss) predictions;
  • the percentage of correct goal difference predictions;
  • and the percentage of correct exact scoreline predictions.

As the data used to train the algorithm consist primarily of matches that finish after 90 minutes (regardless of the score), it is most appropriate to compare our predictions against the scores at the end of normal time (this choice, as opposed to using scores after extra time, only made a difference in one match during the tournament). Figure 1 compares the algorithm’s performance over the whole of the 2022 World Cup competition against a benchmark in each of our three metrics. The benchmarks used for the correct outcome, goal difference and scoreline have been generated by a Monte Carlo simulation and represent the expected success rates if scores had been predicted by chance.

A horizontal bar chart showing the performance of the algorithm against three metrics, top to bottom, Correct Outcome, Correct Goal Difference, and Correct Scoreline. For each metric, three bars are plotted, showing the percentage of matches correctly predicted by (i) a statistical benchmark, (ii) the Quintessa N-Estimates Algorithm, and (iii) Chris Sutton, a BBC pundit. The data plotted are as follows. Correct Outcome: 34%, 56%, 56%. Correct Goal Difference: 16%, 25%, 27%. Correct Scoreline: 7%, 16%, 16%.

Figure 1: Metrics of algorithm performance across all the games played in the 2022 FIFA World Cup. View in full resolution.

We can see that the algorithm performed well in predicting the correct outcomes, goal differences and scorelines. The correct outcome (not including penalties) was predicted in 56% of matches, approximately 22% more than we would expect by chance. The correct goal difference and correct scoreline were predicted in approximately 9% and 8% more matches (respectively) than expected by chance. A closer investigation reveals that the success rate of N-Estimates predicted match outcomes lies 3.7 standard deviations above the expectation of chance, 1.9 standard deviations above the expectation for predicted goal difference and 2.5 standard deviations above the expectation for predicted scorelines.

It is also interesting to compare the algorithm’s performance with that of an expert. BBC pundit Chris Sutton predicted the result of each match during the tournament, and his success rates against our three metrics are also shown in Figure 1. It was very close, with both Chris and N-Estimates making the same number of correct outcome and scoreline predictions. Chris just pipped the algorithm by one correct goal difference prediction. Congratulations Chris!

Links to Chris Sutton’s predictions: Group Stages 1, Group Stages 2, Group Stages 3, Last 16, Quarter Finals, Semi Finals, Final & Third Place Playoff

There is scope for further improvement of N-Estimates. There appears to be a source of variability in the results that has not been captured by the algorithm; as the algorithm was tuned predominantly to qualifying matches for the continental championships and World Cups, this implies a source of variability that is only significant at major competitions. One possible cause is the lack of inter-continent games – the vast majority of matches in the training data set are between teams from the same continent. This could lead to a shortage of data with which the algorithm can judge the relative average strengths of different continents. One response to this could be to introduce a two-tier rating system, in which teams are rated within their continent and the few intercontinental matches in the dataset are used to rate continents against each other. Given that the number of continents is much lower than the number of teams, the algorithm should, in principle, be able to make a better assessment of continent ratings with such a sparse dataset.

November 20 2022, Qatar vs. Ecuador. Central normal time prediction: 0 - 2. Confidence range for goal difference (Qatar minus Ecuador): -3 to 0. Actual normal time result: 0 - 2.November 21 2022, Senegal vs. Netherlands. Central normal time prediction: 0 - 4. Confidence range for goal difference (Senegal minus Netherlands): -6 to -3. Actual normal time result: 0 - 2.November 21 2022, England vs. Iran. Central normal time prediction: 1 - 0. Confidence range for goal difference (England minus Iran): 0 to 3. Actual normal time result: 6 - 2.November 21 2022, USA vs. Wales. Central normal time prediction: 0 - 1. Confidence range for goal difference (USA minus Wales): -1 to 0. Actual normal time result: 1 - 1.November 22 2022, Denmark vs. Tunisia. Central normal time prediction: 3 - 0. Confidence range for goal difference (Denmark minus Tunisia): 1 to 4. Actual normal time result: 0 - 0.November 22 2022, Argentina vs. Saudi Arabia. Central normal time prediction: 2 - 0. Confidence range for goal difference (Argentina minus Saudi Arabia): 1 to 3. Actual normal time result: 1 - 2.November 22 2022, Mexico vs. Poland. Central normal time prediction: 0 - 1. Confidence range for goal difference (Mexico minus Poland): -1 to 0. Actual normal time result: 0 - 0.November 22 2022, France vs. Australia. Central normal time prediction: 2 - 0. Confidence range for goal difference (France minus Australia): 1 to 3. Actual normal time result: 4 - 1.November 23 2022, Spain vs. Costa Rica. Central normal time prediction: 2 - 0. Confidence range for goal difference (Spain minus Costa Rica): 1 to 3. Actual normal time result: 7 - 0.November 23 2022, Germany vs. Japan. Central normal time prediction: 3 - 0. Confidence range for goal difference (Germany minus Japan): 1 to 5. Actual normal time result: 1 - 2.November 23 2022, Belgium vs. Canada. Central normal time prediction: 1 - 0. Confidence range for goal difference (Belgium minus Canada): 0 to 2. Actual normal time result: 1 - 0.November 23 2022, Morocco vs. Croatia. Central normal time prediction: 0 - 2. Confidence range for goal difference (Morocco minus Croatia): -3 to 0. Actual normal time result: 0 - 0.November 24 2022, Portugal vs. Ghana. Central normal time prediction: 4 - 0. Confidence range for goal difference (Portugal minus Ghana): 2 to 6. Actual normal time result: 3 - 2.November 24 2022, Brazil vs. Serbia. Central normal time prediction: 1 - 0. Confidence range for goal difference (Brazil minus Serbia): 0 to 2. Actual normal time result: 2 - 0.November 24 2022, Switzerland vs. Cameroon. Central normal time prediction: 3 - 0. Confidence range for goal difference (Switzerland minus Cameroon): 2 to 5. Actual normal time result: 1 - 0.November 24 2022, Uruguay vs. South Korea. Central normal time prediction: 1 - 0. Confidence range for goal difference (Uruguay minus South Korea): -1 to 2. Actual normal time result: 0 - 0.November 25 2022, Qatar vs. Senegal. Central normal time prediction: 1 - 0. Confidence range for goal difference (Qatar minus Senegal): 1 to 2. Actual normal time result: 1 - 3.November 25 2022, England vs. USA. Central normal time prediction: 2 - 1. Confidence range for goal difference (England minus USA): 0 to 3. Actual normal time result: 0 - 0.November 25 2022, Netherlands vs. Ecuador. Central normal time prediction: 1 - 0. Confidence range for goal difference (Netherlands minus Ecuador): 0 to 3. Actual normal time result: 1 - 1.November 25 2022, Wales vs. Iran. Central normal time prediction: 1 - 0. Confidence range for goal difference (Wales minus Iran): 0 to 2. Actual normal time result: 0 - 2.November 26 2022, Argentina vs. Mexico. Central normal time prediction: 2 - 0. Confidence range for goal difference (Argentina minus Mexico): 1 to 3. Actual normal time result: 2 - 0.November 26 2022, Saudi Arabia vs. Poland. Central normal time prediction: 0 - 1. Confidence range for goal difference (Saudi Arabia minus Poland): -1 to 1. Actual normal time result: 0 - 2.November 26 2022, France vs. Denmark. Central normal time prediction: 1 - 0. Confidence range for goal difference (France minus Denmark): -1 to 2. Actual normal time result: 2 - 1.November 26 2022, Australia vs. Tunisia. Central normal time prediction: 0 - 0. Confidence range for goal difference (Australia minus Tunisia): 0 to 1. Actual normal time result: 1 - 0.November 27 2022, Spain vs. Germany. Central normal time prediction: 1 - 1. Confidence range for goal difference (Spain minus Germany): -1 to 1. Actual normal time result: 1 - 1.November 27 2022, Belgium vs. Morocco. Central normal time prediction: 2 - 0. Confidence range for goal difference (Belgium minus Morocco): 1 to 4. Actual normal time result: 0 - 2.November 27 2022, Costa Rica vs. Japan. Central normal time prediction: 0 - 1. Confidence range for goal difference (Costa Rica minus Japan): -2 to 1. Actual normal time result: 1 - 0.November 27 2022, Canada vs. Croatia. Central normal time prediction: 0 - 1. Confidence range for goal difference (Canada minus Croatia): -2 to 1. Actual normal time result: 1 - 4.November 28 2022, Portugal vs. Uruguay. Central normal time prediction: 2 - 0. Confidence range for goal difference (Portugal minus Uruguay): 0 to 3. Actual normal time result: 2 - 0.November 28 2022, Ghana vs. South Korea. Central normal time prediction: 0 - 1. Confidence range for goal difference (Ghana minus South Korea): -2 to 0. Actual normal time result: 3 - 2.November 28 2022, Brazil vs. Switzerland. Central normal time prediction: 1 - 0. Confidence range for goal difference (Brazil minus Switzerland): 0 to 2. Actual normal time result: 1 - 0.November 28 2022, Serbia vs. Cameroon. Central normal time prediction: 3 - 0. Confidence range for goal difference (Serbia minus Cameroon): 2 to 4. Actual normal time result: 3 - 3.November 29 2022, Qatar vs. Netherlands. Central normal time prediction: 0 - 4. Confidence range for goal difference (Qatar minus Netherlands): -6 to -2. Actual normal time result: 0 - 2.November 29 2022, Ecuador vs. Senegal. Central normal time prediction: 2 - 0. Confidence range for goal difference (Ecuador minus Senegal): 1 to 3. Actual normal time result: 1 - 2.November 29 2022, Wales vs. England. Central normal time prediction: 0 - 1. Confidence range for goal difference (Wales minus England): -2 to 1. Actual normal time result: 0 - 3.November 29 2022, Iran vs. USA. Central normal time prediction: 0 - 1. Confidence range for goal difference (Iran minus USA): -1 to 1. Actual normal time result: 0 - 1.November 30 2022, Argentina vs. Poland. Central normal time prediction: 1 - 0. Confidence range for goal difference (Argentina minus Poland): 0 to 2. Actual normal time result: 2 - 0.November 30 2022, Saudi Arabia vs. Mexico. Central normal time prediction: 0 - 0. Confidence range for goal difference (Saudi Arabia minus Mexico): -1 to 1. Actual normal time result: 1 - 2.November 30 2022, France vs. Tunisia. Central normal time prediction: 3 - 0. Confidence range for goal difference (France minus Tunisia): 2 to 4. Actual normal time result: 0 - 1.November 30 2022, Australia vs. Denmark. Central normal time prediction: 0 - 2. Confidence range for goal difference (Australia minus Denmark): -3 to 0. Actual normal time result: 1 - 0.December 01 2022, Spain vs. Japan. Central normal time prediction: 2 - 0. Confidence range for goal difference (Spain minus Japan): 0 to 3. Actual normal time result: 1 - 2.December 01 2022, Costa Rica vs. Germany. Central normal time prediction: 0 - 3. Confidence range for goal difference (Costa Rica minus Germany): -4 to -1. Actual normal time result: 2 - 4.December 01 2022, Belgium vs. Croatia. Central normal time prediction: 1 - 1. Confidence range for goal difference (Belgium minus Croatia): -1 to 2. Actual normal time result: 0 - 0.December 01 2022, Canada vs. Morocco. Central normal time prediction: 1 - 0. Confidence range for goal difference (Canada minus Morocco): -1 to 2. Actual normal time result: 1 - 2.December 02 2022, Brazil vs. Cameroon. Central normal time prediction: 5 - 0. Confidence range for goal difference (Brazil minus Cameroon): 4 to 6. Actual normal time result: 0 - 1.December 02 2022, Serbia vs. Switzerland. Central normal time prediction: 1 - 1. Confidence range for goal difference (Serbia minus Switzerland): -1 to 1. Actual normal time result: 2 - 3.December 02 2022, Portugal vs. South Korea. Central normal time prediction: 3 - 0. Confidence range for goal difference (Portugal minus South Korea): 1 to 5. Actual normal time result: 1 - 2.December 02 2022, Ghana vs. Uruguay. Central normal time prediction: 0 - 2. Confidence range for goal difference (Ghana minus Uruguay): -3 to 0. Actual normal time result: 0 - 2.December 03 2022, Netherlands vs. USA. Central normal time prediction: 2 - 0. Confidence range for goal difference (Netherlands minus USA): 1 to 4. Actual normal time result: 3 - 1.December 03 2022, Argentina vs. Australia. Central normal time prediction: 2 - 0. Confidence range for goal difference (Argentina minus Australia): 1 to 3. Actual normal time result: 2 - 1.December 04 2022, England vs. Senegal. Central normal time prediction: 2 - 0. Confidence range for goal difference (England minus Senegal): 0 to 3. Actual normal time result: 3 - 0.December 04 2022, France vs. Poland. Central normal time prediction: 1 - 0. Confidence range for goal difference (France minus Poland): 0 to 1. Actual normal time result: 3 - 1.December 05 2022, Japan vs. Croatia. Central normal time prediction: 0 - 1. Confidence range for goal difference (Japan minus Croatia): -3 to 1. Actual normal time result: 1 - 1.December 05 2022, Brazil vs. SouthKorea. Central normal time prediction: 0 - 0. Confidence range for goal difference (Brazil minus SouthKorea): -1 to 1. Actual normal time result: 4 - 1.December 06 2022, Morocco vs. Spain. Central normal time prediction: 0 - 2. Confidence range for goal difference (Morocco minus Spain): -3 to -1. Actual normal time result: 0 - 0.December 06 2022, Portugal vs. Switzerland. Central normal time prediction: 1 - 0. Confidence range for goal difference (Portugal minus Switzerland): -1 to 2. Actual normal time result: 6 - 1.December 09 2022, Netherlands vs. Argentina. Central normal time prediction: 0 - 0. Confidence range for goal difference (Netherlands minus Argentina): 0 to 1. Actual normal time result: 2 - 2.December 09 2022, Croatia vs. Brazil. Central normal time prediction: 0 - 0. Confidence range for goal difference (Croatia minus Brazil): -1 to 0. Actual normal time result: 1 - 1.December 10 2022, Morocco vs. Portugal. Central normal time prediction: 0 - 2. Confidence range for goal difference (Morocco minus Portugal): -4 to 0. Actual normal time result: 1 - 0.December 10 2022, England vs. France. Central normal time prediction: 1 - 1. Confidence range for goal difference (England minus France): -1 to 1. Actual normal time result: 1 - 2.December 13 2022, Argentina vs. Croatia. Central normal time prediction: 0 - 0. Confidence range for goal difference (Argentina minus Croatia): -1 to 1. Actual normal time result: 3 - 0.December 14 2022, France vs. Morocco. Central normal time prediction: 1 - 0. Confidence range for goal difference (France minus Morocco): 0 to 2. Actual normal time result: 2 - 0.December 17 2022, Croatia vs. Morocco. Central normal time prediction: 1 - 0. Confidence range for goal difference (Croatia minus Morocco): 0 to 3. Actual normal time result: 2 - 1.December 18 2022, Argentina vs. France. Central normal time prediction: 1 - 1. Confidence range for goal difference (Argentina minus France): 0 to 1. Actual normal time result: 3 - 3.

Figure 2: Predictions and actual scores for every match in the tournament. For each plot, the circles represent possible final scores after 90 minutes, with the number of goals scored by each team plotted on the axes. Each circle has been colour coded to indicate the probability of that result occurring, with the most likely outcome marked with a black cross. The solid and dashed orange lines represent the mean and 1σ uncertainty range respectively for the predicted goal difference, and the dashed blue lines indicate a goal difference of zero. The actual result after 90 minutes has been shaded green.

Quintessa is not affiliated in any way with FIFA or the BBC. Its application of the N-estimates algorithm to the FIFA World Cup 2022 competition is an independent and non-commercial endeavour.