This methodology article is for an old version of our club soccer forecasts. See how our latest club soccer predictions work.
We’ve launched a big expansion of our soccer predictions here at FiveThirtyEight. We’re now forecasting 24 club soccer leagues, with two more — Champions League and Europa League — to come in a few weeks. We’ve added leagues from South America (Brasileirão and the Argentine Superliga), along with 14 new European leagues, including five second-tier leagues, such as the English Championship.
For the most part, the methodology behind our forecasts is the same as last year’s. We’re still using four metrics from each match — goals scored, adjusted goals, shot-based expected goals and non-shot expected goals — to evaluate team performances. Those evaluations are expressed as offensive and defensive ratings for each team. And those ratings, in turn, let us calculate win/loss/draw probabilities for future matches and simulate the season thousands of times to estimate each team’s chances of winning the title.
All our significant changes are in how we assess the relative strength of domestic leagues. The goal was to improve our forecasts for the Champions League and Europa League and to better compare clubs in different countries — say, Juventus in Italy to Ajax in the Netherlands. We’re using recent matches played between teams from different leagues, supplemented with league market values (from Transfermarkt), to assign a strength rating to every league that we’re forecasting. Our new league ratings also give us the ability to a calculate a global Soccer Power Index (SPI) rating for each team — a number from 0 to 100 that represents the overall strength of each team.
We’ll get into more detail about that methodology below, but to start, here are our league strengths for most of the top-tier domestic leagues in Europe, North America and South America:
Relative strength of club soccer leagues from Europe, North America and South America, by country
Based on matches played in the past five years and the current market value of each league
|STRENGTH BASED ON …|
There aren’t too many surprises at the top; out of the five biggest European leagues, four are in the top five, with La Liga in Spain and the Bundesliga in Germany pretty far ahead of the pack. The Premier League in England, despite being the most valuable league in the world, has struggled to compete in the Champions League recently, and their match-based rating lags far behind their market value rating. Another league whose recent performance according to our ratings has lagged behind their market value is Major League Soccer in the U.S., whose last CONCACAF Champions League title was in 2000. MLS shows up in 28th place, between the Danish and Croatian leagues and a long way behind their regional rivals Liga MX in Mexico.
To generate these league strength ratings, we’ve set up a system where we first assume that all leagues are of equal strength and determine how far above or below expectation each league has performed over the past five years. In order, we:
- Run through all domestic matches in history and calculate domestic team SPI ratings throughout time.
- Look at each inter-league match from the past five years and calculate the expected score of the match based purely on each team’s domestic rating at the time.
- Take the difference between our expected score of the match and the actual score and run these results through Massey’s Method to find a rating for each league, expressed in how many goals better or worse than average that league is.
- Regress these calculated ratings toward market-value based ratings, weighted by how many inter-league matches we have for each league.
- Run through all matches in history again, incorporating league strengths into the predictions for any inter-league matches to improve the final team ratings.
Our league strengths can be interpreted as a bonus (in goals) given to each team in an inter-league match. So, for example, if Real Madrid (league strength of 2.01) were playing PSG (league strength of 1.48) and Real Madrid were a 0.2 goal favorite based only on their domestic SPI ratings, our model would give Real Madrid an extra 0.53 goal bonus because of the difference in the two teams’ league strengths.
A nice feature of these league strength ratings is that they let us generate global SPI ratings for any club team in the world. These global SPI ratings are a combination of each team’s domestic SPI rating and the strength of the league they play in. To generate them, we set up a mock match on neutral ground against a team with a domestic offensive and defensive SPI rating of 1.35 that plays in a league with strength of 1.25. (These values are just arbitrary ratings that we use for a baseline team against which we can compare any other team.) We calculate the number of goals we expect each team to score in the match as well as the chances of each team winning. So each team’s global SPI ratings can be interpreted as follows:
Offensive SPI: the number of goals the team is expected to score in such a match
Defensive SPI: the number of goals the team is expected to concede in such a match
Overall SPI: the percentage of points the team is expected to take in such a match
We’ll be adding a list of all SPI-rated teams to our club soccer predictions soon. For now, you can find forecasts from 24 leagues. They’re available in three languages (English, Spanish and Portuguese) and will be updated after every match.