At what level of player would xg =goals? I’m still trying to understand these data models 🥹 Am I right I’m saying a human analyst is looking at every game and makes their xg % guess on a shot taken based on their own interpretation of ‘the probability of an average player’? So is any data between 2 x games comparable?
My question is more about the data collection- is it a human review of a shot, and then based on where the ball was when the shot was taken etc, a number/goal probability is manually attached to that action by the reviewer? As Alan does for each Celtic game?
Great post and really good to see contributions from the community on the site. If I find anything useful to say I will try and contribute in coming months.
Thanks a lot Stephen. Your focus on G/xG is very interesting. Any data-focused content I have consumed (including HB) usually suggests out- or under-performing xG is basically luck, unless you're Messi or Haaland. Do you think there is more to consider in terms of quality of shot, rather than just quality of chance? Sounds like you do, but interested in thoughts.
I came at this from two directions, Liam. The first was the pretty strong feeling that, in the clearest circumstance, a one-on-one between the striker and the goalie, there are some strikers who you’d always trust to score, and some that have you clenching everything until you see the outcome. The other viewpoint was simply that if a mathematical model describes the average behaviour of a population, then it won’t fit so well for cases that are far from that mean. Footballers come in a very wide range of capabilities, so I expected to see deviations from the model behaviour for players and teams that are notably above or below average.
G/xG seems to me a very natural way to assess this as it eliminates confusers such as volume of chances (eg a player scoring +2 over an xG of 10 is doing better than one scoring +2 over an xG of 30), and is relatively easy to interpret as being, say, 12% better than average or 30% worse.
Since this sort of data has become available, I’ve kept an eye on G/xG. You mentioned Messi, and he was indeed the first player I ever looked at. It’ll come as no surprise that he outguns xG every time. Of course he does; his balance and technique are superhuman. Messi is exceptional in every way, but think about all the goals James Forrest has scored from just inside the penalty area and then consider all the fullbacks who have shot from the same place and fairly reliably put the ball into the stand. Finishing is a critical skill and it’s real. Players like Harry Kane and Robert Lewandowski tend to have G/xG>1 and they still attract a huge premium well into their thirties. It’s not the only thing about being a striker, but it’s certainly important.
If you look at G/xG for teams in leagues with good data collection (eg from the Big 5 leagues on Opta), the ones at the top tend to have G/xG>1 and the ones at the bottom usually have G/xG<1. Of course, there are exceptions, and that’s interesting too.
So, xG is noisy and you have to be careful with it, but I think there’s plenty of evidence that we can relate G/xG to performance and use it as a differentiating metric.
At what level of player would xg =goals? I’m still trying to understand these data models 🥹 Am I right I’m saying a human analyst is looking at every game and makes their xg % guess on a shot taken based on their own interpretation of ‘the probability of an average player’? So is any data between 2 x games comparable?
I believe that most xG models are based on large numbers of observations, so the models are empirical rather than theoretical.
My question is more about the data collection- is it a human review of a shot, and then based on where the ball was when the shot was taken etc, a number/goal probability is manually attached to that action by the reviewer? As Alan does for each Celtic game?
Great post and really good to see contributions from the community on the site. If I find anything useful to say I will try and contribute in coming months.
The that will be most welcome Michael
Thanks a lot Stephen. Your focus on G/xG is very interesting. Any data-focused content I have consumed (including HB) usually suggests out- or under-performing xG is basically luck, unless you're Messi or Haaland. Do you think there is more to consider in terms of quality of shot, rather than just quality of chance? Sounds like you do, but interested in thoughts.
Cheers
I came at this from two directions, Liam. The first was the pretty strong feeling that, in the clearest circumstance, a one-on-one between the striker and the goalie, there are some strikers who you’d always trust to score, and some that have you clenching everything until you see the outcome. The other viewpoint was simply that if a mathematical model describes the average behaviour of a population, then it won’t fit so well for cases that are far from that mean. Footballers come in a very wide range of capabilities, so I expected to see deviations from the model behaviour for players and teams that are notably above or below average.
G/xG seems to me a very natural way to assess this as it eliminates confusers such as volume of chances (eg a player scoring +2 over an xG of 10 is doing better than one scoring +2 over an xG of 30), and is relatively easy to interpret as being, say, 12% better than average or 30% worse.
Since this sort of data has become available, I’ve kept an eye on G/xG. You mentioned Messi, and he was indeed the first player I ever looked at. It’ll come as no surprise that he outguns xG every time. Of course he does; his balance and technique are superhuman. Messi is exceptional in every way, but think about all the goals James Forrest has scored from just inside the penalty area and then consider all the fullbacks who have shot from the same place and fairly reliably put the ball into the stand. Finishing is a critical skill and it’s real. Players like Harry Kane and Robert Lewandowski tend to have G/xG>1 and they still attract a huge premium well into their thirties. It’s not the only thing about being a striker, but it’s certainly important.
If you look at G/xG for teams in leagues with good data collection (eg from the Big 5 leagues on Opta), the ones at the top tend to have G/xG>1 and the ones at the bottom usually have G/xG<1. Of course, there are exceptions, and that’s interesting too.
So, xG is noisy and you have to be careful with it, but I think there’s plenty of evidence that we can relate G/xG to performance and use it as a differentiating metric.
Thanks a lot Stephen 😊
Would love Alan and James's view on this, as I'm not sure quality of finishing is ever something that has been given much credence.
We'll take a look on the pod recorded this evening
🐄 🪕😄