10 Comments
User's avatar
Graham Cook's avatar

At what level of player would xg =goals? I’m still trying to understand these data models 🥹 Am I right I’m saying a human analyst is looking at every game and makes their xg % guess on a shot taken based on their own interpretation of ‘the probability of an average player’? So is any data between 2 x games comparable?

Expand full comment
Stephen McGeoch's avatar

I believe that most xG models are based on large numbers of observations, so the models are empirical rather than theoretical.

Expand full comment
Graham Cook's avatar

My question is more about the data collection- is it a human review of a shot, and then based on where the ball was when the shot was taken etc, a number/goal probability is manually attached to that action by the reviewer? As Alan does for each Celtic game?

Expand full comment
Michael Martin  Shields's avatar

Great post and really good to see contributions from the community on the site. If I find anything useful to say I will try and contribute in coming months.

Expand full comment
Alan Morrison's avatar

The that will be most welcome Michael

Expand full comment
Liam Osborn's avatar

Thanks a lot Stephen. Your focus on G/xG is very interesting. Any data-focused content I have consumed (including HB) usually suggests out- or under-performing xG is basically luck, unless you're Messi or Haaland. Do you think there is more to consider in terms of quality of shot, rather than just quality of chance? Sounds like you do, but interested in thoughts.

Cheers

Expand full comment
Stephen McGeoch's avatar

I came at this from two directions, Liam. The first was the pretty strong feeling that, in the clearest circumstance, a one-on-one between the striker and the goalie, there are some strikers who you’d always trust to score, and some that have you clenching everything until you see the outcome. The other viewpoint was simply that if a mathematical model describes the average behaviour of a population, then it won’t fit so well for cases that are far from that mean. Footballers come in a very wide range of capabilities, so I expected to see deviations from the model behaviour for players and teams that are notably above or below average.

G/xG seems to me a very natural way to assess this as it eliminates confusers such as volume of chances (eg a player scoring +2 over an xG of 10 is doing better than one scoring +2 over an xG of 30), and is relatively easy to interpret as being, say, 12% better than average or 30% worse.

Since this sort of data has become available, I’ve kept an eye on G/xG. You mentioned Messi, and he was indeed the first player I ever looked at. It’ll come as no surprise that he outguns xG every time. Of course he does; his balance and technique are superhuman. Messi is exceptional in every way, but think about all the goals James Forrest has scored from just inside the penalty area and then consider all the fullbacks who have shot from the same place and fairly reliably put the ball into the stand. Finishing is a critical skill and it’s real. Players like Harry Kane and Robert Lewandowski tend to have G/xG>1 and they still attract a huge premium well into their thirties. It’s not the only thing about being a striker, but it’s certainly important.

If you look at G/xG for teams in leagues with good data collection (eg from the Big 5 leagues on Opta), the ones at the top tend to have G/xG>1 and the ones at the bottom usually have G/xG<1. Of course, there are exceptions, and that’s interesting too.

So, xG is noisy and you have to be careful with it, but I think there’s plenty of evidence that we can relate G/xG to performance and use it as a differentiating metric.

Expand full comment
Liam Osborn's avatar

Thanks a lot Stephen 😊

Would love Alan and James's view on this, as I'm not sure quality of finishing is ever something that has been given much credence.

Expand full comment
Alan Morrison's avatar

We'll take a look on the pod recorded this evening

Expand full comment
Stephen McGeoch's avatar

🐄 🪕😄

Expand full comment