Expected Goals from Long Shots and Distance Metrics: Risk Analysis
You’re watching a match, and your team’s midfielder winds up from 30 yards out. The crowd holds its breath. The ball rockets toward the top corner, and the goalkeeper tips it over. It’s a moment of pure drama. But when you check the expected goals (xG) model later, that shot is valued at 0.03. Three percent chance of scoring? That feels like an insult to the spectacle. This tension—between the emotional weight of a long-range strike and the cold numbers of statistical models—is exactly where this guide lives. If you’ve ever wondered why your favorite long-shot taker seems undervalued by analytics, or why a team that launches shots from distance keeps losing, you’re in the right place. Here, we’ll break down the real risks embedded in distance metrics, show you how to spot when xG is misleading you, and offer practical steps to use this data without falling into common traps.
Why Long Shots Mess with Your xG Understanding
The core problem is simple: expected goals models are built on probability. A shot from 6 yards out in the center of the goal has a high xG because, historically, most of those shots go in. A shot from 35 yards out has a very low xG because, historically, almost none of those go in. But here’s the rub—long shots are high-variance events. A player might go an entire season without scoring from distance, then smash in a worldie in a cup final. The xG model will tell you that player was unlucky or lucky, but it won’t tell you whether that player has a special skill for long-range strikes.
This creates a troubleshooting scenario: you see a player with a low xG per shot but a high actual goal tally from long range. Is that sustainable? Is the model wrong? Or is the player genuinely exceptional? The answer often lies in the context of the shot.
The Real Problem: Sample Size and Skill
Most xG models treat all shots from a given distance zone as roughly equal. But a 30-yard free kick taken by a dead-ball specialist is not the same as a 30-yard hopeful punt from a center-back. The model doesn’t know the difference unless it incorporates additional variables like shot type (free kick, open play, header), angle, and defensive pressure. This is where the risk analysis kicks in: if you’re using raw distance metrics alone, you’re baking in a lot of noise.
Common user scenarios where this goes wrong:
- You’re evaluating a midfielder who takes a lot of long shots. Their xG per 90 is low, but they score a few screamers. You think they’re a hidden gem. In reality, they might be wasting possession.
- You’re comparing two teams. One takes many long shots, the other works the ball into the box. The long-shot team has a lower xG, but they win a match on a wonder goal. You conclude the xG model is broken.
- You’re using distance metrics to predict future performance. A player overperforms their xG from long range. You expect regression, but they keep scoring. You’re stuck.
Step-by-Step Troubleshooting Guide
Let’s walk through how to diagnose and address these issues. This is not about discarding xG—it’s about using it smarter.
Step 1: Separate Shot Types by Distance Bands
Don’t lump all “long shots” together. Create at least three distance bands:
- Inside the box (6-18 yards): High xG, low variance.
- Edge of the box (18-25 yards): Medium xG, moderate variance.
- Long range (25+ yards): Low xG, high variance.
Step 2: Add Shot Context Variables
Look for data that includes:
- Shot type: Is it a free kick, a volley, or a driven shot?
- Defensive pressure: Was the shot under pressure or with time to set?
- Angle: Is it central or from a wide position?
Step 3: Check for Regression Patterns
Plot a rolling average of the player’s xG from long shots over a season. If their actual goals are consistently above xG, but the xG itself is stable, you’re looking at a potential skill. If the xG spikes and drops wildly, it’s likely noise. Use a simple moving average over 10-match windows to smooth out variance.
Example: A midfielder takes 40 long-range shots in a season. Their xG from those shots is 2.0. They score 4 goals. That’s a 2-goal overperformance. Over 40 shots, that’s not crazy—but if it happens three seasons in a row, you’re onto something.
Step 4: Contextualize with Team Tactics
Long shots are often a symptom of a team that can’t break down a low block. If a team takes 15 long shots per game, their xG might be low, but the real issue is their inability to create high-quality chances. This is where Defensive Duels Winning Rate and Positioning Metrics become relevant. A team that wins few duels in the final third will naturally resort to long shots. The risk isn’t just the low xG—it’s the opportunity cost of not creating better chances.
When the Problem Requires a Specialist
Not every issue can be solved with a spreadsheet. Here are situations where you need to consult a data analyst or a tactical coach:
- You’re evaluating a player for a transfer. If a player has a high long-shot goal tally but low xG, a specialist can run a Bayesian analysis to estimate the probability that the skill is real versus luck. This is beyond basic xG models.
- You’re building a team strategy. If you want to decide whether to encourage or discourage long shots, you need a full possession and chance creation model. A specialist can simulate how reducing long shots affects overall xG.
- You’re dealing with a goalkeeper’s performance. Long shots are often the domain of goalkeepers’ shot-stopping metrics. A specialist can separate a goalkeeper’s ability to save long shots from the randomness of those shots.
The Key Takeaway: Distance Metrics Are a Starting Point, Not a Conclusion
Expected goals from long shots and distance metrics are powerful tools, but they carry a high risk of misinterpretation. The variance is real, the sample sizes are small, and the context is everything. When you see a player or team overperforming from range, don’t jump to conclusions. Run the steps above, look for patterns over time, and always ask: is this skill or luck? And if you’re still stuck, remember that the Player and Team Statistics hub is your first stop for grounding your analysis in broader data.
The next time a midfielder winds up from 30 yards, enjoy the moment. But when you’re analyzing the numbers, treat that shot with the skepticism it deserves. The risk is real, but so is the reward—if you know how to read the data.
