In the 69th minute of the most important derby in English football (that’s still the case right?) Manchester United lead Liverpool 1-0 with the game obviously still very much open.  An average team in Liverpool’s position would still expect to draw or win the game about 21% of the time, not ideal but nowhere near a lost cause.

With United on the attack and the ball in the final third Carrick plays a weighted ball through to Ander Herrera who latches onto it on the edge of the penalty area. Right here Gomez has a decision to make, he  can accept that his positioning wasn’t great but just try and track Herrera as quickly as possible,  or he can try and redeem himself with a last-ditch tackle near the byline.

Unfortunately for Liverpool Gomez attempted to make up for his poor positional play with a rash tackle on Herrera, fouling him in the process, and greatly diminishing Liverpool’s chances of getting anything from the game. It’s alright saying it was a stupid decision to dive in with hindsight, but let’s try and put a value on the decision Gomez made.

 

Say there are two different options for a defender in this position, he can either:

  1. Act aggressively – Attempt to win the ball, or block the ball by sliding in.
  2. Act conservatively – Stay on his feet and apply as much pressure to the forward as possible without possibility of fouling.

We can have a look at the possible outcomes for each scenario and try and evaluate the value of each action. We’re going to define value as the probability of a team scoring from a certain event, adjusting for the possibility of the opposition scoring as a result of the event’s outcome when possible.


 

Aggressive defending:

Going to try and split this up into 2:

  1. Value of a penalty
  2. Value of every other possible situation

Value of a penalty

  • Penalty given away- in the past 15 years in EPL 80% of penalties have been scored so

Value of a penalty = 0.8

Value of other situations

  • Player shoots – Shot conversion from wide penalty areas are converted about 5.6% of the time , more pressure probably reduces this probability, especially if the block rate is higher, so let’s say 4-5% 
  • Player crosses – Let’s assume high pressure on cross makes cross more difficult. A cross inside the box is converted about 5% of the time but let’s be generous and say pressure causes value to be about 4%
  • Player is tackled before he takes an offensive action – Three things could happen here
    • Tackle goes for a corner – 3%
    • Offense retains possession – about 2-3%
    •  Defense obtains possession – 0%

I’m going to assume they all happen at a similar rate and take the average of these at 2%

I haven’t got data on the probability that each of these events occur but they’re all within range of a value around 2-5%. Based on what seems right, just from a spectators standpoint, I’m going to assume again that these all occur at a similar rate and just find the average to get a value of 3.5%. Obviously I’m making pretty loose assumptions but I’ll show why this isn’t too important in this particular situation later.

Value of everything else = 0.035


Conservative defending

If a defender is conservative the possible outcomes are:

  • Player shoots – Shots from this area are converted about 5.6% of the time, let’s say 7% when not pressured.
  • Player crosses – Crosses in the box are usually converted at about 5% similar to shots from this area, let’s assume slightly more and again say 7% in box when not as pressured

Value of conservative play = 0.07

Since both possible outcomes have about the same chance of conversion the total value is equal to the conversion rate. We are assuming that being conservative makes the opposition twice as likely to score compared to when defending aggressively, I’d like to think this is an assumption which , if biased, is favoured towards aggressive defending.


Expected Value

Okay then let’s do some math. The expected value of each decision is the number of goals the opposition would expect to score given a certain defensive strategy. In this instance (defending) we would want the lowest number possible.

 

So the  expected value of playing aggressively is p(0.8) + q(0.035) where “p” is the probability of giving away a penalty, “q” the probability of not giving away a penalty, and therefore p+q=1. The expected value of playing conservatively is 0.07.

pen 4.6

From an expected value point of view it only makes sense to defend aggressively if p(0.8)+q(0.035)<0.07, or the chance of the opposition scoring is less than 7%.  It turns out that this is only true if “q” (probability of giving up a penalty) is less than 4.6% which seems extremely low.

Even if we assume aggressive defending decreases the value of attacking play to 2%, whilst reserved play increases it to 8% you’d need to have a less than 7.6% chance of conceding a penalty in order for being aggressive to make sense. So even when scoring is 4 times more likely when being conservative (which seems pretty extreme), you’d need to be very confident of not fouling in order for this to make sense

pen 7.6

To endeavour to find an approximate value for the proportion of tackles that end up as fouls I decided to collect data for tackles attempted, tackles made, and fouls for premier league defenders over the 2014/15 Premier League season. Now not every foul made is during a tackle, although a foul is an indicator of aggressive defensive play so I decided to include all fouls. With these assumptions I calculated proportion of fouls as.

Foul% = Fouls / (Fouls + Tackles attempted)

Foul

This worked out as 23.7% for Premier league defenders last year, Tackle success rate was 56.3% and defenders were dribbled past without committing a foul 20% of the time. From this it’s quite obvious to conclude that defenders should probably never be aggressive in the wide areas of the penalty area since the value of a penalty is far too high to make it worth risking a foul.

Even if you take into consideration referees swallowing the whistle, meaning refereeing calls are usually given less frequently in high leverage situations or in this case fouls in penalty area, it still doesn’t make sense. In order for the foul percentage to fall from 23.7% to less than 7.7% there’s got to be some serious whistle swallowing going on.

If we go back to our original expected values of 0.8, 0.035, and 0.07, and assume the probability of a foul to be 23.7% we can work out the expected number of goals this would cost a team.

 

pen 23.7

 

You can see from the table that attempting a tackle in the wide areas of the penalty box on average costs the team 0.146 goals (0.216 – 0.07 = 0.146), which may not sound much, but that’s about half of the goal advantage that an average team playing at home has over the away team. Managers should be telling their defenders not to dive in or stick a leg out when opposing players are in the wide areas of the penalty box as it never seems to be worth the risk.

I think the reason many defenders are too aggressive in this situation is that since defending is reactive compared to attacking being proactive, defenders may feel they need to be risk-seeking in order to make an impact, and in this case it isn’t the optimal choice. It’s quite strange that this is the case because teams are usually too risk averse, although maybe attackers are risk averse and defenders are risk seeking, but the proactive attackers have more influence on the game overall, just spitballing but it’s an interesting discussion.

Just as I was writing this a perfect example of reckless decision making popped up on my twitter feed from the Barcelona vs Rayo Vallecano game, although I’m not even sure if you can class this as decision making.

There are many other situations like this where decision making can be analysed based on expected value and although this may seem like an obvious one from a spectators standpoint defenders still play aggressively in the wide sections of the penalty area. More accurate data will be needed for decisions less obvious than this one but it can still be done and can inform teams where they are losing goals based on decision making. If a team can save themselves .15 goals a game based on simple decisions like this they would save themselves 5.7 goals over the course of the season. In the end that really could be the difference between the club having a successful year and an unsuccessful one.

  • Daryl

    “I think the reason many defenders are too aggressive in this situation is that since defending is reactive compared to attacking being proactive, defenders may feel they need to be risk-seeking in order to make an impact, and in this case it isn’t the optimal choice”

    I’d like to pick up the above quote along with the notion that if you were a manager you’d tell your defenders not to slide in.

    1) Defending is partly reactive, however it is mainly proactive; pushing the line up, squeezing tight to the man, showing an attacker inside or outside – are all some examples of a defender making a decision to reduce an attacking threat prior to there being one. Defender’s these days have to think more and more about defending in transition because in some cases, depending on who you’re playing, you have to already be thinking of your position – and the positions of your midfielders and other defenders – in order to have enough coverage to reduce the threat of a counter attack.

    The case you highlight arises due to a lack of experience; I would never tell one of my players not to slide/attempt to recover based on the location of the player in the box because Herrera may have had 2 people free for a square ball in the middle. Defenders will learn to judge when – and when not to – to slide in based on the picture in the middle of the box as their game experiences grow:

    • Peter Owen

      I think if we’re comparing attack vs defence you’ve got to consider attacking more proactive and defensive reactive, I agree there are areas of defending that can be considered proactive but at the end of the day the attacking team decides what they’re going to do with the ball and the defense has to try and react, you can try and force them into doing something but if they don’t the defence HAS to react.

      “Defenders will learn to judge when – and when not to – to slide in based on the picture in the middle of the box as their game experiences grow” That’s the point I’m making, you’d think defenders would know when to slide and not but there are obviously still a lot of pointless fouls in the edges of the penalty area. You can’t just assume a defender will become omniscient in relation to decisions like this, if you have enough data you can work out what the correct decisions are for certain situations (when 2 people are free in the box, when the box is congested etc.) and inform the defenders what they should be doing in each one. As for the lack of experience thing the player fouling Neymar, Nacho Martinez, is a pretty experienced defender (26 yo, played 59 times for Rayo Vallecano) and yet that’s one of the poorest decisions I’ve seen a player make, and I’m sure there are many more examples of “experienced defenders” doing similar things. We can’t just assume the players will learn everything themselves unfortunately that’s the whole reason why teams have coaches in the first place.

      • Daryl

        You’re right, defenders do have to react, but I was explaining that defending isn’t totally reactive; nor was I comparing it to attacking. Attacking is also partly reactive, as they will react to gaps and spaces between a team’s unit as and when they arise.

        I’m not assuming defenders will become all knowing, i’m saying that – especially in the instance of Gomez, an 18 year old who was playing his 5th game at left back EVER – they’ll learn from experiences and apply them later on in his/their careers. Nacho, despite his age, is not an experienced defender – he’s only played 59 times over 3 seasons in the top flight. You can tell his inexperience by his body shape; he’s not low enough; not showing Neymar down the line and hasn’t got his knees inverted. One of the first things a defender is taught, is to stay on your feet for as long as possible – Gomez and Nacho will both receive feedback, not only by the coaches but by reviewing the clips of the game back afterwards which will help them when those situations arise in future matches.

  • Gerard Comerford

    Interesting stuff. What about all other areas of the penalty box? What about elite dribblers versus shit dribblers?

    • Peter Owen

      This was really just an introduction to how we can be using analytics to help players in their decision making, I chose wide areas of the penalty box because I thought it was the most obvious and easiest one to do (especially without a large amount of data). Obviously this could go into much more depth with more available data and might be able to tell you what the optimal decision would be centrally in the box and against different quality of dribblers 🙂

  • Bill Wilkens

    Great article, mate ! The biggest amount of and most important wrong decisions made in football, has to be poor finishing though 😛 . It’s easy to train a defender to play more conservatively, but how would you train your strikers decision making ?

  • allanderek

    Nice discussion. I like that even without accurate data for some of the probabilities which you have had to estimate you can still reach a pretty decent conclusion that is at the very least plausible. Even if one wants to disagree with some of your probabilities it’s still a good formula, and if one wants to argue for aggressive defending (in a particular situation) they have to provide some reasonable numbers that produce that result.

    An additional point, is that defending aggressively risks a card. Even a yellow card can have a negative impact on the rest of the game and a red card can be a worse outcome than a goal if it is early enough. There is of course the possibility that the attacker receives a yellow for simulation, but I’d be very surprised if that probability is higher than the probability of the defender receiving a card.

  • Alexander Arguete Iskender

    Hey Peter, nice article.
    Do you have any data to know what’s the probability for a defender to “act aggressively” or “act convervatively”? I’m asking this because if we see it as a probability tree that would be the first decision. For example if the average defender is acting aggressively .5 of the time.

    Regards!

    • Peter Owen

      Nope, unfortunately don’t currently have access to any data that would help me to infer anything like that, working on getting more data though. Thanks for the interest and sorry for the late reply!