Since early 2011 student researchers from Columbia
University in New York, led by Professor Casey Ichniowski, have
been carrying out qualitative analysis of numerous leagues and
seasons of full Opta data in an attempt to further contextualise
events that occur within a football match. Casey and his
team have used a range of statistical methods to analyse the data.
The result has been a number of short-form analytical blogs,
covering many different aspects of the game including 'Team DNA',
scoring possessions and tempo. We invite comment and debate on
each.
The first blog for OptaPro from Professor
Ichinowski's team is another perspective on possession:
The number of possessions teams have in a game can differ by one
at most. What a team does with its possessions ultimately
determines who scores and wins. Clearly, possessions that move
closer to the goal are more likely to score. With this simple
logic, we develop a method to quantify the quality of
possession.
Using data from 2005-2012 from all EU5 premier clubs, we plot
all 19,018 regular play goals on a soccer field. We then scale the
values at each x-y coordinate such that 100 represents the location
where the most goal shots are taken and 0 represents no goal shots.
The result is the following heat map.

Although an event directly on the goal line should
probably be awarded many points, there are very few goals here
simply because of the difficulty of reaching such locations. An
award system purely based on goal scoring location may not be ideal
as it also does not award places that are one pass or chip away
from a goal-scoring opportunity, such as near the corners. To
account for this, we look at the inverse frequency of all passes
(and set all values in own half to 0). This essentially rewards
teams for being in heavily defended areas that are difficult to be
in and are in range of a goal or a pass leading to a
goal.

We then combine the goal map and the inverse passing
to to develop a hybrid scoring system for rating events. The
resulting points map (rescaled to a 0-100) is as
follows:

We now define Possession Score as the maximum point value of a
pass or receive event in a possession. Thus, the quality of a
possession is characterized by the most dangerous area it entered
during the possession. (Alternatively, one can do maximum point
minus points at starting location).
Looking at possessions with at least 4 passes, we calculate the
mean Possession Scores for all club teams in 2011-2012. Despite
their tiki-taka style, frequency passing and time spent midfield,
Barcelona comes out on top as most of the possessions eventually
lead to dangerous areas.

Now let us know your thoughts. Comment below: