Stuart Lancaster visited our Leeds office this week and spoke with collection manager Conor Tyndall. Coach. 27 Apr

.@SaturdayOnCouch uses key statistics to analyse teams ahead of the Champions & Europa League semi-finals. Standout. 26 Apr

Analyst @RevsTimC discusses the implementation of analytics at New England Revolution. Edge. 26 Apr

Students from Columbia University use data to build a 20-man MLS roster from scratch. Assignment. 20 Apr

.@TeleFootball speaks with #WHUFC’s Rory Campbell, technical analyst & scout, on analytics in football. Irons. 18 Apr

.@mixedknuts applies data to discuss shot quality and decision-making in his latest StatsBomb article. Pop. 13 Apr

.@BenMackriell is attending the Sports Analytics Innovation Summit #SportsLondon tomorrow. Get in touch to arrange a meeting. Connect. 12 Apr

The OptaPro Blog

RSS Feed

Welcome to the OptaPro blog, featuring news and analysis from OptaPro's cutting-edge research team.

COLUMBIA BLOG: Scoring Possessions

Since early 2011 student researchers from Columbia University in New York, led by Professor Casey Ichniowski, have been carrying out qualitative analysis of numerous leagues and seasons of full Opta data in an attempt to further contextualise events that occur within a football match. Casey and his team have used a range of statistical methods to analyse the data. The result has been a number of short-form analytical blogs, covering many different aspects of the game including 'Team DNA', scoring possessions and tempo. We invite comment and debate on each.

The first blog for OptaPro from Professor Ichinowski's team is another perspective on possession:

The number of possessions teams have in a game can differ by one at most. What a team does with its possessions ultimately determines who scores and wins. Clearly, possessions that move closer to the goal are more likely to score. With this simple logic, we develop a method to quantify the quality of possession.

Using data from 2005-2012 from all EU5 premier clubs, we plot all 19,018 regular play goals on a soccer field. We then scale the values at each x-y coordinate such that 100 represents the location where the most goal shots are taken and 0 represents no goal shots. The result is the following heat map.


Although an event directly on the goal line should probably be awarded many points, there are very few goals here simply because of the difficulty of reaching such locations. An award system purely based on goal scoring location may not be ideal as it also does not award places that are one pass or chip away from a goal-scoring opportunity, such as near the corners. To account for this, we look at the inverse frequency of all passes (and set all values in own half to 0). This essentially rewards teams for being in heavily defended areas that are difficult to be in and are in range of a goal or a pass leading to a goal.


We then combine the goal map and the inverse passing to to develop a hybrid scoring system for rating events. The resulting points map (rescaled to a 0-100) is as follows:


We now define Possession Score as the maximum point value of a pass or receive event in a possession. Thus, the quality of a possession is characterized by the most dangerous area it entered during the possession. (Alternatively, one can do maximum point minus points at starting location).

Looking at possessions with at least 4 passes, we calculate the mean Possession Scores for all club teams in 2011-2012. Despite their tiki-taka style, frequency passing and time spent midfield, Barcelona comes out on top as most of the possessions eventually lead to dangerous areas.


Now let us know your thoughts. Comment below:

Posted by Columbia University at 17:41


Post a comment