Tuesday, November 9, 2010

On the MLS Draft: Linear Regression

In my first two posts on the MLS draft I looked at patterns in the 2009 and 2010 drafts.  I've now started looking at five seasons of data, from 2006 to 2010.  Again, I'm using minutes played as an indicator of success.  I'm not attempting to predict which players should be picked when or who will be successful in MLS; instead I'm trying to reveal patterns that show a breakdown in the draft decision-making process.
First I wanted to look at how the percentage of minutes played changes with the selection number of the player.  Aggregating the data from the first two rounds from 2006 to 2010, a nice linear pattern emerges.  We can use a linear regression to estimate what a player's expected percentage of minutes played should be, given his pick number, and whether he is underperforming or overperforming relative to that estimate.
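To make the idea concrete, here is a minimal sketch of fitting a straight line to pick number versus percentage of minutes played, and then comparing one player against the line.  The numbers are made up for illustration and are not the draft data, and the function names are my own; any stats package would do the same job.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def expected_pct(pick, slope, intercept):
    """Percentage of minutes the fitted line predicts for a pick number."""
    return slope * pick + intercept


# Hypothetical values for illustration only (not the real 2006-2010 data).
picks = [1, 5, 10, 15, 20, 25, 30]
pct_minutes = [60.0, 55.0, 40.0, 35.0, 30.0, 15.0, 10.0]

slope, intercept = fit_line(picks, pct_minutes)

# A positive residual means the player has played more than the line predicts.
residual = 50.0 - expected_pct(10, slope, intercept)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, residual={residual:.2f}")
```

A player above the line is overperforming for his draft slot, and a player below it is underperforming.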

Another thing I noticed was how bad teams were at predicting talent. Approximately 25% of the players picked in the first two rounds never make an appearance in MLS.  Given that the draft is one of the main sources of acquiring talent (although this is changing), I found this number appalling.  I wanted to see if any teams were good at drafting players or if it was a random toss of the dice.  To estimate draft success I compared how each team's picks actually did with how the linear regression estimated they should do.
Philly's poor performance can be attributed to their strategy of drafting young players, who have had only one season to develop.  A few more seasons are needed to determine whether or not their players are panning out.  I was surprised to see the Seattle Sounders performing below expectation because Steve Zakuani has been a wonderful pick.  However, looking at their other picks, David Estrada and Evan Brown have not performed up to expectations, and Brown has been released from the team.  I was really impressed with the LA Galaxy's record.  Three quarters of their backline this season came from the draft, including newly capped Omar Gonzalez.  In fact, half of their picks were starters in the conference semifinals.
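One way to turn that comparison into a single number per team is to average the gap between actual and expected minutes over each team's picks.  The sketch below assumes that averaging approach, which is my illustration rather than a statement of exactly how the chart was built; the team labels, the slope and the intercept are all placeholder values.

```python
from collections import defaultdict


def team_scores(picks, slope, intercept):
    """Mean (actual - expected) percentage of minutes for each team.

    picks is a list of (team, pick_number, pct_minutes) tuples.
    slope and intercept come from a previously fitted regression line.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for team, pick_number, pct_minutes in picks:
        expected = slope * pick_number + intercept
        totals[team] += pct_minutes - expected
        counts[team] += 1
    return {team: totals[team] / counts[team] for team in totals}


# Placeholder line and picks, for illustration only.
example_picks = [
    ("Team A", 10, 50.0),
    ("Team A", 20, 40.0),
    ("Team B", 5, 30.0),
]
scores = team_scores(example_picks, slope=-1.0, intercept=50.0)
for team, score in sorted(scores.items(), key=lambda item: item[1], reverse=True):
    print(f"{team}: {score:+.1f}")
```

A team with a positive score has, on average, drafted players who play more than their draft position would suggest.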

I also wanted to look at which universities produce the most successful players.  I was curious to see if some universities were talent pipelines to MLS.
The darker colors indicate the number of players drafted.  The data was filtered down to only universities that had 2 or more players drafted. Wake Forest and Notre Dame tend to have a lot of players drafted, but those players aren't very successful.  I was a little shocked that teams keep picking from these universities year after year.  University of Maryland, however, seems to consistently produce talent in large numbers.  Also of note is that players who didn't attend college tended to underperform.  Some of those players clearly didn't pan out, but others like Brek Shea, Fuad Ibrahim and Jack McInerney still show promise and might just take longer to become everyday starters.
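The university breakdown boils down to a group-and-filter step.  Here is a rough sketch, assuming success is measured as the average percentage of minutes played by each university's draftees; the university labels and values are hypothetical, and the same function would work on regression residuals instead.

```python
from collections import defaultdict


def university_summary(players, min_drafted=2):
    """Count and mean percentage of minutes per university.

    players is a list of (university, pct_minutes) tuples.  Universities
    with fewer than min_drafted players are left out.
    """
    groups = defaultdict(list)
    for university, pct_minutes in players:
        groups[university].append(pct_minutes)
    return {
        university: (len(values), sum(values) / len(values))
        for university, values in groups.items()
        if len(values) >= min_drafted
    }


# Hypothetical rows, for illustration only.
example_players = [
    ("University X", 10.0),
    ("University X", 30.0),
    ("University Y", 50.0),
]
for university, (count, mean_pct) in university_summary(example_players).items():
    print(f"{university}: {count} drafted, {mean_pct:.1f}% of minutes on average")
```

The count is what the color shading would encode, and the mean is the measure of success.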

