Projected # of NCAA Tourney Teams
Posted: Tue Nov 25, 2014 12:49 pm
So since I'm currently studying analytics, I decided to have some fun and create a multiple regression model to project the number of at-large bids a conference will get for the NCAA Tournament.
Based on the model, if non-conference play ended today, the Big East would be projected to get 9 at-large NCAA Bids! (9.319939757 to be exact)
Unfortunately, non-conference play does not end today and that number pretty much hinges on us keeping up a .9394 winning % as a conference. But it still shows just how good we've been as a whole to this point.
For any stat-heads out there, I've got some more info on my results below. Feel free to critique and offer any suggestions to try to improve on the accuracy.
____________________________________________________________________________________________________________________________________
My result was this simple (or not so simple equation):
Teams in Touney = 49.83465(win%^2)-62.174(win%)+50.88886(NonConRPI)+.062851(#Teams^2)-1.23931(#Teams)
Win%= conference winning percentage in non-conference play
NonConRPI= Conference RPI excluding conference play (from CBS RPI)
#Teams= # of Teams in the conference
I used data from every conference with at least 1 at large bid over the past 5 years, and on only 2 instances did the model's projection on a conference miss by more than 1 bid (rounding to integers).
R-Square: .95151
F value: 12.2907
F significance: 159E-30
MAE: .73466
MSE: .53973
Some sources of error include likely at-large bid candidates reaching the tournament by way of automatic bids, the change in the number of available at-large bids due to the increase in field size to 68, bubble teams playing their way out of the tournament during conference play, and consideration for the human element in the actual selection process.
Based on the model, if non-conference play ended today, the Big East would be projected to get 9 at-large NCAA Bids! (9.319939757 to be exact)
Unfortunately, non-conference play does not end today and that number pretty much hinges on us keeping up a .9394 winning % as a conference. But it still shows just how good we've been as a whole to this point.
For any stat-heads out there, I've got some more info on my results below. Feel free to critique and offer any suggestions to try to improve on the accuracy.
____________________________________________________________________________________________________________________________________
My result was this simple (or not so simple equation):
Teams in Touney = 49.83465(win%^2)-62.174(win%)+50.88886(NonConRPI)+.062851(#Teams^2)-1.23931(#Teams)
Win%= conference winning percentage in non-conference play
NonConRPI= Conference RPI excluding conference play (from CBS RPI)
#Teams= # of Teams in the conference
I used data from every conference with at least 1 at large bid over the past 5 years, and on only 2 instances did the model's projection on a conference miss by more than 1 bid (rounding to integers).
R-Square: .95151
F value: 12.2907
F significance: 159E-30
MAE: .73466
MSE: .53973
Some sources of error include likely at-large bid candidates reaching the tournament by way of automatic bids, the change in the number of available at-large bids due to the increase in field size to 68, bubble teams playing their way out of the tournament during conference play, and consideration for the human element in the actual selection process.