
Predictive analysis using Python
This is a predictive analysis using linear regression. We are working with a data set that covers 2805 matches from 1871 to 2015. First, I will perform an exploratory data analysis to understand the statistical properties and distribution shapes of 8 key variables. Next, I will run a correlation analysis to study whether winning or losing correlates with runs scored, runs against, and run difference from 1960 to 2010. Lastly, I will run a correlation analysis against games won, create train and test data sets, build a linear regression model for each of our 4 time periods, and test the models by predicting games won by NYY and TORR from 2012 to 2015.
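To make the workflow concrete, here is a minimal sketch of the correlation, train/test split, and linear regression steps. It is not the analysis itself: the data is synthetic and generated only for illustration, the names `run_diff`, `wins`, `pearson_r`, and `fit_line` are my own placeholders, and the 80/20 split is an assumption. It uses only the standard library and a single predictor (run difference) to keep the idea visible.

```python
import random

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = my - slope * mx
    return slope, intercept

# Synthetic, made-up data (NOT the real data set): 50 team seasons where
# wins depend roughly linearly on run difference, plus random noise.
rng = random.Random(0)
run_diff = [rng.randint(-200, 200) for _ in range(50)]
wins = [81 + 0.1 * rd + rng.gauss(0, 3) for rd in run_diff]

# Step 1: correlation between run difference and games won.
r = pearson_r(run_diff, wins)

# Step 2: train/test split (assumed 80/20 here).
split = 40
train_x, test_x = run_diff[:split], run_diff[split:]
train_y, test_y = wins[:split], wins[split:]

# Step 3: fit on the training rows, predict the held-out rows.
slope, intercept = fit_line(train_x, train_y)
predicted = [intercept + slope * x for x in test_x]

# Step 4: score the predictions with mean absolute error.
mae = sum(abs(p - a) for p, a in zip(predicted, test_y)) / len(test_y)

print(f"correlation: {r:.3f}")
print(f"model: wins = {intercept:.2f} + {slope:.4f} * run_diff")
print(f"mean absolute error on test rows: {mae:.2f}")
```

With the real data, the same steps would typically be done with pandas and scikit-learn, and the held-out rows would be the 2012 to 2015 seasons rather than a slice of random data.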