Strike Zone Changes?

March 7, 2012 | Millsy

It's been a while since I have posted here. I have been swamped with some papers I am trying to get out, finishing up the dissertation, and interviews (faculty ones in addition to others). I should have some big news in the next couple of weeks regar... [Read more...]

Sabermetrics Meets R Meetup

October 25, 2011 | Millsy

I just ran across this post at Big Computing. On November 14th, there will be an R User meet-up in Washington, DC (Tyson's Corner) led by Mike Driscoll about using R for sabermetric analysis (linked here). I will actually be home in Maryland for a co... [Read more...]

Link to StatDNA Guest Post

September 7, 2011 | Millsy

The post is officially up on the StatDNA blog. Go check it out. As I said in my previous post, this is a very rough and preliminary model. This is why my work was not any sort of formal entry, just some fun with some great data. I used an ... [Read more...]

More on JSM

August 5, 2011 | Millsy

While my time at the 2011 Joint Statistical Meetings was short--I unfortunately missed some presentations I would have liked to attend--it was a great experience. The collection of academics and professionals is very different from the other con... [Read more...]

sab-R-metrics: Merging Data Sets

June 22, 2011 | Millsy

I am finally back from Greece and recovered from jet lag. Fortunately, I did not get tear gassed while in Athens, though there were riot police everywhere the whole time we visited. Today, I'm going to start getting my feet wet again with a shorter s... [Read more...]

sab-R-metrics: Kernel Density Smoothing

May 25, 2011 | Millsy

Last time I left you, I had gone over some basics of doing loess regression in R. If you remember, loess is a sort of regression that allows wiggliness in your regression of some dependent variable Y on some independent variable X (I will generalize t... [Read more...]

sab-R-metrics: Basics of LOESS Regression

May 11, 2011 | Millsy

Last week, I left you off at logistic regression. This week, I'll be pushing the limits of regression analysis a bit more with a smoothing technique called LOESS regression. There are a number of smoothing methods that can be used, such as Smoothing ... [Read more...]

sab-R-metrics: Logistic Regression

May 5, 2011 | Millsy

It's been a while since my last sab-R-metrics post, and I have not gotten to the real fun stuff yet. I apologize for the long layoff, and it's likely that these will be sparse for the next couple weeks. I have had some consulting opportunities come u... [Read more...]

Trackman Position Needs R Knowledge

April 16, 2011 | Millsy

Thought some of the R-Blogger readers would be interested in the position linked below. If you're a baseball fan and like working in R, this is a fun company that seems to be getting more and more press. Recently, it was featured in Sports Illustrate... [Read more...]

sab-R-metrics Sidetrack: Bubble Plots

March 22, 2011 | Millsy

While I had mentioned in my last post that I will cover logistic regression in my next post, I decided that a quick interlude in working with bubble plots would be fun. Bubble plots have become pretty popular recently, especially with all of the Visualization Challenges I've seen around the ... [Read more...]

Having a problem with R-2.12.2 64-bit and "gam" package!

March 17, 2011 | Millsy

While working with some pitch location data recently, I ran across something strange when using my new computer (with R-2.12.2 64-bit) versus my work computer (with R-2.11.1 x64). Both are 64-bit computers, but I got the new one for portability (it's a laptop) and speed. Anyway, I had been doing ... [Read more...]

sab-R-metrics: Brief Sidetrack for Scatterplot Matrices

March 16, 2011 | Millsy

In my last two posts I talked about Ordinary Least Squares, then extended this discussion to the multiple predictor case and briefly talked about some of the problems that may arise. These problems can include omitted variable bias, heteroskedasticity, non-normality, and multicollinearity. Most of these problems are relatively minor in ... [Read more...]

sab-R-metrics: Multiple Regression and Interactions

March 12, 2011 | Millsy

Last time, I covered ordinary least squares with a single variable. This time, I'll extend this to using multiple predictor variables in a regression, interacting terms in R, and start thinking about using polynomials of certain terms in the regression (like Age and Age Squared). This should be a pretty ... [Read more...]

sab-R-metrics: Basic Applied Regression (OLS)

February 23, 2011 | Millsy

Today, I'll again be using a new data set that can be found here at my website (called 'leagueoutcomes.csv'). The data set includes the standings results of the 2009 season for MLB along with average game attendance by team. I'll use this to go over some basic regression techniques and ... [Read more...]

Fixing Up smoothScatter Heat Maps

February 2, 2011 | Millsy

A while back, I posted an article using the smoothScatter function in R that builds a color representation of density for scatter plots. When I first found the function, I was extremely excited because it's a very easy and automated way to make a heat... [Read more...]
