1662 search results for "regression"

Collinearity and stepwise VIF selection

February 5, 2013
By
Collinearity and stepwise VIF selection

Collinearity, or excessive correlation among explanatory variables, can complicate or prevent the identification of an optimal set of explanatory variables for a statistical model. For example, forward or backward selection of variables could produce inconsistent results, variance partitioning analyses may be unable to identify unique sources of variation, or parameter estimates may include substantial amounts

Read more »

Natura non facit saltus

February 5, 2013
By
Natura non facit saltus

(see John Wilkins’ article on the – interesting – history of that phrase http://scienceblogs.com/evolvingthoughts/…). We will see, this week in class, several smoothing techniques, for insurance ratemaking. As a starting point, assume that we do not want to use segmentation techniques: everyone will pay exactly the same price. no segmentation of the premium And that price should be related to...

Read more »

Proposed techniques for communicating the amount of information contained in a statistical result

February 5, 2013
By
Proposed techniques for communicating the amount of information contained in a statistical result

A couple of weeks ago, I posted about how much we can expect to learn about the state of the world on the basis of a statistical significance test. One way of framing this question is: if we’re trying to come to scientific conclusions on the basis of statistical results, how much can we update

Read more »

Proposed techniques for communicating the amount of information contained in a statistical result

February 4, 2013
By
Proposed techniques for communicating the amount of information contained in a statistical result

A couple of weeks ago, I posted about how much we can expect to learn about the state of the world on the basis of a statistical significance test. One way of framing this question is: if we’re trying to come to scientific conclusions on the basis of statistical results, how much can we update

Read more »

"I don’t wanna grow up": Age / value relationships for football players

February 1, 2013
By
"I don’t wanna grow up": Age / value relationships for football players

Let's get back to the age-value relationship from my last post. I did some more plotting to see on which position this inversed U-shaped relationship is strongest. Please note, that I use a dataframe called eu.players throughout this post, which holds downloaded football player information from transfermarkt.de.But first, let us get back to the original graph.

Read more »

Taking Expectations to the Next Level

January 31, 2013
By
Taking Expectations to the Next Level

Higher Expectations I came across this post on Thursday and found it to be quite interesting. Clearly rental prices vary according to where you live. That isn't too surprising. I started thinking a bit more about it and thought that Boston and the nearby communities would have to...

Read more »

Maximize Your Expectations!

January 30, 2013
By
Maximize Your Expectations!

A Problem A major problem in secondary data analysis is that you didn't get to decide what data was collected. Lets say you were interested in how many times a student has read the Twilight books). Specifically, you want to know how effective the ads for...

Read more »

F1Stats – Visually Comparing Qualifying and Grid Positions with Race Classification

January 30, 2013
By
F1Stats – Visually Comparing Qualifying and Grid Positions with Race Classification

Following the roundabout tour of F1Stats – A Prequel to Getting Started With Rank Correlations, here’s a walk through of my attempt to replicate the first part of A Tale of Two

Read more »

Modeling Residential Electricity Usage with R

January 30, 2013
By
Modeling Residential Electricity Usage with R

Wow, I can’t believe it has been 11 months since my last blog posting!  The next series of postings will be related to the retail energy field.  Residential power usage is satisfying to model as it can be forecast fairly accurately with the right inputs.  Partly as a consequence of deregulation there is now more data more available than...

Read more »

Disruptive Data Science – Transforming Your Company into a Data Science-Driven Enterprise

January 29, 2013
By
Disruptive Data Science – Transforming Your Company into a Data Science-Driven Enterprise

Big Data is the latest technology wave impacting C-Level executives across all areas of business, but amid the hype, there remains confusion about what it all means. The name emphasizes the exponential growth of data volumes worldwide (collectively, 5 Exabytes/ day in the latest estimate I saw from IDC), but more nuanced definitions of Big Data incorporate the following...

Read more »