## Simulation (is where it’s happening)

November 18, 2013
Jim Silverton wrote to the Allstat mailing list recently: “Hi, Anyone up for a challenge? Suppose we have random variables that are random points on the surface of a sphere. What is the probability that the tetrahedron made by joining these …

## Some Options for Testing Tables

November 18, 2013
Contingency tables are a very good way to summarize discrete data. They are quite easy to construct and reasonably easy to understand. However, tables have many nuances, and care should be taken when drawing conclusions from the data. Here are just a few thoughts on the topic. Dealing with sparse data On

## Visualizing neural networks in R – update

November 14, 2013
In my last post I said I wasn’t going to write any more about neural networks (i.e., multilayer feedforward perceptron, supervised ANN, etc.). That was a lie. I’ve received several requests to update the neural network plotting function described in the original post. As previously explained, R does not provide a lot of options for visualizing

## Calibration of p-value under variable selection: an example

November 14, 2013
Very often people report p-values for linear regression estimates after performing a variable selection step. Here is a simple simulation showing that such a procedure can leave those tests miscalibrated. Consider a simple data generating pro...
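To make the idea concrete, here is a minimal sketch of such a simulation. It is not the post's own code or data generating process (the excerpt is cut off before that is described), and it is written in Python rather than R. The setup is an assumption chosen for illustration: the response is pure noise, there are `k` unrelated candidate predictors, "variable selection" is reduced to keeping the predictor with the smallest p-value, and the p-value uses a normal approximation to the t statistic so that only the standard library is needed. The function names `slope_p_value` and `rejection_rate` are hypothetical.

```python
import math
import random
from statistics import NormalDist


def slope_p_value(x, y):
    """Two-sided p-value for the slope in a simple regression of y on x.

    Assumption: a normal approximation to the t statistic, which is
    adequate for the sample size used below and avoids extra libraries.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    beta = sxy / sxx                      # OLS slope estimate
    rss = syy - beta * sxy                # residual sum of squares
    se = math.sqrt(rss / (n - 2) / sxx)   # standard error of the slope
    z = beta / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


def rejection_rate(n_sims=400, n=100, k=10, alpha=0.05, select=True, seed=1):
    """Share of simulations in which the null is rejected at level alpha.

    y is independent of every predictor, so a calibrated test should
    reject in roughly alpha of the simulations.
    """
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sims):
        y = [rng.gauss(0, 1) for _ in range(n)]
        xs = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(k)]
        p_values = [slope_p_value(x, y) for x in xs]
        # With selection: report the best-looking predictor.
        # Without: report a predictor fixed in advance.
        p = min(p_values) if select else p_values[0]
        if p < alpha:
            rejections += 1
    return rejections / n_sims


naive = rejection_rate(select=False)
selected = rejection_rate(select=True)
print(f"rejection rate, predictor fixed in advance: {naive:.3f}")
print(f"rejection rate, predictor chosen by selection: {selected:.3f}")
```

Under these assumptions the pre-specified test should reject at close to the nominal 5% level, while the test reported after selection rejects far more often, because it is effectively the minimum of `k` p-values.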

## A slightly different introduction to R, part V: plotting and simulating linear models

November 11, 2013
In the last episode (which was quite some time ago) we looked into comparisons of means with linear models. This time, let’s visualise some linear models with ggplot2, and practice another useful R skill, namely how to simulate data from known models. While doing this, we’ll learn some more about the layered structure of a
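As a rough illustration of what simulating data from a known model means, the sketch below draws data from a straight-line model with chosen parameters and checks that a least-squares fit recovers them. It is an assumption-laden stand-in, not the post's code: it uses Python's standard library rather than R and ggplot2, it skips the plotting step, and the names `simulate_linear` and `fit_line` as well as the parameter values are hypothetical.

```python
import random


def simulate_linear(n, intercept, slope, sigma, seed=0):
    """Draw n points from y = intercept + slope * x + Normal(0, sigma) noise."""
    rng = random.Random(seed)
    x = [rng.uniform(0, 10) for _ in range(n)]
    y = [intercept + slope * xi + rng.gauss(0, sigma) for xi in x]
    return x, y


def fit_line(x, y):
    """Ordinary least squares fit of a straight line; returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


# Known "true" model, chosen arbitrarily for the example.
x, y = simulate_linear(n=500, intercept=2.0, slope=0.5, sigma=1.0)
est_intercept, est_slope = fit_line(x, y)
print(f"true intercept 2.0, estimated {est_intercept:.2f}")
print(f"true slope 0.5, estimated {est_slope:.2f}")
```

Because the true parameters are known, the fit can be judged directly against them, which is the main appeal of simulating before analysing real data.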

## A statistical review of ‘Thinking, Fast and Slow’ by Daniel Kahneman

November 11, 2013
I failed to find Kahneman’s book in the economics section of the bookshop, so I had to ask where it was. “Oh, that’s in the psychology section.” It should have also been in the statistics section. He states that his collaboration with Amos Tversky started with the question: Are humans good intuitive statisticians? The wrong...

## Key Driver vs. Network Analysis in R

November 8, 2013
When marketing researchers speak of driver analysis, they are referring to an input-output model with overall satisfaction as the output and performance ratings of specific product and service components as the inputs. The causal model is straightforwa...

## NYC R Programming Classes – starting this coming Sunday

November 5, 2013
Guest post by Vivian Zhang, original post. You can sign up for our Sunday Intensive beginner-level R classes at the NYC Data Science Academy meetup page, or email [email protected] for more info. Brief: The course (which will meet five Sundays) will start from the basics, introducing the building blocks used for programming in R and building intuition for writing clean and robust code....

## analyze the american national election studies (anes) with r

November 4, 2013
on election days in the united states, the news media peppers its coverage with quick, dirty exit polls that allow them to make coarse statements like, "x% of demographic group y voted for candidate z."  the american national election studies are ...

## Dream Team – combining Tableau and R

November 3, 2013
Last quarter was a bit too busy for me to write any new blog posts because of a new job. And changing jobs often comes along with changing the tools you work with. That is how I came to Tableau. Tableau is one of the new stars in the BI/Analytics world and definitely worth a look. The people at Tableau...

