While preparing my slides for statistical graphics, a plot really caught my eye when I was playing around with the data. I started off by plotting the time seriesof GNI per capita by country, and as expected it got quite messy and...

tl;dr: Play with the app here When you live in Paris, chances are you are (home or work) very close to a metro station, so when you want to meet with some friends, you usually end up picking another metro station as a meeting point. Yet, finding the optimal place to meet can easily become a complex problem considering...

Short: I plot the frequency of wikipedia searches of “Behavioral Economics”, and “Beer” – who knew the correlation would be 0.7! Data reference:Data on any wikipedia searches (back to 2007) are available at http://glimmer.rstudio.com/pssguy/wikiSearchRates/. The website allows you to download frequency hits per day as a csv, which is what I've done here....

My research has allowed me to implement techniques for visualizing multivariate models in R and I wanted to share some additional techniques I’ve developed, in addition to my previous post. For example, I think a primary obstacle towards developing a useful neural network model is an under-appreciation of the effects model parameters have on model

Below are my findings from the second data analysis project in Dr. Jeffery Leek’s John Hopkins Coursera class. Introduction I used the “Human Activity Recognition Using Smartphones Dataset” (UCI, 2013) to build a model. This data was recorded from a Samsung prototype smartphone with a built-in accelerometer. The purpose of my model was to recognize the type

A great. recent enhancement for the Shiny App is the ability to upload local files. Now, in addition to users being able to interact with data provided on the host e.g. Soccer Tables or via the web, Wikipedia Search Rates they can use apps to view and analyse their own data I have knocked up

Third part on logistic regression (first here, second here). Two steps in assessing the fit of the model: first is to determine if the model fits using summary measures of goodness of fit or by assessing the predictive ability of the model; second is to deterime if there’s any observations that do not fit the

Making available replication materials for the research you do is A Good Thing. It’s also work, and it’s quite easy to never get around to. Certainly I claim no special virtue in this department so I am always happy when there’s an institutional stick to prod my better nature in the right direction. One such institutional

Second part on logistic regression (first one here). We used in the previous post a likelihood ratio test to compare a full and null model. The same can be done to compare a full and nested model to test the contribution of any subset of parameters: Interpretation of coefficients Note: Dohoo do not report the

This is a dynamic post which I will continue to update whenever I find something new. Hope you will find the following links useful.Online Courses for Learning the R languageTry R from Code Schoole-Books for Learning the R LanguageR for Beginners ...