Uncategorized

Eigensheep

March 13, 2011 | Edwin Chen

Aaron Koblin’s Sheep Market visualization is an awesome use of Mechanical Turk. But it’d be even more awesome if the grid were ordered, so inspired by the use of eigenfaces in facial recognition, I decided to try projecting the sheep ... [Read more...]

Counting Clusters

March 13, 2011 | Edwin Chen

Given a set of numerical datapoints, we often want to know how many clusters the datapoints form. Two practical algorithms for determining the number of clusters are the gap statistic and the prediction strength. Gap Statistic: The gap statistic algorithm ... [Read more...]

Legendary Plots

March 12, 2011 | richierocks

I was recently pointed in the direction of a thermal comfort model by the engineering company Arup (p27–28 of this pdf). Figure 3 at the top of p28 caught my attention. It’s mostly a nice graph; there’s not too much junk in it. One thing that struck me was ... [Read more...]

Anova – Type I/II/III SS explained

March 2, 2011 | nzcoops

Not my post, just bookmarking this. It’s from http://goanna.cs.rmit.edu.au/~fscholer/anova.php ANOVA (and R) The ANOVA Controversy ANOVA is a statistical process for analysing the amount of variance that is contributed to a sample by different factors. It was initially derived by R. ... [Read more...]

RStudio: just what I’ve been looking for

February 28, 2011 | Rob J Hyndman

For many years I used RWinEdt as my text editor for R code, but when WinEdt 6.0 came out, RWinEdt stopped working. So I’ve been looking for something to replace it. I’ve tried Tinn-R, NppToR, Eclipse with StatET and a couple of other editors, but nothing was quite right. ... [Read more...]

Graphing – margins, titles, mtext, workspace

February 21, 2011 | nzcoops

This is a great post, and very true: not enough of R’s graphics are well displayed online to really see how to achieve what the often ambiguous ‘help’ information suggests. http://research.stowers-institute.org/efg/R/Graphics/Basics/mar-oma/index.htm I particularly find “mtext("lol", outer=T)” to be ... [Read more...]

Child health metrics

February 20, 2011 | nzcoops

In analysis of child health data, z-scores or percentile groupings are generally used because children’s growth is not linear. The CDC (Centers for Disease Control and Prevention) has released tables of data for calculating these z-scores and percentiles, and here are some scripts for R to calculate these ... [Read more...]

Mixed models – Part 2: lme lmer

February 15, 2011 | nzcoops

Getting more into mixed models, I’ve been playing around with both nlme::lme and lme4::lmer. http://tolstoy.newcastle.edu.au/R/e2/help/06/10/3345.html was quite a good post explaining the differences, which from what I gather are largely performance based when using crossed or partially crossed ... [Read more...]

Statistical Analysis with R, a Review

February 12, 2011 | Tony

[To all of the R-bloggers out there who recognize this, I apologize. To those that don't, this is at least the 5th review of this book to go on the feed. The author is linking to the others here.] Long Version: I have a Bachelor’s degree in Computer Science. ... [Read more...]

Mixed Models – Part 1

February 3, 2011 | nzcoops

Very brief. Have been exploring mixed models in R using nlme::lme. Am looking forward to understanding them more; they’re going to be used more and more in years to come, I’ve no doubt of that. Here are some scripts, very rough, for diagnostics when running simple 2 levels, ... [Read more...]

The Clipboard can be Your Friend

January 17, 2011 | --

One of the features I feel that I don’t leverage enough, particularly for ad-hoc questions, is the clipboard feature in R. I know there are many ways to use it – as I have been exposed to a few – but the example below, even without context, should get you thinking ... [Read more...]

CrossValidated Journal Club

December 21, 2010 | Rob J Hyndman

Journal Clubs are a great way to learn new research ideas and to keep up with the literature. The idea is that a group of people get together every week or so to discuss a paper of joint interest. This can happen within your own research group or department, or ... [Read more...]

R Workflow: Melbourne R Users Dec 1st 2010

December 18, 2010 | jeromyanglim

Melbourne R Users Group December 1st 2010 Meeting (Meetup page). 1. “What my R code looks and feels like (Vanilla)” by Geoff Robinson. The other talk from the session was by Geoff Robinson who discussed several useful strategies for working ... [Read more...]

Ghcn V3 Metadata improvements

December 12, 2010 | Steven Mosher

The Global Historical Climatology Network (GHCN) is in its beta stage. One of the stated goals of the project is to improve the metadata that is provided for the station data. Over the past few months several independent volunteers have been focusing on the issue of station metadata, each ... [Read more...]

Forecasting workshop: Switzerland, June 2011

December 6, 2010 | Rob J Hyndman

I will be running a workshop on Statistical Forecasting: Principles and Practice in Switzerland, 20-22 June 2011. Check out the venue: Waldhotel Doldenhorn, Kandersteg! So if you fancy a trip to the beautiful Swiss Alps next June, read on… Outline Forecasting is required in many situations: deciding whether to build another ... [Read more...]

Data visualization videos

November 29, 2010 | Rob J Hyndman

Probably everyone has seen Hans Rosling’s famous TED talk by now. I recently came across a couple of other exceptional talks on data visualization: Hans Rosling again: Let my dataset change your mindset. If only all statistics lecturers were this dynamic! David McCandless: The beauty of data visualization. Not ... [Read more...]

Initializing the Holt-Winters method

November 29, 2010 | Rob J Hyndman

The Holt-Winters method is a popular and effective approach to forecasting seasonal time series. But different implementations will give different forecasts, depending on how the method is initialized and how the smoothing parameters are selected. In this post I will discuss various initialization methods. Suppose the time series is denoted ... [Read more...]
