Monthly Archives: March 2011

Tips on installing R extension for RapidMiner on Mac OS X

March 9, 2011

RapidMiner is a cool toy for playing with machine-learning/data-mining algorithms, and it can interface with R. However, getting the R extension to work properly on Mac OS X Leopard with R 2.11 was a bit problematic for me. Here is what works for me at the...

Read more »

In case you missed it: February Roundup

March 9, 2011

In case you missed them, here are some articles from February of particular interest to R users. Revolution R Enterprise 4.2 is now available to subscribers, and as a free download for academics. A brief report from the Strata: Working with Data conference, and a comprehensive review from Ted Leung. A profile of prolific R contributor Dirk Eddelbuettel. A list...

Read more »

Comparing two-dimensional data sets in R

March 9, 2011

I wanted to fit a continuous function to a discrete 2D distribution in R. I managed to do this by using nls, and wanted to display the data. I discovered a nice way to compare the actual data and the fit using ggplot2, where the background is the real ...

Read more »

Forest plots using R and ggplot2

March 9, 2011

Abhijit over at Stat Bandit posted some nice code for making forest plots using ggplot2 in R. You see these a lot in meta-analyses, or as seen in the BioVU demonstration paper. The idea is simple: on the x-axis you have the odds ratio (or what...

Read more »

My First Few Days with RStudio

March 9, 2011

As most readers are probably aware, the free IDE for R, called RStudio, was recently released for general use, and it immediately made huge waves within the R community. IDE stands for Integrated Development Environment. IDEs typically provide a rich set of tools for developing in some target language. For standard programming languages like C++ (Visual Studio) and Java (Eclipse or NetBeans),...

Read more »

Playing with quantiles, part 2

March 8, 2011

It is common to look at the best time in the marathon, or perhaps at the distribution of the top 100, as done by John Myles White on his blog here (data can be found there), as in the graph below, with the density of the times for the first 100 men (in blue) a...

Read more »

Playing with quantiles, part 1

March 8, 2011

A standard idea in extreme value theory (see e.g. here, in French unfortunately) is that to estimate the 99.5% quantile (say), we just need to estimate a quantile of level 95% for observations exceeding the 90% quantile, since only 10% × 5% = 0.5% of observations lie above it. In extreme value theory,...

Read more »