Blog Archives

Introducing parallelRandomForest: faster, leaner, parallelized

September 23, 2013
By
ffffffffffffforst

Together with other members of Andreas Beyer's research group, I participated in the DREAM 8 toxicogenetics challenge. While the jury is still out on the results, I want to introduce my improvement of the R randomForest package, namely parall...

Read more »

2D plot with histograms for each dimension (2013 edition)

April 22, 2013

In 2009, I wrote about a way to show density plots along both dimensions of a plot. When I ran the code again to adapt it to a new project, it didn't work because ggplot2 has become better in the meantime. Below is the updated code. Using the gridExtra...

Read more »

ggplot2: Determining the order in which lines are drawn

August 11, 2011

In a time series, I want to plot the values of an interesting cluster versus the background. However, if I'm not careful, ggplot will draw the items in an order determined by their name, so background items will obscure the interesting cluster: Corr...

Read more »

Comparing two-dimensional data sets in R; take II

March 10, 2011

David commented on yesterday's post and suggested putting the continuous fitted distribution in the background and the discrete, empirical distribution in the foreground. This looks quite nice, although there's a slight optical illusion that makes the c...

Read more »

Comparing two-dimensional data sets in R

March 9, 2011

I wanted to fit a continuous function to a discrete 2D distribution in R. I managed to do this by using nls, and wanted to display the data. I discovered a nice way to compare the actual data and the fit using ggplot2, where the background is the real ...

Read more »

Learning ggplot2: 2D plot with histograms for each dimension

September 3, 2009

Update (April 2013): The code below no longer works with newer ggplot2 versions; here is an updated version. I have two 2D distributions and want to show on a 2D plot how they are related, but I also want to show the histograms (actually, density plots in this case) for each dimension. Thanks to ggplot2 and a Learning R post, I have sort of managed to do what I want to have:

Read more »