Forest plots using R and ggplot2

October 31, 2010
By
Forest plots using R and ggplot2

Forest plots are most commonly used in reporting meta-analyses, but can be profitably used to summarise the results of a fitted model. They essentially display the estimates for model parameters and their corresponding confidence intervals. Matt Shotwell just posted a message to the R-help mailing list with his lattice-based solution to the problem of creating forest plots in R. I

Read more »

ABC lectures [finale]

October 31, 2010
By
ABC lectures [finale]

The latest version of my ABC slides is on slideshare. To conclude with a pun, I took advantage of the newspaper clipping generator once pointed out by Andrew. (Note that nothing written in the above should be taken seriously.) On the serious side, I managed to cover most of the 300 slides (!) over the

Read more »

Scatter plot with ggplot2

October 31, 2010
By
Scatter plot with ggplot2

I decided to go use ggplot2 more frequently and go through everything. For future reference I will start a series of blog posts on ggplot2. Scatter plot: In my next post, I will change the axis labels.

Read more »

Presenting Immer’s barley data

October 31, 2010
By
Presenting Immer’s barley data

Last time I talked about adapting graphs for presentations.  This time I’m putting some of the concepts I discussed there into action, with a presentation of Immer’s barley dataset.  This is a classic dataset, originally published in 1934; in 1993 Bill Cleveland mentioned it in his book Visualising Data on account of how it may

Read more »

How to buy a used car with R (part 1)

October 31, 2010
By
How to buy a used car with R (part 1)

nStrict Standards: Non-static method StringParser_Node::destroyNode() should not be called statically, assuming $this from incompatible context in /afs/ir.stanford.edu/users/k/n/knoepfle/cgi-bin/flatpress/fp-plugins/bbcode/inc/stringparser.class.php on line 358I’m in the process of buying a used car. Since I enjoy making these decisions as complicated as possible, I’ve written some R code to scrape relevant websites for informative data. I’ve written this up as a...

Read more »

How to buy a used car with R (part 1)

October 31, 2010
By
How to buy a used car with R (part 1)

I’m in the process of buying a used car. Since I enjoy making these decisions as complicated as possible, I’ve written some R code to scrape relevant websites for informative data. I’ve written this up as a blog entry because I think it’s a decent example of how one might use the XML...

Read more »

Errors in Ghcn Inventories

October 30, 2010
By
Errors in Ghcn Inventories

In the debate over the accuracy of the global temperature nothing is more evident than errors in the location data for stations in the GHCN inventory. That inventory is the primary source for all the temperature series. One question is “do these mistakes make a difference?” If one believes as I do that the record

Read more »

How to Start Using (pgf)Sweave in LyX in One Minute

October 30, 2010
By

regor Gorjanc published an interesting article “Using Sweave with LyX” in R News in 2008, which (I believe) makes it much easier to use Sweave. I use command-line tools a lot every day, but I am still “GUI-addicted”. (I don’t want to comment more about Microsoft Word here.) LyX is a somewhat WYSIWYG tool based

Read more »

A question from the R list

October 30, 2010
By
A question from the R list

I am currently working on rectifying the GHCN station list to improve the location information. Its the kind of database work that is mind numbingly tedious and a PITA in R. not because R lacks capabilities, its just tough and not very sexy to matching and fuzzy matching and greping and blah blah blah. Instead,

Read more »

Findings increasingly novel, scientists say…

October 29, 2010
By
Findings increasingly novel, scientists say…

…was the tongue-in-cheek title of an image that I posted to Twitpic this week. It shows the usage of the word “novel” in PubMed article titles over time. As someone correctly pointed out at FriendFeed, it needs to be corrected for total publications per year. It was inspired by a couple of items that caught

Read more »

Could someone please set this as the new R default in base graphics?

October 29, 2010
By

par (mar=c(3,3,2,1), mgp=c(2,.7,0), tck=-.01) Thank you....

Read more »

The stimulus, mapped

October 29, 2010
By
The stimulus, mapped

Edward Tufte created this "Lights-On Map", animating the grants under the American Recovery and Investment Act (more commonly known as the Stimulus): Click the image to see the lights come on over time. It's a nice visualization of the distribution of the stimulus projects over the country and time, but I'm actually surprised that it doesn't do a great...

Read more »

Because it’s Friday: Werner Heisenberg at a traffic stop

October 29, 2010
By
Because it’s Friday: Werner Heisenberg at a traffic stop

I saw this on in a random image stream and it made me chuckle: A little googling suggests MathematiciansPictures.com is the source, where you can also get Heisenberg on a mug. Whether your coffee is in or outside the mug is, unfortunately, indetermin...

Read more »

SAS vs Open Source

October 29, 2010
By

SAS CEO Jim Goodnight elicited guffaws from the audience during his keynote at JSM this year when he said "I have a problem with government using open source software". Now it seems Goodnight is at the center of another controversy related to open source, having dismissed the growth of open-source software for business intelligence in an interview with CBR...

Read more »

What I’ve been up to..

October 29, 2010
By

It's been nearly a year since I posted here and much has changed. The obvious and most important change is a second new addition to our family which I've been blogging about elsewhere. On the work front I was able to publish a paper about some of my ...

Read more »

World Economic Forum Data Visualization Challenge

October 28, 2010
By

Jaidev Deshpande writes: The World Economic Forum recently posed a data visualization problem. The dataset is a survey of experts from the so called "Agenda Councils" of the WEF. Here are the details. The dataset primarily contains the experts...

Read more »

Adapting graphs for presentations

October 28, 2010
By
Adapting graphs for presentations

I’ve just finished reading slide:ology by Nancy Duarte. It contains lots of advice about how to convey meaning through aesthetics. The book has a general/business presentation focus, but it got me wondering about how to apply the ideas in a scientific context.  Since graphs from a big part of most scientific talks, and since that’s

Read more »

Integrating R with C++: Rcpp, RInside, and RProtobuf

October 28, 2010
By

Last week the R “intergrouplet” at Google invited Dirk Eddelbuettel and Romain Francois to come and give a Tech Talk about their joint work on Rcpp, RInside, and RProtoBuf. Dirk and Romain agreed and also let us share the video of their talk:We were also happy to have John Chambers,...

Read more »

Integrating R with C++: Rcpp, RInside, and RProtobuf

October 28, 2010
By

Last week the R “intergrouplet” at Google invited Dirk Eddelbuettel and Romain Francois to come and give a Tech Talk about their joint work on Rcpp, RInside, and RProtoBuf. Dirk and Romain agreed and also let us share the video of their talk:We were also happy to have John Chambers,...

Read more »

Google tech talk / Rcpp, … presentation on youtube

October 28, 2010
By

Following this post, the 90 minutes presentation is now available to watch on youtube:

Read more »

Revolution Analytics’ Chief Scientist

October 28, 2010
By

Lee Edlefsen, who has led the Revolution R development team in Seattle for more than three years, is now the Chief Scientist for Revolution Analytics. With more than 30 years experience in high-performance and statistical computing, this is a great position for Lee. There's more details in the press release linked below. Revolution Analytics: Revolution Analytics names Chief Scientist...

Read more »

R is Hot: Part 4

October 28, 2010
By

This is Part 4 of a five-part article series, with new parts published each Thursday. You can download the complete article from the Revolution Analytics website. High Quality Graphics, Made Easy R is especially useful for generating charts and graphics, quickly and easily. The ability to create visual plots of complex data is more than just a handy trick;...

Read more »

Lattice vs. ggplot2

October 28, 2010
By

Both lattice and ggplot2 seem really interesting and worthy of learning. But I only have time to learn one of them, and the choice is not an easy one.Here is an awesome reference; this blog is generally very interesting; and here is something...

Read more »

Lattice vs. ggplot2

October 28, 2010
By

Both lattice and ggplot2 seem really interesting and worthy of learning. But I only have time to learn one of them, and the choice is not an easy one.Here is an awesome reference; this blog is generally very interesting; and here is something...

Read more »

Random generators for parallel processing

October 28, 2010
By
Random generators for parallel processing

Given the growing interest in parallel processing through GPUs or multiple processors, there is a clear need for a proper use of (uniform) random number generators in this environment. We were discussing the issue yesterday with Jean-Michel Marin and briefly looked at a few solutions: given p parallel streams/threads/processors, starting each generator with a random

Read more »

Google Tech Talk on Integrating R and C++: video and slides

October 27, 2010
By

Last Friday, Romain and I were guests of the R intergrouplet (what an adorable name!) at Google's headquarter in Mountain View. This arose out of discussions following useR! 2010 where we met Google's Murray Stokely. There appears to be ever increasi...

Read more »

R Cookbook

October 27, 2010
By

Following up on the successful "R in a Nutshell", O'Reilly has just published a new book on R, The R Cookbook, by Paul Teetor. Here's the description: Perform data analysis with R quickly and efficiently using the task-oriented recipes in this cookbook. The R language and environment include everything necessary to perform statistical work right out of the box,...

Read more »

InfoWorld: R a programming language "on the rise"

October 27, 2010
By

In an article looking at once-niche programming languages that are now being deployed in businesses, R is named as one of 7 programming languages on the rise: R is another Swiss Army Knife of numerical and statistical routines for hacking through the big data sets -- collections big enough that it might be better called a Swiss Army Machete....

Read more »

A million ? what are the odds…

October 27, 2010
By
A million ? what are the odds…

50 days ago, I published a post, here, on forecasting techniques. I was wondering what could be the probability to have, by the end of this year, one million pages viewed (from Google Analytics) on this blog. Well, initially, it was on my blog at t...

Read more »