Monthly Archives: August 2011

Using the google prediction API from R

August 10, 2011

Google has a "black box" prediction API that they provide for building recommender systems or filtering spam. They also provide an R package for interfacing with that API, but try as I might, I cannot get it to work under Windows. Here are ...


Plotting molecular properties for (sub)sets

August 10, 2011

For a toxicology paper we are writing up, I need to create a few plots showing how the toxic and non-toxic molecules differ (or not) with respect to a few molecular properties, such as logP or the molecular weight. The rcdk package provides all, of cou...


A 60-second survey for R users

August 10, 2011

I'm doing a little research to validate estimates of the size of the R user community. If you're an R user, please take a minute to complete this three-question survey on R usage at your organization. Thanks in advance. Revolution Analytics: R user base survey


Informational Easing: A Change In F.O.M.C. Expectations

August 10, 2011

Let's analyze the latest FOMC policy move. The FOMC met yesterday and changed its communications strategy. How so? Well, until yesterday, the statement (as of June 22, 2011) had been saying: "The Committee continues to anticipat...


Scraping web data in R

August 10, 2011

In my last post, I went to a lot of effort to scrape the PMI index off the ISM website. It turns out that effort was unnecessary: commenter "senne" pointed out that this index is available from FRED under the symbol NAPM. ...


Using a “pure infographic” to explore differences between information visualization and statistical graphics

August 10, 2011

Our discussion on data visualization continues. On one side are three statisticians: Antony Unwin, Kaiser Fung, and myself. We have been writing about the different goals served by information visualization and statistical graphics. On the other side are graphics experts (sorry for the imprecision, I don’t know exactly what these people do in their day jobs ...


Multiple cores in R, revisited

August 10, 2011

The bigmemory package, in combination with doMC, provides at least a partial solution for sharing a large data set across multiple cores in R. With this solution you can work on the same matrix from several worker processes. It is also a very scalable solution. ...


Coding, GUIs and Statistical Rituals

August 10, 2011

I was recently inspired to comment on this blog post, which asks whether R is a cure for ‘mindless statistics’. Anyone who is familiar with statistics as used in applied fields like epidemiology, sociology, and the social sciences generally will be familiar with the idea of a ‘statistical ritual’. Rather than think about the proper statistical approach to every question,


What do you want to see at useR 2012?

August 9, 2011

This year's useR! conference at Warwick University is less than a week away, but planning is already underway for useR! 2012, to be held at Vanderbilt University in Nashville. If you're planning to attend, conference organizer Frank Harrell is looking for your input: The 2012 R User Conference - useR! 2012 - will be held in Nashville Tennessee USA,...


Amazon Machine Image Created With RTextTools Pre-installed

We recently created an AMI for Amazon's EC2 cloud computing service. Users with AWS accounts can access the public AMI by searching for ami-817eb8e8. The AMI is based on Drew Conway's excellent AMI, but with R 2.13 loaded and RTextTools and maxent installed.
