Blog Archives

Find Duplicate Records in a File

September 24, 2010
By
Find Duplicate Records in a File

In the world of data preparation a common task is to identify duplicate records in a file or data set.  A few years ago, I did most development work in Java, and shudder to think of the amount of code required to accomplish this sort of task. &nbs...

Read more »

New World Bank Data Available

September 22, 2010
By
New World Bank Data Available

Just announced:  World Bank Data features and data are available.  Previous posts have demonstrated how to access and plot this data using R (including the use of the R WDI package).  The chart above can be created using the following pr...

Read more »

New World Bank Data Available

September 22, 2010
By
New World Bank Data Available

Just announced:  World Bank Data features and data are available.  Previous posts have demonstrated how to access and plot this data using R (including the use of the R WDI package).  The chart above can be created using the following pr...

Read more »

Elder Research Two Day Course

September 18, 2010
By

... or what I did on my summer vacation...Just got back from the Elder Research Two Day Course "Tools for Discovering Patterns in Data".  It was a great course that (while not R specific) provides a great overview of Data Mining tools and tec...

Read more »

Elder Research Two Day Course

September 18, 2010
By

... or what I did on my summer vacation...Just got back from the Elder Research Two Day Course "Tools for Discovering Patterns in Data".  It was a great course that (while not R specific) provides a great overview of Data Mining tools and tec...

Read more »

Ah Bach…

September 7, 2010
By
Ah Bach…

As announced by David Smith over at Revolution Analytics,  a ggplot2 Case Study Competition is on...   Rather than blogging for the last few days, I cobbled together an entry.  It is not a particularly mind bending use of ...

Read more »

Ah Bach…

September 7, 2010
By
Ah Bach…

As announced by David Smith over at Revolution Analytics,  a ggplot2 Case Study Competition is on...   Rather than blogging for the last few days, I cobbled together an entry.  It is not a particularly mind bending use of ...

Read more »

Bot Botany – K-Means and ggplot2

September 2, 2010
By
Bot Botany – K-Means and ggplot2

So if you had a robot that was an expert at botany - would you have a bot botanist?  Among other things, it would need to to distinguish flowers through vision and image processing, and be able to classify various kinds of plants based upon specif...

Read more »

Bot Botany – K-Means and ggplot2

September 2, 2010
By
Bot Botany – K-Means and ggplot2

So if you had a robot that was an expert at botany - would you have a bot botanist?  Among other things, it would need to to distinguish flowers through vision and image processing, and be able to classify various kinds of plants based upon specif...

Read more »

Better than Average

August 31, 2010
By
Better than Average

The NIST's The Engineering Statistics Handbook includes an Introduction to Time Series Analysis which provides a great way of demonstrating how R can be used to make such calculations.  This post replicates the analys...

Read more »