Blog Archives

Bot Botany – K-Means and ggplot2

September 2, 2010
By
Bot Botany – K-Means and ggplot2

So if you had a robot that was an expert at botany - would you have a bot botanist?  Among other things, it would need to to distinguish flowers through vision and image processing, and be able to classify various kinds of plants based upon specif...

Read more »

Better than Average

August 31, 2010
By
Better than Average

The NIST's The Engineering Statistics Handbook includes an Introduction to Time Series Analysis which provides a great way of demonstrating how R can be used to make such calculations.  This post replicates the analys...

Read more »

Better than Average

August 31, 2010
By
Better than Average

The NIST's The Engineering Statistics Handbook includes an Introduction to Time Series Analysis which provides a great way of demonstrating how R can be used to make such calculations.  This post replicates the analys...

Read more »

Fractals in R

August 27, 2010
By
Fractals in R

Atte Tenkanen had a blog on fractals using R for a time. Much of his source code is still available online.  To produce his version of the Mandelbrot set:source('http://users.utu.fi/attenka/mandelbrot_set.R')Fractals (such...

Read more »

Fractals in R

August 27, 2010
By
Fractals in R

Atte Tenkanen had a blog on fractals using R for a time. Much of his source code is still available online.  To produce his version of the Mandelbrot set:source('http://users.utu.fi/attenka/mandelbrot_set.R')Fractals (such...

Read more »

How Safe is Your Money?

August 24, 2010
By
How Safe is Your Money?

The FDIC regularly publishes a Failed Bank List and related statistics.  This post uses data in the original XLS from the FDIC web site which is formatted for human consumption to produce the charts below using R.  Note that 2010 data be...

Read more »

Map of Upcoming Ruby Conferences

August 21, 2010
By
Map of Upcoming Ruby Conferences

One of the top searches on rubyflow is “conference”.  A recent post showed how to create a map with the location of the 2010 R User Conference.  So why not expand on the subject and create a map with numerous conference locations thr...

Read more »

Programming Language Popularity: StackOverflow and Ohloh

August 17, 2010
By
Programming Language Popularity: StackOverflow and Ohloh

In the following example, programming language popularity is measured based upon two data sets.  The first is the number of  contributors associated with a language on ohloh.net.  The second is tag usage at stackoverflow.c...

Read more »

GitHub Stats on Programming Languages

August 9, 2010
By
GitHub Stats on Programming Languages

GitHub has become a popular site for Open Source Developers to stash code and collaborate on projects.  The following are some stats and analysis related to programming languages in use based upon the number of users and repositories.  T...

Read more »

Iris Data Set Visualization Web App in < 100 LOC

August 7, 2010
By
Iris Data Set Visualization Web App in < 100 LOC

The iris data set pops up pretty regularly in statistical literature.  It consists of 50 records from three species of Iris flowers (Iris setosa, Iris virginica and Iris versicolor).   I came across it recently while reading

Read more »