586 search results for "SQL"

Producing grids of plots in R with ggplot2: A journey of discovery

August 26, 2010

I’ve just gone through a bit of a ‘journey of discovery’ in R while trying to plot a grid of plots for one of the research projects I’m doing. I wanted to write a simple function which could produce this grid of plots from a CSV file, allowing me to easily view the trends of

Read more »

How Safe is Your Money?

August 24, 2010

The FDIC regularly publishes a Failed Bank List and related statistics. This post uses data from the original XLS on the FDIC web site, which is formatted for human consumption, to produce the charts below using R. Note that 2010 data be...

Read more »

Using R for Introductory Statistics, Chapter 3.4

August 21, 2010

...a continuing journey through Using R for Introductory Statistics, by John Verzani. Simple linear regression: Linear regression is a kooky term for fitting a line to some data. This odd bit of terminology can be blamed on Sir Francis Galton, a proli...

Read more »

Programming Language Popularity: StackOverflow and Ohloh

August 17, 2010

In the following example, programming language popularity is measured based upon two data sets. The first is the number of contributors associated with a language on ohloh.net. The second is tag usage at stackoverflow.c...

Read more »

Handling Large CSV Files in R

A follow-up to my previous post, Excellent Free CSV Splitter. I asked a question on LinkedIn about how to handle large CSV files in R / Matlab. Specifically: suppose I have a large CSV file with over 30 million rows, and both Matlab and R run out of memory when importing the data. Could you...

Read more »

Save R plot as a BLOB

July 30, 2010

I recently posed a question on Stack Overflow asking whether anyone knew an efficient way to save an R plot to a MySQL database as a BLOB. My plan was to use my personal desktop to perform R routines and save them to a web server, where they could then be accessed and displayed on

Read more »

Pie Charts in ggplot2

July 29, 2010

...and other isomorphic data shape presentations... The Pie Chart has been widely criticized in recent times by statisticians. Edward Tufte goes as far as to call this the "prevailing orthodoxy." The reasons generally cited: The rel...

Read more »

Hacker News User Base Changed?

July 26, 2010

There are lots of references on Hacker News to the fact that the "good old days" are gone and that the character of the site has changed since it started. The visualization above was based on a sample of users that posted on the site in recent ti...

Read more »

Analyze Online R User Conference Data

July 19, 2010

The R User Conference 2010 will be underway shortly - and what better way to commemorate the event than to blast out some R code related to the conference? HTML tables on websites for the past three years list participants an...

Read more »