569 search results for "sql"

Bot Botany – K-Means and ggplot2

September 2, 2010
By
Bot Botany – K-Means and ggplot2

So if you had a robot that was an expert at botany - would you have a bot botanist?  Among other things, it would need to to distinguish flowers through vision and image processing, and be able to classify various kinds of plants based upon specif...

Read more »

Bot Botany – K-Means and ggplot2

September 2, 2010
By
Bot Botany – K-Means and ggplot2

So if you had a robot that was an expert at botany - would you have a bot botanist?  Among other things, it would need to to distinguish flowers through vision and image processing, and be able to classify various kinds of plants based upon specif...

Read more »

GEO database: curation lagging behind submission?

August 30, 2010
By
GEO database: curation lagging behind submission?

I was reading an old post that describes GEOmetadb, a downloadable database containing metadata from the GEO database. We had a brief discussion in the comments about the growth in GSE records (user-submitted) versus GDS records (curated datasets) over time. Below, some quick and dirty R code to examine the issue, using the Bioconductor GEOmetadb

Read more »

Producing grids of plots in R with ggplot2: A journey of discovery

August 26, 2010
By
Producing grids of plots in R with ggplot2: A journey of discovery

I’ve just gone through a bit of a ‘journey of discovery’ in R while trying to plot a grid of plots for one of the research projects I’m doing. I wanted to write a simple function which could produce this grid of plots from a CSV file, allowing me to easily view the trends of

Read more »

How Safe is Your Money?

August 24, 2010
By
How Safe is Your Money?

The FDIC regularly publishes a Failed Bank List and related statistics.  This post uses data in the original XLS from the FDIC web site which is formatted for human consumption to produce the charts below using R.  Note that 2010 data be...

Read more »

Using R for Introductory Statistics, Chapter 3.4

August 21, 2010
By
Using R for Introductory Statistics, Chapter 3.4

...a continuing journey through Using R for Introductory Statistics, by John Verzani. Simple linear regression Linear regression is a kooky term for fitting a line to some data. This odd bit of terminology can be blamed on Sir Francis Galton, a proli...

Read more »

Using R for Introductory Statistics, Chapter 3.4

August 21, 2010
By
Using R for Introductory Statistics, Chapter 3.4

...a continuing journey through Using R for Introductory Statistics, by John Verzani. Simple linear regression Linear regression is a kooky term for fitting a line to some data. This odd bit of terminology can be blamed on Sir Francis Galton, a proli...

Read more »

Programming Language Popularity: StackOverflow and Ohloh

August 17, 2010
By
Programming Language Popularity: StackOverflow and Ohloh

In the following example, programming language popularity is measured based upon two data sets.  The first is the number of  contributors associated with a language on ohloh.net.  The second is tag usage at stackoverflow.c...

Read more »

Handling Large CSV Files in R

A follow-up of my previous post Excellent Free CSV Splitter. I asked a question at LinkedIn about how to handle large CSV files in R / Matlab. Specifically, Quotationsuppose I have a large CSV file with over 30 million number of rows, both Matlab / R lacks memory when importing the data. Could you...

Read more »

Save R plot as a BLOB

July 30, 2010
By

I recently posed a question on stackoverflow on whether anyone knew an efficient way to save an R plot to a MySQL database as a BLOB. My plan was to use my personal desktop to perform R routines and save them to a web server, where they could then be accessed and displayed on

Read more »