Monthly Archives: November 2010

My First R Package: infochimps

November 20, 2010
By

I have finally taken the plunge and created my first R package! As frequent readers will know, I often sing the praises of infochimps, a startup out of Austin, TX attempting to be the world’s data clearinghouse. While infochimps is an excellent resource for data sets, they also provide their own set excellent data

Read more »

R function for reading big tables

November 20, 2010
By

HugeFileLoader = function(path, sep = "\t", skip = 0, header = T, nrows = 10){### counts the number of lines using shell wc command, and converts the output to numericline.count = paste("wc -l ", path, sep = "")row.count = as.numeric(strsplit(system(li...

Read more »

R function for reading big tables

November 20, 2010
By

HugeFileLoader = function(path, sep = "\t", skip = 0, header = T, nrows = 10){### counts the number of lines using shell wc command, and converts the output to numericline.count = paste("wc -l ", path, sep = "")row.count = as.numeric(strsplit(system(li...

Read more »

ShortCut[R]: locator

November 20, 2010
By
ShortCut[R]: locator

Welcome to my new category: ShortCut! Here I'll shortly explain some smart features, unknown extensions or uncommon pathways of going for gold. Today it's about the Gnu R tool locator.

Read more »

Running R on remote computer via local emacs

November 19, 2010
By

Aquamacs in Mac OS X and Emacs in Linux/unix can be used to edit remote (and local) R code and submit pieces of code to a remote R session. For this to work you need to install ess for emacs (Aquamacs comes with ess by default now, I don't know about e...

Read more »

Running R on remote computer via local emacs

November 19, 2010
By

Aquamacs in Mac OS X and Emacs in Linux/unix can be used to edit remote (and local) R code and submit pieces of code to a remote R session. For this to work you need to install ess for emacs (Aquamacs comes with ess by default now, I don't know about e...

Read more »

Finally! A practical R book on Data Mining: "Data Mining With R, Learning with Case Studies," by Luis Torgo

November 19, 2010
By
Finally! A practical R book on Data Mining:  "Data Mining With R, Learning with Case Studies," by Luis Torgo

I've been a bit busy lately with a few big things, however, I wanted to stop by and mention a fantastic book for those who have been following along the R examples.  Anyone who's followed my blog knows that I'm big on practical books with examples...

Read more »

Is there a Market for Premium R Packages?

November 19, 2010
By
Is there a Market for Premium R Packages?

Nathan Yau, of the excellent FlowingData blog, recently asked on his Twitter stream: I wonder if there’s a market for premium R packages, like there is for say, @wordpress themes and plugins There are some great packages available for R, all of which are currently free. I think it would be great if authors like

Read more »

Airport security: science vs backlash

November 19, 2010
By

The United States has recently introduced millimeter wave and backscatter x-ray scanners to the security screening process in many airports, prompting a backlash in some quarters. Much of the opposition is centered around the invasion of privacy: the scanners generate an image of the traveller's naked body. There are also health concerns, at least for the backscatter x-ray variants...

Read more »

Making R growl

November 18, 2010
By
Making R growl

Spending the day churning through large data set or doing some heavy-duty number crunching? What is one to do while the computer is running in overdrive? We’ll, for one, you could get a steaming cup of joe and write a … Continue reading →

Read more »