Monthly Archives: November 2010

R function for reading big tables

November 20, 2010

HugeFileLoader = function(path, sep = "\t", skip = 0, header = T, nrows = 10){
  ### counts the number of lines using shell wc command, and converts the output to numeric
  line.count = paste("wc -l ", path, sep = "")
  row.count = as.numeric(strsplit(system(li...

Read more »
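
The HugeFileLoader excerpt above is cut off, so what follows is only a minimal sketch of the idea it describes (count the lines with wc, then hand that count to read.table), not the post's actual function. The name read_big_table and the sample_rows argument are hypothetical, and the sketch assumes a Unix-like system where wc is available.

```r
# Hypothetical helper, not the original HugeFileLoader.
read_big_table <- function(path, sep = "\t", header = TRUE, sample_rows = 10) {
  # `wc -l` prints "<count> <path>" (sometimes with leading spaces);
  # keep only the first whitespace-separated token.
  wc_out <- system(paste("wc -l", shQuote(path)), intern = TRUE)
  line_count <- as.numeric(strsplit(trimws(wc_out), "\\s+")[[1]][1])

  # Guess the column classes from the first few rows.
  head_rows <- read.table(path, sep = sep, header = header, nrows = sample_rows)
  classes <- sapply(head_rows, class)

  # nrows is only an upper bound, so a slight overestimate is harmless.
  # Adding 1 covers a final line that has no trailing newline.
  read.table(path, sep = sep, header = header,
             nrows = line_count + 1,
             colClasses = classes)
}
```

Guessing classes from a small sample can fail if a column that looks like integers in the first rows holds decimals or text further down; raising sample_rows makes that less likely.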

ShortCut[R]: locator

November 20, 2010

Welcome to my new category: ShortCut! Here I'll briefly explain some smart features, little-known extensions, or uncommon ways of going for gold. Today it's about the GNU R tool locator.

Read more »

Running R on remote computer via local emacs

November 19, 2010

Aquamacs on Mac OS X and Emacs on Linux/Unix can be used to edit remote (and local) R code and submit pieces of code to a remote R session. For this to work you need to install ESS for Emacs (Aquamacs comes with ESS by default now, I don't know about e...

Read more »

Finally! A practical R book on Data Mining: "Data Mining With R, Learning with Case Studies," by Luis Torgo

November 19, 2010

I've been a bit busy lately with a few big things; however, I wanted to stop by and mention a fantastic book for those who have been following along with the R examples. Anyone who's followed my blog knows that I'm big on practical books with examples...

Read more »

Is there a Market for Premium R Packages?

November 19, 2010

Nathan Yau, of the excellent FlowingData blog, recently asked on his Twitter stream: "I wonder if there’s a market for premium R packages, like there is for say, @wordpress themes and plugins." There are some great packages available for R, all of which are currently free. I think it would be great if authors like...

Read more »

Airport security: science vs backlash

November 19, 2010

The United States has recently introduced millimeter wave and backscatter x-ray scanners to the security screening process in many airports, prompting a backlash in some quarters. Much of the opposition is centered around the invasion of privacy: the scanners generate an image of the traveller's naked body. There are also health concerns, at least for the backscatter x-ray variants...

Read more »

Making R growl

November 18, 2010

Spending the day churning through a large data set or doing some heavy-duty number crunching? What is one to do while the computer is running in overdrive? Well, for one, you could get a steaming cup of joe and write a ...

Read more »

Competitive Data Science: An Update

November 18, 2010

A quick reminder that two competitions based around data analysis, both well suited to R, are currently underway. First, there's still plenty of time to enter the competition to predict popular R packages, announced by The Dataists and hosted at Kaggle. According to organizer Drew Conway, the competition has already received 114 entries from 21 teams. But with...

Read more »

Rapidminer + R Example for Trading

November 18, 2010

RapidMiner + R is an advanced tool that can be used to analyze trading strategies. To check its power, I made a simple example using an algorithm based on a support vector machine for predicting the next day's price, and based on it I generated ...

Read more »