Monthly Archives: October 2012

"Advanced R" Course – November 15-16, 2012

October 30, 2012
By

This is the last post about the course. As places are limited, please register as soon as possible! Milano R net, in collaboration with Quantide, organizes "Advanced R" Course November 15-16, 2012 Course description This course is designed for those … Continue reading →

Read more »

Introducing R and Biostatistics to first year LCG students (2012 version)

October 30, 2012
By
Introducing R and Biostatistics to first year LCG students (2012 version)

On Friday November 9th I’ll be giving a talk to the first year students from the Undergraduate Program on Genomic Sciences (LCG in Spanish) during their “Seminar 1: Introduction to Bioinformatics” course. It’s just like I did a year ago as I documented in my post Introducing Biostatistics to first year LCG students. Well, this time I’ll change things...

Read more »

Can We Live Without Backslashes?

October 30, 2012
By
Can We Live Without Backslashes?

Two months ago there was a discussion in the ESS mailing list about Emacs/ESS started by Paul Johnson, who claimed "Emacs Has No Learning Curve". While this sounds impossible, he really has some good points, e.g. he encourages beginners to look at the ...

Read more »

Tracking Hurricane Sandy with Open Data and R

October 29, 2012
By
Tracking Hurricane Sandy with Open Data and R

Hurricane Sandy is shaping up to be a major, and very dangerous, meteorological event for the US's East coast. Naturally, everyone is looking for the latest information and forecasts. Fortunately, the wealth of public meteorological data available on the open web, combined with real-time on-the-ground updates via social media, means that an ecosystem of on-line apps is now available...

Read more »

Working with Shootout – 2012 in R (001)

October 29, 2012
By
Working with Shootout – 2012 in R (001)

I have downloaded (from the IDRC) the ASCI files of the Shootout 2012 (see: Shootout 2012 files), so I can work with the data  to develop a model and predict a Validation Set.For that task I have a "Calibration Set", and a  "Test Se...

Read more »

Temporal network of information diffusion in Twitter

October 29, 2012
By

Millions of tweets, retweets and mentions are exchanged in Twitter everyday about very different subjects, events, opinions, etc. While aggregating this data over a time window might help to understand some properties of those processes in online social networks, the … Continue reading →   Related posts: Temporal...

Read more »

Pull Yahoo Finance Key-Statistics Instantaneously Using XML and XPath in R

October 29, 2012
By
Pull Yahoo Finance Key-Statistics Instantaneously Using XML and XPath in R

This two-part blog post I published a day ago required key-stats from Yahoo Finance for all the companies in the control group I created for my research.  I wanted all the key-stats pulled, arranged in a data-frame and then present them side-...

Read more »

ggplot2 Pinterest

October 29, 2012
By

I don’t understand the website Pinterest, but it looks pretty (especially on the iPad), and an undergraduate student said it was the greatest thing since Facebook, so I thought I would give it a shot. The idea is that Pinterest … Continue reading →

Read more »

lag function for data frames

October 29, 2012
By
lag function for data frames

When applying the stats::lag() function to a data frame, you probably expect it will pad the missing time periods with NA, but lag() doesn’t. For example: Nothing happened. Here is an alternative lag function made for this situation. It pads … Continue reading →

Read more »

Charting Wikipedia interest in GOP candidates with googleVis

October 29, 2012
By

I recently posted an article on how to collate Wikipedia page views As there is a time component to this, it seemed appropriate to use the googleVis Package to visualize changes in page hits in the Google Motion chart For this exercise, I ran the wikiFun function covered in the last post to collate page

Read more »