Monthly Archives: September 2011

In case you missed it: August Roundup

September 8, 2011

In case you missed them, here are some articles from August of particular interest to R users. A contest to showcase applications of R for businesses is offering $20,000 in prizes from Revolution Analytics. Three new open-source packages integrating R and Hadoop will be introduced by Revolution Analytics' CTO David Champagne in a webinar on September 21. Dirk Eddelbuettel...

Read more »

Interacting with bioinformatics webservers using R

September 8, 2011

In an ideal world, all bioinformatics tools would be made available via the Web as a web service with an API, as well as a standalone package to download for local use. This is rarely the case and sometimes, even where one or the other is available, factors such as cost come into play. So

Read more »

A brief history of S&P 500 beta

September 8, 2011

Data: The data are daily returns starting at the beginning of 2007. There are 477 stocks for which there are full and seemingly reliable data. Estimation: The betas are all estimated on one year of data. The times that identify the betas mark the point at which the estimate would become available. So the betas …

Read more »
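
As a rough illustration of the estimation described in this excerpt, here is a minimal sketch of a rolling one-year beta in R. It is not the author's code: the returns are synthetic, the 250-day window is an assumed stand-in for "one year of data", and `rolling_beta` is a hypothetical helper name. Each beta is stamped with the last day of its window, the point at which the estimate would become available.

```r
# Sketch only: synthetic data, not the S&P 500 returns used in the post.
set.seed(42)
n_days <- 600
window <- 250                                     # assumption: ~one year of trading days
market <- rnorm(n_days, mean = 0, sd = 0.01)      # synthetic index returns
stock  <- 1.2 * market + rnorm(n_days, 0, 0.01)   # synthetic stock, true beta 1.2

# Hypothetical helper: beta = cov(stock, market) / var(market) on each window,
# labelled with the final day of the window.
rolling_beta <- function(stock, market, window) {
  ends <- window:length(market)
  betas <- vapply(ends, function(end) {
    idx <- (end - window + 1):end
    cov(stock[idx], market[idx]) / var(market[idx])
  }, numeric(1))
  data.frame(available_at = ends, beta = betas)
}

betas <- rolling_beta(stock, market, window)
head(betas)
```

The ratio of covariance to variance is the same number as the slope from regressing the stock's returns on the market's returns over that window.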

Multiple plots with subplot in R

September 8, 2011

I'm in the middle of creating a poster and wanted to compress the content by transforming some of the charts into subplots of other charts. I did a little survey and found that there is a TeachingDemos package on CRAN that fits my needs. Well, the parameterization of the functions is a bit tricky, but after a few tries...

Read more »
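
For readers who want to try the idea of a chart inside a chart, here is a minimal sketch. It deliberately uses base R graphics (`par(fig = ..., new = TRUE)`) instead of the TeachingDemos function the post refers to, so it runs without extra packages. The data and the inset position are made up for illustration.

```r
# Sketch only: base-graphics inset, not the TeachingDemos approach from the post.
set.seed(7)
x <- rnorm(200)
y <- x + rnorm(200, sd = 0.5)

# Main chart.
plot(x, y, main = "Main chart with an inset")

# Inset region in normalized device coordinates: c(x1, x2, y1, y2).
inset_region <- c(0.55, 0.95, 0.12, 0.50)   # assumption: lower-right corner

# new = TRUE tells the next high-level plot not to clear the device.
old_par <- par(fig = inset_region, new = TRUE, mar = c(2, 2, 1, 1))
hist(x, main = "", xlab = "", ylab = "", col = "grey80")
par(old_par)                                # restore the previous settings
```

Restoring the saved parameters afterwards keeps later plots from being drawn into the small inset region.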

Shared and reproducible computing with OpenCPU

September 7, 2011

While looking for an online computing provider, I bumped into OpenCPU.org: OpenCPU is a new initiative to make innovations in statistics, visualization and data-science more widely applicable. I guess the idea of online analysis and visualization, and of an online cloud R computing platform, isn’t really new anymore, but the real incentive is the

Read more »

Analyzing big data in R: two presentations from useR! 2011

September 7, 2011

At last month's useR! 2011 conference at Warwick University, there were two talks on the RevoScaleR package for big data statistics in R. The first was a keynote presentation from Revolution Analytics' Chief Scientist, Lee Edlefsen. Here is the overview of his talk, Scalable Data Analysis in R: For the past several decades the rising tide of technology --...

Read more »

Information Transmission in a Social Network: Dissecting the Spread of a Quora Post

September 7, 2011

tl;dr See this movie visualization for a case study on how a post propagates through Quora. How does information spread through a network? Much of Quora’s appeal, after all, lies in its social graph — and when you’ve got a network of users, all broadcasting their activities to their neighbors, information can cascade in multiple

Read more »

Hey! I made you some Wiener processes!

September 7, 2011

Check them out. Here are thirty homoskedastic ones: > homo.wiener for (j in 1:30) {  for (i in 2:length(homo.wiener)) {          homo.wiener for (j in 1:30) {        plot( homo.wiener,           type = "l", col = rgb(.1,....

Read more »
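
The code in this excerpt was truncated by the feed, so here is a minimal, self-contained sketch of one way to simulate and plot thirty homoskedastic Wiener processes in R. It is not a reconstruction of the author's code: the path length, the seed, the colour, and the variable names `increments` and `paths` are all assumptions.

```r
# Sketch only: thirty standard Wiener paths with constant-variance increments.
set.seed(1)                 # assumption: fixed seed for reproducibility
n_paths <- 30
n_steps <- 1000             # assumption: path length is not given in the excerpt
dt <- 1 / n_steps

# Each column holds the independent N(0, dt) increments of one path.
increments <- matrix(rnorm(n_steps * n_paths, mean = 0, sd = sqrt(dt)),
                     nrow = n_steps, ncol = n_paths)

# Cumulative sums give the paths; the extra first row sets W(0) = 0.
paths <- rbind(0, apply(increments, 2, cumsum))

time <- seq(0, 1, length.out = n_steps + 1)
matplot(time, paths, type = "l", lty = 1,
        col = rgb(0.1, 0.1, 0.1, 0.3),
        xlab = "t", ylab = "W(t)")
```

Homoskedastic here means every increment has the same variance `dt`; letting `sd` vary over time would give a heteroskedastic variant.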

Link to StatDNA Guest Post

September 7, 2011

The post is officially up on the StatDNA blog. Go check it out. As I said in my previous post, this is a very rough and preliminary model. This is why my work was not any sort of formal entry, just some fun with some great data. I used a Vector Genera...

Read more »