2087 search results for "Twitter"

Oh (de)bugger!

August 26, 2010

By number of questions asked, R passed MATLAB for the first time on Stack Overflow today. Thus it seems an appropriate time to write my first R-based post. This post concerns what to do when your R code goes pear-shaped. Back in June there were a couple of very good videos on R debugging that

Global Temperature Proxy Reconstructions ~ now with CO2 forcing

August 26, 2010

Previously, I did a simple Bayesian projection of recent temperature using proxy data and the methods shown in McShane and Wyner (2010). I showed that when you take out the last 30 years of data (1969~1998), the projection does not track the recent uptick in temperatures well. The “projection” is a simple nonparametric bootstrap which

From igraph to network and back again

August 25, 2010

In an effort to achieve this (last paragraph), I created a couple of functions to coerce networks as ‘igraph’ objects to networks as ‘network’ objects and vice versa. I wrapped them into a package called ‘intergraph’ which I just uploaded to my personal miniCRAN. Please mind, this is still an experimental version! Might be bug-infested.

R and Analytics: A good career choice

August 24, 2010

According to Microsoft, the hottest three new tech majors are: Data Mining/Machine Learning/AI/Natural Language Processing; Business Intelligence/Competitive Intelligence; and Analytics/Statistics – specifically Web Analytics, A/B Testing and statistical analysis. Tim O'Reilly concurs. Of course, R features prominently in most of these areas -- follow the links I added above for examples -- and is growing rapidly, so learning R makes...

Packing everything into a data.frame

August 23, 2010

OK, I know I talk about R too much, but I like R, so I’m going to talk about it some more. Common situation: repeat a procedure many times; each time generates some large wadge of awful-structured data, and in …

What’s for lunch? Private browsing.

August 23, 2010

Over at the Mozilla Metrics blog, Mozillan Hamilton Ulmer uses R and ggplot2 to look at when people (or at least, Firefox users who volunteered to share their usage data) enable private browsing. Turns out it isn't just "porn mode" after all: the main use is lunchtime browsing away from the employer's prying eyes. Follow the...

Abstract word clouds using R

August 23, 2010

A recent question over at BioStar asked whether abstracts returned from a PubMed search could easily be visualised as “word clouds”, using Wordle. This got me thinking about ways to solve the problem using R. Here’s my first attempt, which demonstrates some functions from the RCurl and XML packages. Update: corrected a couple of copy/paste

Newcomb, Benford, and their Dirty, Dirty Logarithms

August 22, 2010

Tom Taverner introduced me to Benford’s Law as we were eating lunch together at a statistical computing conference: If you look at the first digits of data in many naturally occurring datasets, a startling 30 percent of them are ones. “Pah!” I said. “That belies intuition! Why would one digit occur any more than another? I’d

Global Temperature Proxy Reconstructions ~ Bayesian extrapolation of warming w/ rjags

August 22, 2010

Update: fixed projection. There are a bunch of “hockey sticks” that calculate past global temperatures through the use of proxies when instrumental data is absent. There is a new one out there by McShane and Wyner (2010) that’s creating quite a stir in the blogosphere (here, here, here, here). The main takeaway being that

Taking R to the Limit, Part II – Large Datasets in R

August 20, 2010

For Part I, Parallelism in R, click here. Tuesday night I again had the opportunity to present on high performance computing in R, at the Los Angeles R Users’ Group. This was the second part of a two-part series called “Taking R to the Limit: High Performance Computing in R.” Part II discussed ways to work with large datasets...
