Monthly Archives: September 2013

Shiny App for Polling Forums

September 5, 2013
By

In 2010, Crystal Palace FC were in administration and had 10 points deducted during the year. They only survived in the Championship on the last day of the season A year ago, they started the league with three consecutive losses and were relegation favourites. Fast forward 12 months and they are again strong tips for

Read more »

Metaprogramming in R with an example: Beating lazy evaluation

September 5, 2013
By

Functional languages allows us to treat functions as types. This brings us a distinct advantage of being able to write a code that generates further code, this practise is generally known as metaprogramming. As a functional language R project provides ...

Read more »

Type conversion and you (or and R)

September 5, 2013
By

Types and type conversion can be a tricky and intricate topic, and sometimes can lead to some real head-scratcher issues in R. Hence a somewhat confusing title.This is for people still relatively new to R, and I will skip some gory details. Actually I will skip most of them, the canonical source for type and conversion information is the...

Read more »

Text Mining the Complete Works of William Shakespeare

September 5, 2013
By
Text Mining the Complete Works of William Shakespeare

I am starting a new project that will require some serious text mining. So, in the interests of bringing myself up to speed on the tm package, I thought I would apply it to the Complete Works of William Shakespeare and just see what falls out. The first order of business was getting my hands

Read more »

After Three Months I Cannot Reproduce My Own Book

September 5, 2013
By
After Three Months I Cannot Reproduce My Own Book

I thought I could easily jump to a high standard (reproducibility), but I failed. Some of you may have noticed that the knitr book is finally out. Amazon is offering a good price at the moment, so if you are interested, you'd better hurry up. I a...

Read more »

Two presentations on Big Data Big Analytics with R

September 4, 2013
By

Last week, Revolution Analytics' US Chief Scientist Mario Inchiosa gave a presentation on high-performance predictive analytics in R and Hadoop, showing how Revolution R Enterprise 7 will bring the high-performance predictable algorithms of ScaleR to run on Cloudera and Hortonworks Hadoop clusters, while retaining the same easy-to-use interface from the R language. Here are the slides from his presentation,...

Read more »

The Beta Prior, Likelihood, and Posterior

September 4, 2013
By
The Beta Prior, Likelihood, and Posterior

The Beta distribution (and more generally the Dirichlet) are probably my favorite distributions.  However, sometimes only limited information is available when trying set up the distribution.  For example maybe you only know the lowest likely value, the highest likely value and the median, as a measure of center.  That information is sufficient to construct a

Read more »

SPSS looked great! 20 years ago…

September 4, 2013
By
SPSS looked great! 20 years ago…

For some reason someone dropped a pamphlet advertising SPSS for Windows 3.0 in my mail box at work. This means that the pamphlet, and the advertised version of SPSS, should be at least 20 years old! These days I’m happily using R for everything but if I was going to estimate any models 20 years ago SPSS actually looked...

Read more »

How to build a single-node Hadoop/R system

September 3, 2013
By

The best way to learn any software is to use it, and if you're new to Hadoop and want to try using Hadoop with R the process of setting up your own Hadoop cluster can be daunting (to say the least). But if learning is the goal, the key is that you don't need to install a full cluster....

Read more »

Scheduling R Tasks with Crontabs to Conserve Memory

September 3, 2013
By
Scheduling R Tasks with Crontabs to Conserve Memory

One of R’s biggest pitfalls is that eats up memory without letting it go.  This can be a huge problem if you are running really big jobs, have a lot of tasks  to run, or there are multiple users on your local computer or r server.  When I run huge jobs on my mac, I

Read more »