Monthly Archives: September 2013

Type conversion and you (or and R)

September 5, 2013

Types and type conversion can be a tricky and intricate topic, and can sometimes lead to real head-scratchers in R; hence the somewhat confusing title. This is for people still relatively new to R, and I will skip some gory details. Actually, I will skip most of them; the canonical source for type and conversion information is the...

Text Mining the Complete Works of William Shakespeare

September 5, 2013

I am starting a new project that will require some serious text mining. So, in the interests of bringing myself up to speed on the tm package, I thought I would apply it to the Complete Works of William Shakespeare and just see what falls out. The first order of business was getting my hands

After Three Months I Cannot Reproduce My Own Book

September 5, 2013

I thought I could easily jump to a high standard (reproducibility), but I failed. Some of you may have noticed that the knitr book is finally out. Amazon is offering a good price at the moment, so if you are interested, you'd better hurry up. I a...

Two presentations on Big Data Big Analytics with R

September 4, 2013

Last week, Revolution Analytics' US Chief Scientist Mario Inchiosa gave a presentation on high-performance predictive analytics in R and Hadoop, showing how Revolution R Enterprise 7 will bring the high-performance predictive algorithms of ScaleR to Cloudera and Hortonworks Hadoop clusters, while retaining the same easy-to-use interface from the R language. Here are the slides from his presentation,...

The Beta Prior, Likelihood, and Posterior

September 4, 2013

The Beta distribution (and, more generally, the Dirichlet) is probably my favorite distribution. However, sometimes only limited information is available when trying to set up the distribution. For example, maybe you only know the lowest likely value, the highest likely value, and the median as a measure of center. That information is sufficient to construct a

SPSS looked great! 20 years ago…

September 4, 2013

For some reason, someone dropped a pamphlet advertising SPSS for Windows 3.0 in my mailbox at work. This means that the pamphlet, and the advertised version of SPSS, should be at least 20 years old! These days I’m happily using R for everything, but if I had been estimating models 20 years ago, SPSS actually looked...

How to build a single-node Hadoop/R system

September 3, 2013

The best way to learn any software is to use it, but if you're new to Hadoop and want to try using it with R, the process of setting up your own Hadoop cluster can be daunting (to say the least). If learning is the goal, though, the key is that you don't need to install a full cluster....

Scheduling R Tasks with Crontabs to Conserve Memory

September 3, 2013

One of R’s biggest pitfalls is that it eats up memory without letting it go. This can be a huge problem if you are running really big jobs, have a lot of tasks to run, or there are multiple users on your local computer or R server. When I run huge jobs on my Mac, I

Using Google maps API and R

September 3, 2013

This post shows how to use the Google Maps API with R. Combine the first part with plyr and it becomes a very powerful tool in just a few lines of code. You can find a gist in RMarkdown with the code here or click below to continue reading.

Call for papers: Budapest BI Forum

September 3, 2013

I am really happy to share some news with all R users about an upcoming conference to be held in Budapest, Hungary. The organisers gave birth to the Hungarian Open Source BI Conference and the Innovative BI conference last year, and now building on the...
