Posts Tagged ‘ Uncategorized ’

Monty Hall by simulation in R

February 3, 2012
By
Monty Hall by simulation in R

(Almost) every introductory course in probability introduces conditional probability using the famous Monte Hall problem. In a nutshell, the problem is one of deciding on a best strategy in a simple game. In the game, the contestant is asked to select one of three doors. Behind one of the doors is a great prize...

Read more »

Great Maps with ggplot2

February 2, 2012
By
Great Maps with ggplot2

The above map (and this one) was produced using R and ggplot2 and serve to demonstrate just how sophisticated R visualisations can be. We are used to seeing similar maps produced with conventional GIS platforms or software such as Processing but I hadn’t yet seen one from the R community (feel free to suggest...

Read more »

R Appears Among Top 20 Programming Languages

January 24, 2012
By
R Appears Among Top 20 Programming Languages

Speaking of R… On the 16th, TIOBE Software released the Tiobe Index of the most popular programming languages. For the first time ever, R is in the Top 20. The top spots are, no surprise, occupied by Java and C respectively. More after the jump. The way this index is assembled is…interesting. The full...

Read more »

Winning from losing

January 19, 2012
By
Winning from losing

By following twitter’s #rstats hashtag (rss feed), I recently came across a very interesting R-related blog: datanalytics.com. The first post I read from it was about setting up an on-line reading group to go through the excellent “The Elements of … Continue reading

Read more »

Quick Introduction to ggplot2

January 17, 2012
By
Quick Introduction to ggplot2

For a much better looking version of this post (where code is actually readable!), see this Github repository, which also contains some of the example datasets I use and a literate programming version of this tutorial. Introduction This is a bare-bones introduction to ggplot2, a visualization package in R. It assumes no knowledge of...

Read more »

Eisen and Elsevier have words over bill to end NIH Public Access Policy

January 10, 2012
By

This article linked below and the subsequent comments (some apparently from an Elsevier rep.) are interesting. It's not directly related to R, but is related to open source/science philosophy. Elsevier-funded NY Congresswoman Carolyn Maloney Wants to D...

Read more »

RTextTools v1.3.5: Saving models, text labels, and a game plan for 2012

RTextTools v1.3.5 addresses some key concerns that have been raised in recent months. Many of the algorithms used in RTextTools require that any new data presented to a trained classifier contain the same features as the original document-term matrix. Since this rarely (if ever) happens in the real world, I have added an originalMatrix...

Read more »

Initial impressions of RangeLab

December 30, 2011
By
Initial impressions of RangeLab

I was rummaging around in the source of R looking for trouble, as one does, when I came across what I believed to be a less than optimally accurate floating-point algorithm (function R_pos_di in src/main/arithemtic.c). Analyzing the accuracy of floating-point code is notoriously difficult and those having the required skills tend to concentrate their...

Read more »

Happy Holidays and Best Wishes for 2012

December 22, 2011
By
Happy Holidays and Best Wishes for 2012

This is just a quick note to wish you and your family a very healthy and happy holidays and wonderful New Year! I hope you enjoyed reading my blog and thank you for your comments and emails. Here is a short R code that implements an interesting idea from the Charting the Santa Claus...

Read more »

RTextTools v1.3.2 Released

RTextTools was updated to version 1.3.2 today, adding support for n-gram token analysis, a faster maximum entropy algorithm, and numerous bug fixes. The source code has been synced with the Google Code repository, so please feel free to check out a copy and add your own features!With the core feature set of RTextTools finalized,...

Read more »