I use R very frequently and take for granted much that it has to offer. I forget how R is different from similar tools, so I have trouble communicating the benefits of using R. The goal of this post is to highlight R’s main strengths, but first… my story.

__How I got started with R__

I was introduced to R while I was working as a Research Analyst at the Federal Reserve Bank of St. Louis. I wanted to do statistical analysis at home but the tools I used at work (GAUSS and SAS) were expensive, so I started doing my analysis in Excel.

But as my analysis became more complex, the Excel files became large and cumbersome. The files also did not document my thought process, which made it difficult to revisit analysis I had started several months earlier. I asked my fellow analysts for advice and one introduced me to R and Modern Applied Statistics with S. Thus began my auto-didactic journey with R.

__Why should you use R?__

R is the leading tool for statistics, data analysis, and machine learning. It is more than a statistical package; it’s a programming language, so you can create your own objects, functions, and packages.

Speaking of packages, there are over 2,000 cutting-edge, user-contributed packages available on

CRAN (not to mention

Bioconductor and

Omegahat). To get an idea of what packages are out there, just take a look at these

Task Views. Many packages are submitted by prominent members of their respective fields.

Like all programs, R programs explicitly document the steps of your analysis and make it easy to reproduce and/or update analysis, which means you can quickly try many ideas and/or correct issues.

You can easily use it anywhere. It’s

platform-independent, so you can use it on any operating system. And it’s free, so you can use it at any employer without having to persuade your boss to purchase a license.

Not only is R free, but it’s also

open-source. That means anyone can examine the source code to see exactly what it’s doing. This also means that you, or anyone, can fix bugs and/or add features, rather than waiting for the vendor to find/fix the bug and/or add the feature–at their discretion–in a future release.

R allows you to integrate with other languages (C/C++,

Java,

Python) and enables you to interact with many data sources:

ODBC-compliant databases (Excel, Access) and other statistical packages (

SAS,

Stata,

SPSS,

Minitab).

Explicit parallelism is straightforward in R (see the

High Performance Computing Task View): several packages allow you to take advantage of multiple cores, either on a single machine or across a network. You can also build R with

custom BLAS.

I hope that’s a helpful overview of some benefits of using R. I’m sure I have forgotten some things, so please add them in the comments.

*Related*

To

**leave a comment** for the author, please follow the link and comment on their blog:

** FOSS Trading**.

R-bloggers.com offers

**daily e-mail updates** about

R news and

tutorials on topics such as:

Data science,

Big Data, R jobs, visualization (

ggplot2,

Boxplots,

maps,

animation), programming (

RStudio,

Sweave,

LaTeX,

SQL,

Eclipse,

git,

hadoop,

Web Scraping) statistics (

regression,

PCA,

time series,

trading) and more...

If you got this far, why not

__subscribe for updates__ from the site? Choose your flavor:

e-mail,

twitter,

RSS, or

facebook...