[This article was first published on FOSS Trading, and kindly contributed to R-bloggers.]
I use R very frequently and take for granted much that it has to offer. I forget how R is different from similar tools, so I have trouble communicating the benefits of using R. The goal of this post is to highlight R’s main strengths, but first… my story.
How I got started with R
I was introduced to R while I was working as a Research Analyst at the Federal Reserve Bank of St. Louis. I wanted to do statistical analysis at home but the tools I used at work (GAUSS and SAS) were expensive, so I started doing my analysis in Excel.
But as my analysis became more complex, the Excel files became large and cumbersome. The files also did not document my thought process, which made it difficult to revisit analysis I had started several months earlier. I asked my fellow analysts for advice and one introduced me to R and Modern Applied Statistics with S. Thus began my autodidactic journey with R.
Why should you use R?
R is a leading tool for statistics, data analysis, and machine learning. It is more than a statistical package; it’s a programming language, so you can create your own objects, functions, and packages.
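To make the "programming language" point concrete, here is a minimal sketch of defining your own object type and functions in base R. The names `price_series` and `returns` are made up for illustration and do not come from any package.

```r
# Hypothetical example: a tiny S3 class holding a series of prices.
# price_series() and returns() are illustrative names, not a package API.
price_series <- function(prices) {
  stopifnot(is.numeric(prices), length(prices) >= 2)
  structure(list(prices = prices), class = "price_series")
}

# A new generic function and its method for our class.
returns <- function(x, ...) UseMethod("returns")
returns.price_series <- function(x, ...) {
  p <- x$prices
  diff(p) / head(p, -1)  # simple period-over-period returns
}

# Extend base R's existing summary() generic to understand our class.
summary.price_series <- function(object, ...) {
  r <- returns(object)
  c(n = length(object$prices), mean_return = mean(r), sd_return = sd(r))
}

ps <- price_series(c(100, 102, 101, 105))
returns(ps)
summary(ps)
```

Functions and classes like these are the kind of thing you would eventually bundle into a package of your own.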
Speaking of packages, there are over 2,000 cutting-edge, user-contributed packages available on CRAN (not to mention Bioconductor). To get an idea of what packages are out there, just take a look at these Task Views. Many packages are submitted by prominent members of their respective fields.
Like all programs, R scripts explicitly document the steps of your analysis, which makes it easy to reproduce or update the analysis. That means you can quickly try many ideas and correct mistakes.
You can easily use it anywhere. It’s platform-independent, so you can use it on any operating system. And it’s free, so you can use it at any employer without having to persuade your boss to purchase a license.
Not only is R free, it’s also open-source. That means anyone can examine the source code to see exactly what it’s doing. It also means that you, or anyone else, can fix bugs and add features, rather than waiting for a vendor to fix the bug or add the feature, at their discretion, in a future release.
R allows you to integrate with other languages (C/C++, Java) and enables you to interact with many data sources: ODBC-compliant databases (Excel, Access) and other statistical packages (such as SAS).
Explicit parallelism is straightforward in R (see the High Performance Computing Task View): several packages allow you to take advantage of multiple cores, either on a single machine or across a network. You can also build R with a custom BLAS.
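As one illustration of explicit parallelism on a single machine, the sketch below uses the `parallel` package, assuming a version of R that ships it (2.14.0 or later). The bootstrap task, the `boot_mean` helper, and the choice of two workers are assumptions made for the example; other packages from the Task View would work differently.

```r
library(parallel)  # bundled with R itself, no installation needed

# Hypothetical CPU-bound task: the mean of one bootstrap resample.
# The first argument is the iteration index supplied by parLapply().
boot_mean <- function(i, data) mean(sample(data, replace = TRUE))

set.seed(42)
x <- rnorm(1000)

# Start a small cluster of worker processes on this machine.
n_workers <- 2
cl <- makeCluster(n_workers)

# Give the workers reproducible, independent random-number streams.
clusterSetRNGStream(cl, iseed = 123)

# Split the 200 iterations across the workers; extra arguments
# (here `data`) are passed on to boot_mean().
res <- parLapply(cl, 1:200, boot_mean, data = x)

stopCluster(cl)  # always release the workers when done

boot_means <- unlist(res)
length(boot_means)
```

The same `makeCluster()` call can also be given a vector of host names to spread work across a network, although that needs additional setup that is not shown here.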
I hope that’s a helpful overview of some benefits of using R. I’m sure I have forgotten some things, so please add them in the comments.