Articles by Joseph Rickert

5 New R Packages for Data Scientists

August 20, 2015 | Joseph Rickert

by Joseph Rickert One great beauty of the R ecosystem, and perhaps the primary reason for R’s phenomenal growth, is the system for contributing new packages. This, coupled to the rock solid stability of CRAN, R’s primary package repository, gives R a great advantage. However, anyone with enough ... [Read more...]

R News From JSM 2015

August 13, 2015 | Joseph Rickert

by Joseph Rickert We can declare 2015 the year that R went mainstream at the JSM. There is no doubt about it: the calculations, visualizations and deep thinking of a great many of the world's statisticians are rendered or expressed in R, and the JSM is with the program. In 2013 I ... [Read more...]

How do you know if your data has signal?

August 11, 2015 | Joseph Rickert

by Nina Zumel Data Scientist Win-Vector LLC Image by Liz Sullivan, Creative Commons. Source: Wikimedia An all too common approach to modeling in data science is to throw all possible variables at a modeling procedure and "let the algorithm sort it out." This is tempting when you are not sure ... [Read more...]

A simple statnet model of CRAN

August 6, 2015 | Joseph Rickert

by Joseph Rickert In a recent post on creating JavaScript network graphs directly from R, my colleague and fellow blogger, Andrie de Vries, included a link to a saved graph of CRAN. Here, I will use that same graph (network) to build a simple exponential random graph model using functions ... [Read more...]

A Short Introduction to Bioconductor

August 4, 2015 | Joseph Rickert

by Peter Hickey (@PeteHaitch) One of the keys to R's success as a software environment for data analysis is the availability of user-contributed packages. Most useRs will be familiar with (and very grateful for) the Comprehensive R Archive Network (CRAN). The packages available on CRAN, nearly 7000 at last count, cover ... [Read more...]

MRAN’s Packages Spotlight

July 30, 2015 | Joseph Rickert

by Joseph Rickert New R packages just keep coming. The following plot, constructed with information from the monthly files on Dirk Eddelbuettel's CRANberries site, shows the number of new packages released to CRAN each month between January 1, 2013 and July 27, 2015 (not quite 31 months). This is amazing growth! The ... [Read more...]

Efficient Accumulation in R

July 28, 2015 | Joseph Rickert

by John Mount, Data Scientist, Win-Vector LLC R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, RevoScaleR, data.table, and more), but when it comes to accumulating results the beginning R user is often at sea. The ... [Read more...]

R, Extreme Value Statistics and Missing Data

July 16, 2015 | Joseph Rickert

by Joseph Rickert June was a hot month for extreme statistics and R. Not only did we close out the month with useR! 2015, but two small conferences in the middle of the month brought experts together from all over the world to discuss two very difficult areas of statistics that ... [Read more...]

5 Steps to Create an R Package Email Course

July 14, 2015 | Joseph Rickert

by Ari Lamstein, Software Engineer and Data Analyst Creating an email course for my R packages has significantly increased the number of people who use the packages. It has also reduced the learning curve for the packages and brought me into greater contact with my users. In this post I ... [Read more...]

Get your R education going with GitHub

July 2, 2015 | Joseph Rickert

by Joseph Rickert Last week, I was fortunate enough to attend the R Summit & Workshop, an invitation only event, held at the Copenhagen Business School. The abstracts for the public talks presented are online and well worth a look. Collectively they provide a snapshot of the state of development of ... [Read more...]

Working with the RStudio CRAN logs

June 25, 2015 | Joseph Rickert

by Joseph Rickert The installr package has some really nice functions for working with the daily package download logs for the RStudio CRAN mirror, which RStudio graciously makes available at http://cran-logs.rstudio.com/. The following code uses the download_RStudio_CRAN_data() function to download a month's worth of ... [Read more...]

DeployR Data I/O

June 22, 2015 | Joseph Rickert

by Sean Wells, Senior Software Engineer, Microsoft and David Russell DeployR exists to solve a number of fundamental R analytics integration problems faced by application developers. For example, have you ever wondered how you might execute an R script from within a Web-based dashboard, an enterprise middleware solution, or a ... [Read more...]

Fishing for packages in CRAN

June 18, 2015 | Joseph Rickert

by Joseph Rickert It is incredibly challenging to keep up to date with R packages. As of today (6/16/15), there are 6,789 listed on CRAN. Of course, the CRAN Task Views are probably the best resource for finding what's out there. A tremendous amount of work goes into maintaining and curating these ... [Read more...]

Pairwise-complete correlation considered dangerous

June 16, 2015 | Joseph Rickert

by B. W. Lewis This note warns about potentially misleading results when using the use=pairwise.complete.obs and related options in R’s cor and cov functions. Pitfalls are illustrated using a very simple pathological example followed by a brief list of alternative ways to deal with missing data ... [Read more...]

R User Groups are Everywhere

June 11, 2015 | Joseph Rickert

by Joseph Rickert In a little over three weeks useR! 2015 will convene in Aalborg, Denmark and I am looking forward to being there and learning and talking about R user groups. The following map shows the big picture for R User Groups around the world. However, it is very difficult ... [Read more...]
