Blog Archives

How do you know if your data has signal?

August 11, 2015

by Nina Zumel, Data Scientist, Win-Vector LLC. An all-too-common approach to modeling in data science is to throw all possible variables at a modeling procedure and "let the algorithm sort it out." This is tempting when you are not sure what are the true causes or predictors of the...

A simple statnet model of CRAN

August 6, 2015

by Joseph Rickert. In a recent post on creating JavaScript network graphs directly from R, my colleague and fellow blogger, Andrie de Vries, included a link to a saved graph of CRAN. Here, I will use that same graph (network) to build a simple exponential random graph model using functions from the igraph package, and the network and ergm...

A Short Introduction to Bioconductor

August 4, 2015

by Peter Hickey (@PeteHaitch). One of the keys to R's success as a software environment for data analysis is the availability of user-contributed packages. Most useRs will be familiar with (and very grateful for) the Comprehensive R Archive Network (CRAN). The packages available on CRAN, nearly 7000 at last count, cover common data analysis tasks, such as importing data...

MRAN’s Packages Spotlight

July 30, 2015

by Joseph Rickert. New R packages just keep coming. The following plot, constructed with information from the monthly files on Dirk Eddelbuettel's CRANberries site, shows the number of new packages released to CRAN each month between January 1, 2013 and July 27, 2015 (not quite 31 months). This is amazing growth! The mean rate is about...

Efficient Accumulation in R

July 28, 2015

by John Mount, Data Scientist, Win-Vector LLC. R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, RevoScaleR, data.table, and more), but when it comes to accumulating results the beginning R user is often at sea. The...

Sunbelt XXXV, Social Network Analysis, Statnet and R

July 23, 2015

by Joseph Rickert. The XXXV Sunbelt Conference of the International Network for Social Network Analysis (INSNA) was held last month at Brighton beach in the UK. (And I am still bummed out that I was not there.) A run of 35 conferences is impressive indeed, but the social network analysts have been at it for an even longer time...

Using Azure as an R datasource Part 3: Pulling data from MySQL/MariaDB to Linux

July 21, 2015

by Gregory Vandenbrouck, Software Engineer, Microsoft. This post is the third in a series that covers pulling data from Microsoft SQL Server or MySQL/MariaDB on Azure to an R client on Windows or Linux. In the previous two, we covered pulling data from SQL Server and MySQL/MariaDB to an R client on Windows. This time we’ll be pulling data...

R, Extreme Value Statistics and Missing Data

July 16, 2015

by Joseph Rickert. June was a hot month for extreme statistics and R. Not only did we close out the month with useR! 2015, but two small conferences in the middle of the month brought experts together from all over the world to discuss two very difficult areas of statistics that generate quite a bit of R code. The...

5 Steps to Create an R Package Email Course

July 14, 2015

by Ari Lamstein, Software Engineer and Data Analyst. Creating an email course for my R packages has significantly increased the number of people who use the packages. It has also reduced the learning curve for the packages and brought me into greater contact with my users. In this post I will share the 5 steps I took to create...

Get your R education going with GitHub

July 2, 2015

by Joseph Rickert. Last week, I was fortunate enough to attend the R Summit & Workshop, an invitation-only event held at the Copenhagen Business School. The abstracts for the public talks presented are online and well worth a look. Collectively they provide a snapshot of the state of development of R and the R Community as well some...
