Blog Archives

Open source software has changed the way we do business

May 20, 2015

Earlier this month TechCrunch published an article of mine, "The Business Economics And Opportunity Of Open-Source Data Science". With this article I wanted to share how open-source software has disrupted the economics of doing business, now that data is a fundamental component of every business's operations. Open-source projects like Hadoop and R, coupled with commodity hardware, have fundamentally...

Read more »

Because it’s Friday: Love in the land of Facebook

May 15, 2015

Today is my 11th wedding anniversary with my wonderful husband Jay, so it's a love-themed Friday post today. Jay and I met before Facebook was a thing, but we've been touched by the congratulations on our timelines today. Those timeline posts reveal a lot about you and your relationships, and last year the Facebook data science team published a...

Read more »

In-database R coming to SQL Server 2016

May 15, 2015

R is coming to SQL Server. SQL Server 2016 (which will be in public preview this summer) will include new real-time analytics, automatic data encryption, and the ability to run R within the database itself: For deeper insights into data, SQL Server 2016 expands its scope beyond transaction processing, data warehousing and business intelligence to deliver advanced analytics as...

Read more »

Computerworld’s list of R packages for data wrangling

May 13, 2015

Computerworld's Sharon Machlis today published a very useful list of R packages that every R user should know. The list covers packages for data import, data wrangling, data visualization and package development, but for beginning R users the biggest challenge is usually just dealing with data. To that end, I thought it was worth listing the package for data...

Read more »

What data science software tools do you use?

May 11, 2015

KDnuggets is once again running its annual poll of data science software tools, now in its 16th year. If you'd like to participate, visit the KDnuggets poll page and answer the question, "What Predictive Analytics, Data Mining, Data Science software/tools you used in the past 12 months?". The poll allows you to select up to 20 tools from the...

Read more »

In case you missed it: April 2015 roundup

May 8, 2015

In case you missed them, here are some articles from April of particular interest to R users. Joseph Rickert reviews the inaugural New York City R User Conference, featuring Andrew Gelman. Engineer Vineet Abraham compares performance benchmarks for R and Revolution R Open on OS X and Ubuntu. R was featured in the keynotes for the BUILD developer’s conference....

Read more »

Comparing data frames, data.table and dplyr with random walks

May 6, 2015

Arthur Charpentier was trying to solve an interesting problem with R: given this data set of random walks in the 2-D plane, what is the likely origin of a pathway that ends in the black circle below? It's pretty easy to generate random data like this with a few lines of code in R. And with 2 million trajectories...

Read more »
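As a rough illustration of the random-walk excerpt above, here is a minimal sketch of generating 2-D random walks. It is written in Python with only the standard library rather than in R, and it is not Arthur Charpentier's code: the function names (`random_walk`, `simulate`), the standard-normal step distribution, the common origin at (0, 0), and the walk and step counts are all assumptions chosen for illustration.

```python
import random


def random_walk(n_steps, rng):
    """Return the list of (x, y) positions of one 2-D walk starting at the origin."""
    x, y = 0.0, 0.0
    path = [(x, y)]
    for _ in range(n_steps):
        # Assumption: independent standard-normal increments in each coordinate.
        x += rng.gauss(0.0, 1.0)
        y += rng.gauss(0.0, 1.0)
        path.append((x, y))
    return path


def simulate(n_walks, n_steps, seed=0):
    """Generate n_walks independent walks using a seeded generator for reproducibility."""
    rng = random.Random(seed)
    return [random_walk(n_steps, rng) for _ in range(n_walks)]


# Hypothetical sizes; the post itself mentions 2 million trajectories.
walks = simulate(n_walks=1000, n_steps=20)
endpoints = [walk[-1] for walk in walks]  # final position of each walk
```

Collecting `endpoints` is the natural first step toward the question in the excerpt, since one would then filter for walks that finish inside a target region and look at where they began.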

Call R and Python from base SAS

May 4, 2015

Since 2009, it has been possible to call R from SAS programs. However, this integration requires IML, an add-on matrix-object language for SAS which isn't available with all SAS installations and is separate from the standard SAS PROC execution model. Now, engineers at SAS have shared a method of calling R, Python and other open-source tools using the Java...

Read more »

Revolution R Open 8.0.3 now available

May 1, 2015

Revolution R Open 8.0.3 is now available for download for Windows, OS X, Red Hat, Ubuntu and OpenSUSE. This release includes several new features: it upgrades RRO to the R 3.1.3 engine, which adds several new features to the R language, adds support for Ubuntu 15.04, and updates the checkpoint package for reproducibility. RRO is designed to work with...

Read more »

Benchmarks of RRO on OSX and Ubuntu

April 29, 2015

Bay Area engineer Vineet Abraham recently ran some benchmarks for Revolution R Open (RRO) running on Mac OS X and on Ubuntu. Thanks to the multi-threaded processing capabilities of RRO, several operations ran much faster than R downloaded from CRAN, without having to change any code: For the most part, RRO performs significantly faster than standard R both locally...

Read more »