Blog Archives

Because it’s Friday: A 3-minute movie in 4095 bytes

August 22, 2014

This entire movie (images, music, everything) is generated from a Windows PC executable of just 4,095 bytes. That's not a typo: we're talking bytes, not megabytes or gigabytes. Less than 4 KB in total creates this entire scene. For comparison, a medium-quality video file of this exact same scene in AVI format comes in at over 64 MB:...

Read more »

Entering the field as a data scientist with certification

August 22, 2014

By Neera Talbert, VP Services, and Ben Wiley, R Programmer at Revolution Analytics

By now, everyone should be familiar with the data scientist boom. Simply logging onto LinkedIn reveals a seemingly infinite number of people with words and phrases like “Data Scientist”, “Big Data Specialist”, and “Analytics” in their title. A few weeks ago, an article floated around the...

Read more »

How to integrate R with your calendar

August 20, 2014

Hilary Parker has contributed a lovely article to Significance, the magazine of the American Statistical Association and the Royal Statistical Society, on using R to set your Google calendar to mark the time of sunsets. Hilary details the process in the article, but the basic idea is to use the sunrise.set function from the StreamMetabolism package to calculate sunset...

Read more »

Data Cleaning is a critical part of the Data Science process

August 18, 2014

A New York Times article yesterday discovers the 80-20 rule: that 80% of a typical data science project is sourcing, cleaning and preparing the data, while the remaining 20% is actual data analysis. The article gives short shrift to this important task by calling it "janitorial work", but whether you call it data munging, data wrangling or anything else,...

Read more »

Search for CRAN, GitHub and BioConductor packages at Rdocumentation.org

August 15, 2014

If you're looking for just the right package to solve your R problem, you could always browse through the list of available packages on CRAN. But with almost 6000 entries, that's not going to be the most efficient process. And even then, many very useful packages aren't found on CRAN: there are more than 800 packages hosted on BioConductor...

Read more »

Table comparing the statistical capabilities of software packages

August 13, 2014

A statistical consultant known only as "Stanford PhD" has put together a table comparing the statistical capabilities of the software packages R, Matlab, SAS, Stata and SPSS. For each of 57 methods (including techniques like "ridge regression", "survival analysis", "optimization") the author ranks the capabilities of each software package as "Yes" (fully supported), "Limited" or "Experimental". Here are the...

Read more »

John Chambers: Interfaces, Efficiency and Big Data

August 11, 2014

Joe wrote about this already, but the recording of John Chambers' keynote presentation from the useR! 2014 conference, Interfaces, Efficiency and Big Data, is now available for viewing, thanks to Data Science LA. In the video, John dives into the history of the S language, for which he won the ACM Software System Award, and which ultimately led...

Read more »

The Open Source R Programming Language is Becoming Pervasive

August 8, 2014

So says CIO.com, in a recent article, "11 Market Trends in Advanced Analytics". R, an open source programming language for computational statistics, visualization and data, is becoming a ubiquitous tool in advanced analytics offerings. Kirsch says nearly every top vendor of advanced analytics has integrated R into their offering so that they can now import R models. This...

Read more »

In case you missed it: July 2014 Roundup

August 6, 2014

In case you missed them, here are some articles from July of particular interest to R users: The deadline for our contest to visualize the location of R user groups has been extended to August 16. Previews of R-related sessions at this year's JSM conference in Boston. Coding errors in R graphics scripts serendipitously create some interesting art. Another...

Read more »

Statisticians get the PR treatment

August 4, 2014

I'm here at the JSM conference in Boston, the latest annual gathering of 6000+ statisticians from North America and around the world. (Revolution Analytics is a proud sponsor of the conference.) One of the great things to see is that the American Statistical Association, the organizer of the conference and the professional body for statisticians, is putting some effort...

Read more »