Blog Archives

Reflections on Data Science Summit 2011

May 13, 2011

The Data Science Summit held in Las Vegas this week was outstanding - kudos and thanks to EMC/Greenplum for organizing the event. The energy of 150+ data scientists, coupled with a well-curated agenda of talks, created a real sense of being at the cusp of a revolution in the applications of data analysis. Here are just a few...

Read more »

The R-Files: Martin Morgan

May 12, 2011

"The R-Files" is an occasional series from Revolution Analytics, where we profile prominent members of the R Community.

Name: Martin Morgan
Profession: Senior Staff Scientist at Fred Hutchinson Cancer Research Center
Nationality: Canadian
Years Using R: 7
Known for: Director of the Bioconductor project

Martin Morgan is a Senior Staff Scientist at the Fred Hutchinson Cancer Research Center (FHCRC)...

Read more »

An essential vocabulary for the R language

May 11, 2011

The Oxford English Dictionary includes more than 600,000 words, yet most of us get by in our day-to-day lives with a vocabulary of just a few thousand. In a similar vein, the R language includes thousands of functions: when you start up R 2.13, you have 2382 functions at your disposal:

> length(apropos(".", mode="function"))
2382

This includes only...
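As a rough sketch of how that count can be reproduced and broken down, the snippet below runs the same `apropos()` call and then tallies the functions by search-path entry. The variable names are illustrative, and the totals will differ from the figure quoted in the post depending on the R version and which packages are attached.

```r
# Count the functions visible on the current search path
# (the same call as in the excerpt above).
n_functions <- length(apropos(".", mode = "function"))
print(n_functions)

# With where = TRUE, apropos() names each result with its position on
# the search path, so the total can be split up by package/environment.
found <- apropos(".", where = TRUE, mode = "function")
per_entry <- table(search()[as.integer(names(found))])
print(sort(per_entry, decreasing = TRUE))
```

The per-entry table makes it easy to see how much of the total comes from `package:base` and `package:stats` versus the smaller default packages.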

Read more »

Data Science Toolset discussion at Data Scientist Summit

May 10, 2011

Heads-up to anyone attending the sold-out Data Science Summit in Las Vegas this week: I'll be there tomorrow and Thursday for the conference and to discuss R on the panel "Data Science Toolset - Recipes That Win" (more details about the panel discussion below). I'm looking forward to meeting with the other R users there -- tweet or...

Read more »

Registration open for Rmetrics Workshop on Computational Finance

May 9, 2011

The Rmetrics Association is once again holding its annual Workshop and Summer School on Computational Finance and Financial Engineering at Meielisalp (on Lake Thun in Switzerland) from June 26-30. Now in its fifth year, the workshop consists of Summer School-like tutorial sessions and a user/developer meeting: both focus on topics from "Computational Finance and Financial Engineering" and on the...

Read more »

Propagation of the news of OBL’s death via Twitter

May 6, 2011

SocialFlow's blog has a great case study today on how news from a single tweet -- in this case, speculation made an hour before the President's announcement that Osama bin Laden had been killed -- can propagate through social networks. At 10:24 p.m. EST on Sunday May 1, Keith Urbahn tweeted: "So I'm told by a reputable person they...

Read more »

How to access databases from R

May 5, 2011

From his presentation at the Greater Boston useR Group, R user Jeffrey Breen has shared some useful slides detailing how to bring data from relational databases like MySQL and Oracle into R. In fact, data from just about any relational database is accessible from R by sending an SQL query to the standard ODBC or JDBC interfaces. R packages also offer...
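To illustrate the general pattern of sending an SQL query from R and getting a data frame back, here is a minimal sketch. It is not taken from the slides: it assumes the DBI and RSQLite packages are installed and uses an in-memory SQLite database as a stand-in for a MySQL or Oracle server reached over ODBC or JDBC. The `flights` table and its values are hypothetical and exist only for this example.

```r
library(DBI)

# In-memory SQLite database; a stand-in for a real database server.
con <- dbConnect(RSQLite::SQLite(), ":memory:")

# Hypothetical table, made up for illustration only.
flights <- data.frame(carrier = c("AA", "AA", "UA"),
                      delay   = c(10, 20, 5))
dbWriteTable(con, "flights", flights)

# Send an SQL query; the result comes back as an ordinary data frame.
avg_delay <- dbGetQuery(con, "
  SELECT carrier, AVG(delay) AS mean_delay
  FROM flights
  GROUP BY carrier
  ORDER BY carrier")
print(avg_delay)

# Always release the connection when done.
dbDisconnect(con)
```

With a different DBI-compatible driver, only the `dbConnect()` call should need to change; the query and the returned data frame are handled the same way.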

Read more »

Mapping airline flight networks with R

May 5, 2011

Inspired by the Facebook Social Network chart, FlowingData's Nathan Yau also turns to R to create a beautiful chart of the network of all flight connections between major airlines in the US: Like the Facebook chart, the chart reflects the intensity of the connections (here, the number of flights) between pairs of cities. Nathan explains: Brighter lines represent more...

Read more »

Using R for Map-Reduce applications in Hadoop

May 4, 2011

Data Scientist Antonio Piccolboni recently published this comparison of the various languages and interfaces available for programming Big Data analysis tasks in the map-reduce framework. The interfaces he reviewed included:

Java Hadoop (mature and efficient, but verbose and difficult to program)
Cascading (brings an SQL-like flavor to Java programming with Hadoop)
Pipes/C++ (a C++ interface to programming on Hadoop)...
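For readers new to the map-reduce style itself, the following toy word count shows the map, shuffle, and reduce steps in base R. It is only a single-machine sketch and does not use Hadoop or any of the interfaces reviewed in the comparison. The input lines and variable names are made up for illustration.

```r
# Toy input: each element stands in for one record of a large data set.
lines <- c("big data analysis", "data analysis with R", "big data")

# Map step: split every line into words; each word is an emitted key
# with an implicit count of 1.
mapped <- unlist(lapply(lines, function(line) {
  strsplit(line, " ", fixed = TRUE)[[1]]
}))

# Shuffle step: group the emitted 1s by key (word).
grouped <- split(rep(1L, length(mapped)), mapped)

# Reduce step: sum the counts for each key.
counts <- vapply(grouped, sum, integer(1))
print(counts)
```

On a real cluster the map and reduce functions run in parallel across many machines, and the framework performs the grouping between them. The structure of the computation is the same.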

Read more »

How to learn R

May 3, 2011

Over at R community site inside-R.org, Revolution's Joseph Rickert has published a How-To guide with tips for new users on How to Learn R, with links to resources for R books, blogs and courses. Check it out at the link below. Inside-R: How to Learn R

Read more »