Getting into shape for the sport of data science: Screencast of talk by Jeremy Howard at Melbourne R Users

Jeremy Howard gave a talk at the Melbourne R User Group on 16th March 2011.

Jeremy provided tips on how to compete successfully in data mining competitions. He showed how he combines R with other tools to build predictive models, and walked through the data, visualizations, and code for several of his competition entries. The talk also included an introduction to the theory behind Jeremy’s favourite modelling algorithm: random forests.

Screencast of the talk:

Posted in RUG Melbourne

Video Introduction to R Packages by Rory Winston – Melbourne R Users February 2011

On February 17th 2011, Rory Winston gave a talk on creating R packages at the Melbourne R Users Group (see Meetup page).

The video can be viewed directly here.

Many thanks to

  • Pedro Olaya for filming the talk;
  • Drew Conway for posting and hosting the video; and
  • Deloitte for providing an excellent venue.

Posted in RUG Melbourne

Discussions on the future of R

Inspired by the discussions on the same topic, Avram Aelony presented an overview of the issues, and the Los Angeles R users group followed with further discussion.

Posted in RUG Los Angeles

Software tools for data analysis – an overview

This session covers the various software tools (C, C++, Perl, Python, Unix shell, R, Matlab, SAS, SPSS, Excel, databases, Hadoop etc.) used in data analysis. Szilard Pafka (founder and co-organizer of the Los Angeles R users group) presents an overview and discusses the results of a survey on how members of the group use these tools. He also outlines a plan for possible future talks at LA RUG meetings that would go into more detail on some of the tools and how they can be used with R.

(Also have a look at the discussions on the same topic here: Comparison of data analysis packages: R, Matlab, SciPy, Excel, SAS, SPSS, Stata.)

Posted in RUG Los Angeles

RHIPE: An Interface Between Hadoop and R for Large and Complex Data Analysis

RHIPE: An Interface Between Hadoop and R
Presented by Saptarshi Guha

Video Link

About the Video:

I filmed the event using LectureMaker’s live event recording technique. One special feature of my R video recordings is the use of my own R source code highlighting and math symbol publishing plugins for WordPress blogs. The highlighting is unique in that R and RHIPE constructs are hot-linked to the online documentation, so viewers can learn more about the source code.

What others are saying about this video:

“I just watched the Saptarshi Guha video. It looks great!! Thank you! The picture is incredibly crisp, and the timeline tab is a nice touch for reviewing the film. Thank you!” — Matt Bascom

Posted in RUG San Francisco Bay Area

R Workflow: Melbourne R Users Dec 1st 2010

Melbourne R Users Group December 1st 2010 Meeting (Meetup page).

1. “What my R code looks and feels like (Vanilla)” by Geoff Robinson

Geoff Robinson discussed several useful strategies for working with R.

Video is embedded below (requires Flash and may not be viewable in RSS readers), or go here.

2. “Reproducible Research and R Workflow” by Jeromy Anglim

Video is embedded below, or go here.

Many thanks to Pedro Olaya for filming and Drew Conway for posting and hosting the videos.

Posted in RUG Melbourne

Databases (SQL, noSQL); Interfacing R with Excel

Los Angeles R users group Dec. 14 2010 meeting (see meetup info here):

1. A SQL primer for R users – Neal Fultz

2. R Database Access – Shrikrishna Bhogaonker

3. NoSQL data stores – Scott Gonyea

4. Interfacing R with Excel – Eric Kostello

Updated slides:

Posted in RUG Los Angeles

Analyst First – SURF

This presentation is aimed at everyone working in commercial and government analytics, irrespective of the tools they use, and at students intending to pursue such a career. R and other open source tools play a powerful, unique and disruptive role in business analytics, and are already changing the landscape. The use of such tools leads to new business models and strategies that challenge the conventional approach.

The key element of the presentation is “Analyst First”, a new approach to analytics in which tools matter far less than the people who perform, manage, request or envision analytics. In this view, analytics is a non-repetitive, exploratory and creative process where the outcome is not known at the start and only a fraction of efforts are expected to succeed. This contrasts with the common perception of analytics as IT and process.

Analyst First

Posted in RUG Sydney

Text mining with R

Videos from the October meeting “Text Mining with R” of the Los Angeles R users group:

Rob Zinkov, “Text Mining with R”:

Ryan Rosario, “Accessing R from Python using RPy2”:

Posted in RUG Los Angeles

Introduction to statistical finance with R

During the first part of our meeting, Nicolas Christou gave an introduction to statistical finance in R and presented a package he co-authored with former PhD student David Diez (2010). Video of the talk is below:

During the second part, we heard shorter talks outlining R users’ experiences with statistical finance in R.

Kyle Matoba, a Finance PhD student at the UCLA Anderson School of Management, presented on Algorithmic Trading with R.

Bryce Little, a UCLA alum, presented on Constructing Minimum Variance Portfolios with R.

Posted in RUG Los Angeles