Creating an R package, using developer/productivity tools

Couple of R programming (mainly infrastructure/workflow) related topics discussed at the Los Angeles R users group in a tutorial/demo-like form (targeted mainly to beginners) by Szilard Pafka and Jeroen Ooms: how easy it is to create a simple package for your R code, the benefits of having your code in packages, documentation, automated testing and version control, the basic infrastructure R and some other software tools provide for this (CMD check, roxygen, devtools, RUnit/testthat, git/svn) and how an IDE can improve productivity.

Video (integrated slides+demo+audio) is embedded next, but we encourage you to watch it in higher resolution (especially the demo part).

More info on the meeting and about the Los Angeles R users group here. More meetings to come on further R programming and related topics such as programming with classes (S3/S4), debugging, more productivity tools etc.

Experiences with using SAS and R in insurance and banking

In July 2011, Hong Ooi presented an engaging talk to Melbourne R Users Group. Both David Smith from Revolutions and Eugene Dubossarsky behind the Analyst First movement have discussed the presentation.

The video of the talk is now available for viewing through Drew Conway’s Vcasmo site.

The video can be accessed here.

The talk outline was as follows:

Hong Ooi talks about some of the more interesting projects that he has used R for in the last year. These include fitting models for mortgage loss given default, a Monte Carlo application for stress-testing loan portfolios (in combination with Excel and SAS), a framework for calculating through-the-[economic-]cycle probabilities of default, and possibly others. He also talks about some of the challenges involved in getting R accepted in a conservative financial institution workplace.

Hong Ooi graduated from Macquarie University with a BEc in actuarial studies, then worked with NRMA Insurance/IAG in Sydney for many years. Completed a Masters in Applied Stats from Macquarie in 1997, and a PhD in statistics from ANU from 2000-2004. Displayed impeccable timing by switching jobs to St George Bank on the eve of the global financial crisis. Now with ANZ Bank in Melbourne, and adapting to the rigours of life south of the border.

Getting into shape for the sport of data science: Screencast of talk by Jeremy Howard at Melbourne R Users

Jeremy Howard gave a talk at the Melbourne R User Group on 16th March 2011.

Jeremy provided tips on how to successfully compete in data mining competitions. He showed how he combines R with other tools to build predictive models. He gave a walkthrough of the data, visualizations, and code, for a number of his competition entries. The talk also included an introduction to the theory behind Jeremy’s favourite modelling algorithm: random forests.

Screencast of the talk:

Additional Links:

Video Introduction to R Packages by Rory Winston – Melbourne R Users February 2011

On February 17th 2011
Rory Winston gave a talk on creating R packages at the Melbourne R Users Group
(see Meetup page).

The Video can be viewed directly here.

Many thanks to

  • Pedro Olaya for filming the talk;
  • Drew Conway for posting and hosting the
    video; and
  • Deloitte for providing an excellent venue.

Software tools for data analysis – an overview

Discussions on various software tools (C, C++, Perl, Python, Unix shell, R, Matlab, SAS, SPSS, Excel, databases, Hadoop etc.) used in data analysis. Szilard Pafka (founder and co-organizer of the Los Angeles R users group) presents an overview and discusses the survey results regarding their usage by the members of the Los Angeles R users group. A plan for possible talks in the future (at LA RUG meetings) with more details on some of these tools and how they can be used with R is also discussed.

(Also have a look at the discussions on the same topic here: Comparison of data analysis packages: R, Matlab, SciPy, Excel, SAS, SPSS, Stata.)

RHIPE: An Interface Between Hadoop and R for Large and Complex Data Analysis

RHIPE: An Interface Between Hadoop and R
Presented by Saptarshi Guha

Video Link

About the Video:

I filmed the event using LectureMaker’s live event recording technique. One special feature I add to my R video recordings is the addition of my own R source code highlighting and math symbol publishing plugins for WordPress blogs. The highlighting is unique in that R and RHIPE constructs are hot-linked back to online documentation so the user can learn more about the source code.

What others are saying about this video:

“I just watched the Saptarshi Guha video. It looks great!! Thank you! The picture is incredibly crisp, and the timeline tab is a nice touch for reviewing the film. Thank you!” — Matt Bascom