Monthly Archives: September 2014

Top open R jobs (for September 25th 2014)

September 25, 2014

This is the bimonthly R Jobs post (for 2014-09-25), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). After almost 8 months, this is the first time that two weeks have passed without a single new job to share. As compensation, I...


Estimating Generalization Error with the PRESS statistic

September 25, 2014

As we’ve mentioned on previous occasions, one of the defining characteristics of data science is the emphasis on the availability of “large” data sets, which we define as “enough data that statistical efficiency is not a concern” (note that a “large” data set need not be “big data,” however you choose to define it). In...


DescTools: a new R "misc package"

September 25, 2014

By Joseph Rickert. One of the most difficult things about R, a problem that is particularly vexing to beginners, is finding things. This is an unintended consequence of R's spectacular, but mostly uncoordinated, organic growth. The R core team does a superb job of maintaining the stability and growth of the R language itself, but the innovation engine for...


Aggregate portfolio contributions through time

September 25, 2014

The last CRAN release didn’t have much new functionality, but Ross Bennett and I have completely rewritten the Return.portfolio function to fix some issues and make the calculations more transparent. The function calculates the returns of a portfolio given asset returns, weights, and rebalancing periods, which, although not rocket science, requires some diligence. Users of this


How Many Paths are Possible in an 18 Hole Round of Match Play Golf?

September 25, 2014

In honor of the Ryder Cup, here's a fun puzzle for the mathematically inclined golfer to consider: how many different paths are possible in an 18 hole round of match play golf? If you'd rather not wade through the math then you can skip ahead to the "practical exploration" section of this post to see some actual match play...


Effective Applications of the R Language Conference 2014

September 25, 2014

By Chris Campbell - Senior Consultant, UK. What struck me first was how few sandals I could see, none of which were paired with socks. The energy in the room was electric as introductions were made and business cards were exchanged. The inaugural Effective Applications of the R Language (EARL) had started strongly with two sold-out workshops. As Matt Aldridge...


RMOA package for running streaming classification & regression models now at CRAN

Last week, we released the RMOA package at CRAN (http://cran.r-project.org/web/packages/RMOA). It is an R package for building streaming classification and regression models on top of MOA. MOA stands for 'Massive Online Analysis', the most popular open-source framework for data stream mining, which is being developed at the University of Waikato: http://moa.cms.waikato.ac.nz....


Joint Models for Longitudinal and Survival Data

September 25, 2014

What are joint models for longitudinal and survival data? In this post we will introduce, in layman's terms, the framework of joint models for longitudinal and time-to-event data. These models are applied in settings where the sample units are followed up over time; for example, we may be interested in patients suffering...


“R for Developers” course – Oct 16-17 @ Milano, Italy

September 25, 2014

R for Developers, Milano, October 16 and 17, 2014. Course description: This two-day course provides an overview of several advanced R topics, such as R environments, object-oriented programming, functional programming, and debugging. Who should attend this course: Anyone …


Become an effective data hacker with the R-Hadoop stack

September 24, 2014

In discussion with several data scientists, Will Stanton (a data scientist with Return Path) learned that a common concern is: what software should I be using? There are many options out there, but what is the best platform to be an effective "data hacker"? Will recommends using a technology stack with R and Hadoop, which allows data scientists "to...
