August 2015

Using R To Get Data *Out Of* Word Docs

August 23, 2015 | 0 Comments

This was asked on Twitter recently: “Is it possible to import data entered in MS Word into R – I have multiple tables in 235 files that need importing #rstats” (Richard Telford, @richardjtelford, August 23, 2015). The answer is a very cautious “yes”. Much depends on how well-formed and un-formatted the table is. Take ... [Read more...]

Predicting Titanic deaths on Kaggle IV: random forest revisited

August 23, 2015 | 0 Comments

On July 19th I used randomForest to predict the deaths on the Titanic in the Kaggle competition. Subsequently I found that both bagging and boosting gave better predictions than randomForest. I found this somewhat unsatisfactory, hence I am now revisiting randomForest. To my disappointment this does not result in predictions as ... [Read more...]

Examining Email Addresses in R

August 22, 2015 | 0 Comments

I don’t normally work with personally identifiable information such as emails. However, the recent data dump from Ashley Madison got me thinking about how I’d examine a data set composed of email addresses. What are the characteristics of an email that I’d look to extract? How would ... [Read more...]

Pricing Game

August 22, 2015 | 0 Comments

In November, with Romuald Elie and Jérémie Jakubowicz, we will organize a session during the 100% Actuaires day, in Paris, based on a “pricing game”. We provide two datasets (motor insurance, third-party claims) with 2 years of experience and 100,000 policies. Each ‘team’ has to submit a premium proposal for 36,000 potential ... [Read more...]

Where Does the S&P 500 Stand?

August 22, 2015 | 0 Comments

Last week was brutal for pretty much all markets. Surprisingly, it was bad even for the US dollar. The sharp and straight downward move was reminiscent of the descent of 2011. It’s time to review where the major index stands from a technical point of view. Let’s start with ... [Read more...]

Why I Don’t Like Jupyter (FKA IPython Notebook)

August 22, 2015 | 0 Comments

Don’t get me wrong, it’s certainly a great tool for presenting your code or even reporting, but every time I use it for exploratory, interactive data science, I keep switching to other tools quite quickly and wonder why I am still even trying to use it. I just mostly ... [Read more...]

Analysing longitudinal data: Multilevel growth models (I)

August 22, 2015 | 0 Comments

Last time we discussed the two formats of longitudinal data and visualised the individual growth trajectories using an imaginary randomised controlled trial data-set. But could we estimate the overall trajectory of the outcomes over time and see if it’s increasing, decreasing, or stable? Yes, of course, we could ... [Read more...]

Partools 1.1.4

August 21, 2015 | 0 Comments

Partools 1.1.4 is now on GitHub. The main change this time is enhancement of the debugging facilities (which work not only for partools but also the cluster-based portion of R’s parallel package in general). As some of you know, I place huge importance on debugging, so much so that I ... [Read more...]

Ofuro, start H2O on Hadoop from R

August 21, 2015 | 0 Comments

tl;dr: I built simple functionality to start H2O on Hadoop from R. You can easily start H2O on Hadoop, run your analytics, and close all the processes without occupying Hadoop nodes and memory all the time. I like to take a bath. Fill a bath and ... [Read more...]

Comparing World Ocean Atlases 2013 and 2013v2

August 21, 2015 | 0 Comments

Introduction: The ocedata package [1] provides data that may be of use to oceanographers, either working with their own R code or working with the oce package [2]. One such dataset, called levitus, holds sea surface temperature and salinity (SST and SSS), based on the 2013 version of the World Ocean Atlas. An ... [Read more...]

Function Argument Lists and missing()

August 21, 2015 | 0 Comments

This entry is part 17 of 17 in the series Using R. Sometimes it is useful to write a wrapper function for an existing function. In this short example we demonstrate how to grab the list of arguments passed to a … [Read more...]

A better interactive neuroimage plotter in R

August 21, 2015 | 0 Comments

In a previous post, I described how you can interactively explore a 3D nifti object in R. I used the manipulate package, but the overall results were sluggish and not really usable. I was introduced to a good neuroimaging viewer called Mango by a friend or two and use ... [Read more...]

Free edX course for R beginners

August 21, 2015 | 0 Comments

If you've thought about learning the R language but didn't know how to start, there's a new, free course on edX that starts you from the R basics and lets you learn R by trying R as you go. Presented by DataCamp and Microsoft, the course starts from the very ... [Read more...]

RTutor: How Soap Operas Reduced Fertility in Brazil

August 21, 2015 | 0 Comments

What is the real-world impact of TV series? Did Brazilian women have fewer children because they watched soap operas that portray happy, rich families that have few children? Clara Ulmer has written a very nice RTutor problem set that allows you to interactively explore this question in R. It is ... [Read more...]