#rstats Make arrays into vectors before running table

July 29, 2015
By
#rstats Make arrays into vectors before running table

Setup of Problem While working with nifti objects from the oro.nifti, I tried to table the values of the image. The table took a long time to compute. I thought this was due to the added information about a medical image, but I found that the same sluggishness happened when coercing the nifti object to

Read more »

R Oddities: Strings in DataFrames

July 29, 2015
By
R Oddities:  Strings in DataFrames

Have you ever read a file into R and then encountered strange problems filtering and sorting because the strings were converted to factors?  For...

Read more »

But I Don’t Want to Be a Statistician!

July 29, 2015
By
But I Don’t Want to Be a Statistician!

"For a long time I have thought I was a statistician.... But as I have watched mathematical statistics evolve, I have had cause to wonder and to doubt.... All...

Read more »

Mapping the past and the future with Leaflet

July 29, 2015
By
Mapping the past and the future with Leaflet

I have been working on mapping things for a while and I must say that I really like the Leaflet package from Rstudio. It makes it very easy and...

Read more »

Player Value Gap Assessment

July 29, 2015
By
Player Value Gap Assessment

Looking at fantasy football projections we have a group of experts providing their views on how a player will do during the football season. We have collected projections from...

Read more »

Predict Social Network Influence with R and H2O Ensemble Learning

July 29, 2015
By
Predict Social Network Influence with R and H2O Ensemble Learning

What is H2O? H2O is an awesome machine learning framework. It is really great for data scientists and business analysts “who need scalable and fast machine learning”. H2O is...

Read more »

Hockey Elbow and Other Response Time Injuries

July 29, 2015
By
Hockey Elbow and Other Response Time Injuries

You've heard of tennis elbow. Well, there's a non-sports, performance injury that I like to call hockey elbow. An example of such an "injury" is shown in Figure...

Read more »

The most popular programming languages on StackOverflow

July 29, 2015
By
The most popular programming languages on StackOverflow

by Andrie de Vries Last week, IEEE Spectrum said R rised to #6 in Top Programming languages. They use a weighted methodology of 12 factors to compute their score....

Read more »

Introducing the nominatim geocoding package

July 29, 2015
By
Introducing the nominatim geocoding package

In the never-ending battle for truth, justice and publishing more R packages than Oliver, I whipped out an R package for the OpenStreetMap Nominatim API. It actually hits the...

Read more »

Computing AIC on a Validation Sample

July 29, 2015
By
Computing AIC on a Validation Sample

This afternoon, we’ve seen in the training on data science that it was possible to use AIC criteria for model selection. > library(splines) > AIC(glm(dist ~ speed, data=train_cars, family=poisson(link="log")))...

Read more »

Mongolite 0.5: authentication and iterators

July 28, 2015
By
Mongolite 0.5: authentication and iterators

A new version of the mongolite package has appeared on CRAN. Mongolite builds on jsonlite to provide a simple, high-performance...

Read more »

I loved this %>% crosstable

July 28, 2015
By

This is a public tank you for @heatherturner's contribution. Now the SciencesPo's crosstable can work in a chain (%>%) fashion; useful for using along with other packages that have...

Read more »

Pluto: To Catch an Icy King

July 28, 2015
By
Pluto: To Catch an Icy King

Sly as a fox, it is. Mysterious and diminutive, it has eluded us for decades. Despite what we've learned about Pluto, constant debate continues to rage over its classification....

Read more »

Goals for the New R Consortium

July 28, 2015
By
Goals for the New R Consortium

by Bob Muenchen The recently-created R Consortium consists of companies that are deeply involved in R such as RStudio, Microsoft/Revolution Analytics, Tibco, and others. The Consortium’s goals include advancing...

Read more »

R tutorial on the Apply family of functions

July 28, 2015
By
R tutorial on the Apply family of functions

Introduction In our previous tutorial Loops in R: Usage and Alternatives , we discussed one of the most important constructs in programming: the loop.  Eventually we deprecated the usage of loops in...

Read more »

Modelling Occurence of Events, with some Exposure

July 28, 2015
By
Modelling Occurence of Events, with some Exposure

This afternoon, an interesting point was raised, and I wanted to get back on it (since I did publish a post on that same topic a long time ago)....

Read more »

Efficient Accumulation in R

July 28, 2015
By
Efficient Accumulation in R

by John MountData Scientist, Win-Vector LLC R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, RevoScaleR, data.table, and more), but when it comes to...

Read more »

The complete catalog of argument variations of select() in dplyr

July 28, 2015
By

When I read the dplyr vignette, I found a convenient way to select sequential columns such as select(data, year:day). Because I had inputted only column names to select()...

Read more »

Visualising Claims Frequency

July 28, 2015
By
Visualising Claims Frequency

A few years ago, I did publish a post to visualize and empirical claims frequency in a portfolio. I wanted to update the code. Here is a code to...

Read more »

Notes from the 3rd R in Insurance Conference

July 28, 2015
By
Notes from the 3rd R in Insurance Conference

Photo: Arthur CharpentierThe R in Insurance conference in Amsterdam was a sold out success! Congratulations to the organising committee at the University of Amsterdam, and many thanks to our...

Read more »

upsetplot in ChIPseeker

July 28, 2015
By
upsetplot in ChIPseeker

ChIPseeker is an R package for ChIP peak annotation, comparison and visualization. We have implemented several visualization methods, including vennpie that was designed for viewing annotation overlap as shown below:

Read more »

Catalan numbers for triangulations gambler’s ruin binary…

July 27, 2015
By
Catalan numbers for
triangulations
gambler’s ruin
binary…

Catalan numbers for triangulations gambler’s ruin binary trees trees …from the Flajolet/Sedgewick coursera analysis of...

Read more »

Graph-based circle packing

July 27, 2015
By
Graph-based circle packing

The previous two posts showed examples of a simple circle packing algorithm using the packcircles package (available from CRAN and GitHub). The algorithm involved iterative pair-repulsion to...

Read more »

Statistical Models of Judgment and Choice: Deciding What Matters Guided by Attention and Intention

July 27, 2015
By
Statistical Models of Judgment and Choice: Deciding What Matters Guided by Attention and Intention

Preference begins with attention, a form of intention-guided perception. You enter the store thirsty on a hot summer day, and all you can see is the beverage cooler at...

Read more »

Egyptian fractions [Le Monde puzzle #922]

July 27, 2015
By
Egyptian fractions [Le Monde puzzle #922]

For its summer edition, Le Monde mathematical puzzle switched to a lighter version with immediate solution. This #922 considers Egyptian fractions which only have distinct denominators (meaning the numerator...

Read more »

Hadley Wickham on why he created all those R packages

July 27, 2015
By
Hadley Wickham on why he created all those R packages

Priceonomics published on Friday an in-depth profile of Hadley Wickham, author of many of the most popular R packages including ggplot2, dplyr and devtools. In the article, he reveals...

Read more »

Announcing: Mastering RStudio

July 27, 2015
By

Learn the holistic use of RStudio to communicate your R code effectively and persuasively. Max (@nierhoff) and I are both absolute R enthusiasts. We both strongly believe in the power...

Read more »

Efficient accumulation in R

July 27, 2015
By
Efficient accumulation in R

R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, ScaleR, data.table, and more), but when it comes to accumulating results the beginning R...

Read more »

RBerkeley Was Just Pining For The Fjords

July 27, 2015
By

If you made it to Chapter 8 of Data-Driven Security after ~October 2014 and tried to run the BerkeleyDB R example, you were greeted with: Warning in install.packages : ...

Read more »