## The most popular programming languages on StackOverflow

July 29, 2015

by Andrie de Vries

Last week, IEEE Spectrum said R rose to #6 in its Top Programming Languages ranking. They use a weighted methodology of 12 factors to compute the score. Among these factors is the activity on social programming websites, including StackOverflow and GitHub. I recently used data.stackexchange.com to query the total number of questions on StackOverflow using the R...

## Introducing the nominatim geocoding package

July 29, 2015

In the never-ending battle for truth, justice and publishing more R packages than Oliver, I whipped out an R package for the OpenStreetMap Nominatim API. It actually hits the...

## Computing AIC on a Validation Sample

July 29, 2015

This afternoon, in the data science training, we saw that it is possible to use the AIC criterion for model selection. `> library(splines)` `> AIC(glm(dist ~ speed, data=train_cars, family=poisson(link="log")))`...

## Mongolite 0.5: authentication and iterators

July 28, 2015

A new version of the mongolite package has appeared on CRAN. Mongolite builds on jsonlite to provide a simple, high-performance...

## I loved this %>% crosstable

July 28, 2015

This is a public thank you for @heatherturner's contribution. Now the SciencesPo crosstable can work in a chain (%>%) fashion, which is useful alongside other packages that have...

## Pluto: To Catch an Icy King

July 28, 2015

Sly as a fox, it is. Mysterious and diminutive, it has eluded us for decades. Despite what we've learned about Pluto, constant debate continues to rage over its classification....

## Goals for the New R Consortium

July 28, 2015

by Bob Muenchen

The recently created R Consortium consists of companies that are deeply involved in R, such as RStudio, Microsoft/Revolution Analytics, Tibco, and others. The Consortium’s goals include advancing...

## R tutorial on the Apply family of functions

July 28, 2015

Introduction: In our previous tutorial, Loops in R: Usage and Alternatives, we discussed one of the most important constructs in programming: the loop. Eventually we deprecated the usage of loops in...

## Modelling Occurence of Events, with some Exposure

July 28, 2015

This afternoon, an interesting point was raised, and I wanted to get back on it (since I did publish a post on that same topic a long time ago)....

## Efficient Accumulation in R

July 28, 2015

by John Mount, Data Scientist, Win-Vector LLC

R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, RevoScaleR, data.table, and more), but when it comes to...

## The complete catalog of argument variations of select() in dplyr

July 28, 2015

When I read the dplyr vignette, I found a convenient way to select sequential columns such as select(data, year:day). Because I had inputted only column names to select()...

## Notes from the 3rd R in Insurance Conference

July 28, 2015

Photo: Arthur Charpentier

The R in Insurance conference in Amsterdam was a sold-out success! Congratulations to the organising committee at the University of Amsterdam, and many thanks to our...

## upsetplot in ChIPseeker

July 28, 2015

ChIPseeker is an R package for ChIP peak annotation, comparison and visualization. We have implemented several visualization methods, including vennpie, which was designed for viewing annotation overlap.

## Catalan numbers for triangulations, gambler’s ruin, binary…

July 27, 2015

Catalan numbers for triangulations, gambler’s ruin, binary trees, trees … from the Flajolet/Sedgewick Coursera analysis of...

## Graph-based circle packing

July 27, 2015

The previous two posts showed examples of a simple circle packing algorithm using the packcircles package (available from CRAN and GitHub). The algorithm involved iterative pair-repulsion to...

## Statistical Models of Judgment and Choice: Deciding What Matters Guided by Attention and Intention

July 27, 2015

Preference begins with attention, a form of intention-guided perception. You enter the store thirsty on a hot summer day, and all you can see is the beverage cooler at...

## Egyptian fractions [Le Monde puzzle #922]

July 27, 2015

For its summer edition, Le Monde mathematical puzzle switched to a lighter version with immediate solution. This #922 considers Egyptian fractions which only have distinct denominators (meaning the numerator...

## Hadley Wickham on why he created all those R packages

July 27, 2015

Priceonomics published on Friday an in-depth profile of Hadley Wickham, author of many of the most popular R packages including ggplot2, dplyr and devtools. In the article, he reveals...

## Announcing: Mastering RStudio

July 27, 2015

Learn the holistic use of RStudio to communicate your R code effectively and persuasively. Max (@nierhoff) and I are both absolute R enthusiasts. We both strongly believe in the power...

## Efficient accumulation in R

July 27, 2015

R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, ScaleR, data.table, and more), but when it comes to accumulating results the beginning R...

## RBerkeley Was Just Pining For The Fjords

July 27, 2015

If you made it to Chapter 8 of Data-Driven Security after ~October 2014 and tried to run the BerkeleyDB R example, you were greeted with: Warning in install.packages : ...

## Evading the “Hadley tax”: Faster Travis tests for R

July 26, 2015

Hadley is a popular figure, and rightly so as he successfully introduced many newcomers to the wonders offered by R. His approach strikes some of us...

## Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman: The Movie

July 26, 2015

This is a screencast of my UseR! 2015 presentation: Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman. Based on the original blog post it...

## Making Static/Interactive Voronoi Map Layers In ggplot/leaflet

July 26, 2015

Despite having shown various ways to overcome D3 cartographic envy, there are always more examples that can cause the green monster to rear its ugly head. Take the Voronoi...

## RcppZiggurat 0.1.3: Faster Random Normal Draws

July 26, 2015

After a slight hiatus since the last release in early 2014, we are delighted to announce a new release of RcppZiggurat which is now on the CRAN network...

## Installing and Starting SparkR Locally on Windows OS and RStudio

July 26, 2015

Introduction: With the recent release of Apache Spark 1.4.1 on July 15th, 2015, I wanted to write a step-by-step guide to help new users get up and running with...

## Where do letters occur in words

July 26, 2015

A while back I encountered an interesting graphic showing where letters are located in English words (http://www.prooffreader.com/2014/05/graphing-distribution-of-english.html). The other day I decided to do a similar one for letters...

## Predicting Titanic deaths on Kaggle II: gbm

July 26, 2015

Following my previous post, I decided to try a different method: generalized boosted regression models (gbm). I have read the background in Elements of Statistical Learning and arthur...

## Rcpp 0.12.0: Now with more Big Data!

July 25, 2015

A new release 0.12.0 of Rcpp arrived on the CRAN network for GNU R this morning, and I also pushed a
