## Modelling Occurence of Events, with some Exposure

July 28, 2015
By
$Y_i^\star$

This afternoon, an interesting point was raised, and I wanted to get back on it (since I did publish a post on that same topic a long time ago). How can we adapt a logistic regression when all the observations do not have the same exposure. Here the model is the following: , the occurence of an event  on the period ...

## Efficient Accumulation in R

July 28, 2015
By

by John MountData Scientist, Win-Vector LLC R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, RevoScaleR, data.table, and more), but when it comes to...

## The complete catalog of argument variations of select() in dplyr

July 28, 2015
By

When I read the dplyr vignette, I found a convenient way to select sequential columns such as select(data, year:day). Because I had inputted only column names to select()...

## Notes from the 3rd R in Insurance Conference

July 28, 2015
By

Photo: Arthur CharpentierThe R in Insurance conference in Amsterdam was a sold out success! Congratulations to the organising committee at the University of Amsterdam, and many thanks to our...

## upsetplot in ChIPseeker

July 28, 2015
By

ChIPseeker is an R package for ChIP peak annotation, comparison and visualization. We have implemented several visualization methods, including vennpie that was designed for viewing annotation overlap as shown below:

## Catalan numbers for triangulations gambler’s ruin binary…

July 27, 2015
By

Catalan numbers for triangulations gambler’s ruin binary trees trees …from the Flajolet/Sedgewick coursera analysis of...

## Graph-based circle packing

July 27, 2015
By

The previous two posts showed examples of a simple circle packing algorithm using the packcircles package (available from CRAN and GitHub). The algorithm involved iterative pair-repulsion to...

## Statistical Models of Judgment and Choice: Deciding What Matters Guided by Attention and Intention

July 27, 2015
By

Preference begins with attention, a form of intention-guided perception. You enter the store thirsty on a hot summer day, and all you can see is the beverage cooler at...

## Egyptian fractions [Le Monde puzzle #922]

July 27, 2015
By

For its summer edition, Le Monde mathematical puzzle switched to a lighter version with immediate solution. This #922 considers Egyptian fractions which only have distinct denominators (meaning the numerator...

## Hadley Wickham on why he created all those R packages

July 27, 2015
By

Priceonomics published on Friday an in-depth profile of Hadley Wickham, author of many of the most popular R packages including ggplot2, dplyr and devtools. In the article, he reveals...

## Announcing: Mastering RStudio

July 27, 2015
By

Learn the holistic use of RStudio to communicate your R code effectively and persuasively. Max (@nierhoff) and I are both absolute R enthusiasts. We both strongly believe in the power...

## Efficient accumulation in R

July 27, 2015
By

R has a number of very good packages for manipulating and aggregating data (plyr, sqldf, ScaleR, data.table, and more), but when it comes to accumulating results the beginning R...

## RBerkeley Was Just Pining For The Fjords

July 27, 2015
By

If you made it to Chapter 8 of Data-Driven Security after ~October 2014 and tried to run the BerkeleyDB R example, you were greeted with: Warning in install.packages : ...

July 26, 2015
By

Hadley is a popular figure, and rightly so as he successfully introduced many newcomers to the wonders offered by R. His approach strikes some of us...

## Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman: The Movie

July 26, 2015
By

This is a screencast of my UseR! 2015 presentation: Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman. Based on the original blog post it...

## Making Static/Interactive Voronoi Map Layers In ggplot/leaflet

July 26, 2015
By

Despite having shown various ways to overcome D3 cartographic envy, there are always more examples that can cause the green monster to rear it’s ugly head. Take the Voronoi...

## RcppZiggurat 0.1.3: Faster Random Normal Draws

July 26, 2015
By

After a slight hiatus since the last release in early 2014, we are delighted to announce a new release of RcppZiggurat which is now on the CRAN network...

## Installing and Starting SparkR Locally on Windows OS and RStudio

July 26, 2015
By

Introduction With the recent release of Apache Spark 1.4.1 on July 15th, 2015, I wanted to write a step-by-step guide to help new users get up and running with...

## Where do letters occur in words

July 26, 2015
By

A while back I encountered an interesting graphic showing where letters were located in english words (http://www.prooffreader.com/2014/05/graphing-distribution-of-english.html). The other day I decided to do a similar one for letters...

## Predicting Titanic deaths on Kaggle II: gbm

July 26, 2015
By

Following my previous post I have decided to try and use a different method: generalized boosted regression models (gbm). I have read the background in Elements of Statistical Learning and arthur...

## Rcpp 0.12.0: Now with more Big Data!

July 25, 2015
By

A new release 0.12.0 of Rcpp arrived on the CRAN network for GNU R this morning, and I also pushed a

July 25, 2015
By

As I was putting together the coord_proj ggplot2 extension I had posted a (https://gist.github.com/hrbrmstr/363e33f74e2972c93ca7) that I shared on Twitter. Said gist received a comment (several, in fact) and a...

## IEEE Spectrum Puts R in 6th Place

July 25, 2015
By

R has moved up three positions to 6th place on IEEE Spectrum ranking. How long will it be before Julia is on the list? The post IEEE Spectrum Puts R...

## Logistic Growth, S Curves, Bifurcations, and Lyapunov Exponents in R

July 24, 2015
By

If you’ve ever wondered how logistic population growth (the Verhulst model), S curves, the logistic map, bifurcation diagrams, sensitive dependence on initial conditions, “orbits”, deterministic chaos, and Lyapunov exponents...

## New quantmod and TTR on CRAN

July 24, 2015
By

I just sent quantmod_0.4-5 to CRAN, and TTR_0.23-0 has been there for a couple weeks. I'd like to thank Ivan Popivanov for many useful reports and patches to TTR....

## A Path Towards Easier Map Projection Machinations with ggplot2

July 24, 2015
By

The \$DAYJOB doesn’t afford much opportunity to work with cartographic datasets, but I really like maps and tinker with shapefiles and geo-data when I can, plus answer a ton...

## {Long Vs. Wide} Data Frames

July 24, 2015
By

Introduction This is an excellent resource to understand 2 types of data frame format: Long and Wide. Just take a look at figure 1 inside the article 1) Long format: ggplot2 needs...

## R #6 in IEEE 2015 Top Programming Languages, Rising 3 Places

July 24, 2015
By

IEEE Spectrum has published its 2015 list of Top Programming Languages, and R ranks in 6th place, jumping 3 places from its 2014 ranking. Here's what the IEEE has...

$Why I use Panel/Multilevel Methods$