Blog Archives

One datavis for you, ten for me

September 14, 2014
By
One datavis for you, ten for me

Over the years of my graduate studies I made a lot of plots. I mean tonnes. To get an extremely conservative estimate I grep’ed for every instance of “plot(” in all of the many R scripts I wrote over the past five years. The actual number is very likely orders of magnitude larger as 1) many

Read more »

Plot with ggplot2, interact, collaborate, and share online

July 31, 2014
By
Plot with ggplot2, interact, collaborate, and share online

Editor’s note: This is a guest post by Marianne Corvellec from Plotly. This post is based on an interactive Notebook (click to view) she presented at the R User Conference on July 1st, 2014. Plotly is a platform for making, editing, and sharing graphs. If you are used to making plots with ggplot2, you can

Read more »

Online R and Plotly Graphs: Canadian and U.S. Maps, Old Faithful with Multiple Axes, & Overlaid Histograms

February 6, 2014
By
Online R and Plotly Graphs: Canadian and U.S. Maps, Old Faithful with Multiple Axes, & Overlaid Histograms

Guest post by Matt Sundquist of plot.ly. Plotly is a social graphing and analytics platform. Plotly’s R library lets you make and share publication-quality graphs online. Your work belongs to you, you control privacy and sharing, and public use is free (like GitHub). We are in beta, and would love your feedback, thoughts, and advice.

Read more »

What’s Warren Buffett’s $1 Billion Basketball Bet Worth?

January 22, 2014
By
What’s Warren Buffett’s $1 Billion Basketball Bet Worth?

A friend of mine just alerted me to a story on NPR describing a prize on offer from Warren Buffett and Quicken Loans. The prize is a billion dollars (1B USD) for correctly predicting all 63 games in the men’s Division I college basketball tournament this March. The facebook page announcing the contest puts the odds at 1:9,223,372,036,854,775,808,

Read more »

Simudidactic

November 21, 2013
By
swirl

(This article was first published on bayesianbiologist » Rstats, and kindly contributed to R-bloggers) auto·di·dact n. A self-taught person. From Greek autodidaktos, self-taught : auto-, auto- + didaktos, taught; + sim·u·late v. To create a representation or model of (a physical system or particular situation, for example). From Latin simulre, simult-, from similis, like; = (If you can get past the mixing of Latin and Greek roots) sim·u·di·dactic adj. To learn by...

Read more »

Montreal R User Group – Dr. Ramnath Vaidyanathan on his rCharts package

October 27, 2013
By
Montreal R User Group –  Dr. Ramnath Vaidyanathan on his rCharts package

Monday, October 28, 2013. 6:00pm at Notman House 51 Sherbrooke W., Montreal, QC. We are very pleased to welcome back Dr. Ramnath Vaidyanathan for a talk on interactive documents as it relates to his excellent rCharts package. Bringing a laptop to follow along is highly encouraged. I would recommend installing rCharts prior to the workshop. library(devtools) pkgs <- c(‘rCharts’, ‘slidify’, ‘slidifyLibraries’) install_github(pkgs, ‘ramnathv’, ref

Read more »

Follow up to Johnson et al Post

October 20, 2013
By
Follow up to Johnson et al Post

Last week I posted a comment on a paper by Neil Johnson and colleagues that I now regret. The comment amounted to a bit of statistical pedantry on my part regarding some of the wording in the paper. It was my wording in this post, and specifically the title, which would have benefited from some

Read more »

Calculating AUC the hard way

October 10, 2013
By
Calculating AUC the hard way

The Area Under the Receiver Operator Curve is a commonly used metric of model performance in machine learning and many other binary classification/prediction problems. The idea is to generate a threshold independent measure of how well a model is able to distinguish between two possible outcomes. Threshold independent here just means that for any model

Read more »

Time-series forecasting: Bike Accidents

August 20, 2013
By
Time-series forecasting: Bike Accidents

About a year ago I posted this video visualization of all the reported accidents involving bicycles in Montreal between 2006 and 2010. In the process I also calculated and plotted the accident rate using a monthly moving average. The results followed a pattern that was for the most part to be expected. The rate shoots up

Read more »

From Whale Calls to Dark Matter: Competitive Data Science with R and Python

July 12, 2013
By
From Whale Calls to Dark Matter: Competitive Data Science with R and Python

Back in June I gave a fun talk at Montreal Python on some of my dabbling in the competitive data science scene. The good people at Savior-fair Linux recorded the talk and have edited it all together into a pretty slick video. If you can spare twenty-minutes or so, have a look. If you want

Read more »