## 2013-3 Generating unique names in gridSVG

April 2, 2013

The gridSVG package exports grid images to the SVG image format for viewing on the web. This article describes the problems associated with retaining grid object names in SVG element id attributes. In addition, new features in gridSVG that allow …

## Revisiting MPs’ Expenses

April 2, 2013

I couldn’t help but notice the chatter about Iain Duncan Smith claiming he’d have no problem “living on 53 pounds a week”, which made me wonder not only how many meal-catered events he attends each week (and how many of his scheduled meetings also have complimentary tea and biscuits (a bellwether of the extent of

## a brief on naked statistics

April 2, 2013

Over the last Sunday breakfast I went through Naked Statistics: Stripping the Dread from the Data. The first two pages managed to put me in a prejudiced mood for the rest of the book. To wit: the author starts with some math bashing (like, no one ever bothers to tell us about the uses of

## R Beginners – Plotting Locations on to a World Map

April 2, 2013

This post is targeted at those who are just getting started plotting on maps using R. The relevant libraries are: maps, ggplot2, ggmap, and maptools. Make sure you install them. The Problem: Let's take a fairly simple use case: We have a few points on the...

## What’s New in Release 6.2: Additional ScaleR Features

April 2, 2013

by Thomas Dinsmore

Revolution R Enterprise Release 6.2 is on track for General Availability on April 22. In previous posts, I've commented on support for open source R 2.15.3 and Stepwise Regression. Today I'll wrap this series with a summary of some of the other new features supported in this release.

Parallel Random Number Generation: For analysts seeking to...

## Estimating continuous piecewise linear regression

April 2, 2013

When talking about smoothing splines, a simple point to start with is a continuous piecewise linear regression with fixed knots. I did not find any simple example showing how to estimate it in GNU R, so I have created a little snippet that does the j...
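The original snippet is cut off in this excerpt, so the following is only a minimal sketch of one common way to fit such a model: add a hinge term `pmax(x - knot, 0)` for each fixed knot and estimate everything with `lm`. The simulated data, the knot location, and the helper name `hinge` are assumptions made for illustration, not the post's own code.

```r
# Simulated data with a true kink at x = 5 (all values are illustrative).
set.seed(1)
x <- seq(0, 10, length.out = 200)
y <- 1 + 0.5 * x + 1.5 * pmax(x - 5, 0) + rnorm(length(x), sd = 0.3)

# Hypothetical helper: one hinge column per fixed knot.
# Each column is zero to the left of its knot and linear to the right,
# so the fitted line stays continuous at every knot.
hinge <- function(x, knots) {
  outer(x, knots, function(x, k) pmax(x - k, 0))
}

knots <- c(5)
fit <- lm(y ~ x + hinge(x, knots))

# Coefficients: intercept, slope left of the knot, change in slope at the knot.
coef(fit)
```

The slope to the right of a knot is the sum of the base slope and the hinge coefficients of all knots to its left.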

## Push hard on a problem here and it might just pop up over there

April 2, 2013

One thing I have noticed when reading other people’s R code is that their functions are often a lot longer than mine. Writing overly long functions is a common novice programmer mistake, but the code I am reading does not look like it is written by novices (based on the wide variety of base functions

## Introducing the healthvis R package – one line D3 graphics with R

April 2, 2013

We have been a little slow on the posting for the last couple of months here at Simply Stats. That’s bad news for the blog, but good news for our research programs! Today I’m announcing the new healthvis R package …

## Top 10 tips to get started with R

April 2, 2013

Be motivated. R has a steep learning curve. Find a problem you can't solve otherwise. E.g. plotting multivariate data, a statistical analysis for which an R function exists already. Download and install R. Get to know the R console. Learn how to instal...

## This day in history (1997-04-01)

April 1, 2013

It is now 16 years and 367,496 messages since Martin Mächler started the R-help (321,119 msgs), R-devel (45,830 msgs) and R-announce (547 msgs) mailing lists, a great benefit to all of us. Special thanks to Martin, and thanks also to everyone else contributing to these forums. https://stat.ethz.ch/pipermail/r-help/1997-April/001490.html

## Le Monde puzzle [#814]

April 1, 2013

The #814 Le Monde math puzzle was to find 100 digits (between 1 and 10) such that their sum is equal to their product. Given the ten possible values of those digits, this is equivalent to finding integers $a_1,\ldots,a_{10}$ such that $a_1+\cdots+a_{10}=100$ and $a_1+2a_2+\cdots+10a_{10}=2^{a_2}\times\cdots\times 10^{a_{10}}$, which reduces the number of unknowns from 100 to 10 (or
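As a small sketch of the reformulation above (not the post's own solution, which is cut off here), the two conditions can be checked directly on a vector of counts. The function name `is_solution` and the candidate below are illustrative assumptions; the candidate was worked out by hand as 97 ones, two 4s and one 7.

```r
# a[d] is how many times the value d (1..10) appears among the 100 numbers.
is_solution <- function(a) {
  values <- 1:10
  # First condition: 100 numbers in total.
  # Second condition: their sum equals their product.
  sum(a) == 100 && sum(a * values) == prod(values^a)
}

# Candidate: 97 ones, two 4s, one 7.
a <- c(97, 0, 0, 2, 0, 0, 1, 0, 0, 0)
is_solution(a)

# Sum and product for this candidate: 97 + 4 + 4 + 7 and 4 * 4 * 7.
sum(a * 1:10)
prod((1:10)^a)
```

Working with the ten counts rather than the 100 individual values is what makes an exhaustive search feasible.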

## Install R

April 1, 2013

Video Files This file contains data on pain score after laparoscopic vs. open hernia repair. Age, gender and primary/recurrent hernia also included. The ultimate aim here is to work out which of these factors are associated with more pain after this operation. lap_hernia Script

## Old Price Tables in Modern d3 Visualization

April 1, 2013

In my post Dust off 130 Year Old Gold Books on Google Bookshelf, I reproduced some of the old and way out of copyright price tables from the appendices in Gold and Prices Since 1873 by James Laurence Laughlin using latticeExtra xyplot. Now, with the clickme multiline d3 ractive built in my last post

## New Toy package wingui

April 1, 2013

A new toy package that I have been playing with has just been submitted to CRAN. I called it wingui. It works off the original Windows R GUI. The idea is to be able to adjust the advanced attributes for the window through a reference object GUI. Right now it only allows for changing the

## R User Groups Continue to Grow

April 1, 2013

by Joseph Rickert

R user groups seem to be sprouting all over. Since last September we have noticed ten new groups worldwide:

- Auckland, New Zealand: Auckland-R-Users-Group (AKLRUG) had 33 people attend their March 8th meeting
- Chang Mai Thailand: Chang Mai is the first R user group in Thailand
- Durban, South Africa: The Durban R User Group is looking forward...

April 1, 2013

To install a package in R, the function to be used is install.packages. Let's say we want to install the ggplot2 package; we simply code this with `install.packages("ggplot2")`. To install more than one package, we pass a character vector of package names to the same function. Note that in executing the above code, a dialogue box...
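The post's own code did not survive in this excerpt, so here is a hedged sketch of the pattern it describes. `install.packages()` and `installed.packages()` are standard R functions; the helper names `missing_packages` and `install_if_missing`, and the choice of maps as a second package, are hypothetical and used only for illustration.

```r
# Hypothetical helper: return the subset of `pkgs` that is not installed yet.
missing_packages <- function(pkgs) {
  installed <- rownames(installed.packages())
  setdiff(pkgs, installed)
}

# Hypothetical helper: install only what is missing.
# Calling it needs network access and a CRAN mirror.
install_if_missing <- function(pkgs) {
  to_install <- missing_packages(pkgs)
  if (length(to_install) > 0) install.packages(to_install)
  invisible(to_install)
}

# Usage (commented out because it may download from CRAN):
# install_if_missing("ggplot2")             # a single package
# install_if_missing(c("ggplot2", "maps"))  # several packages at once
```

Checking first avoids reinstalling packages that are already present, which is a design choice of this sketch rather than something the post prescribes.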

## A nifty line plot to visualize multivariate time series

April 1, 2013

A few days ago a colleague came to me for advice on the interpretation of some data. The dataset was large and included measurements for twenty-six species at several site-year-plot combinations. A substantial amount of effort had clearly been made to ensure every species at every site over several years was documented. I don’t pretend

## Updating R (on Windows) through a menu-bar: installr 0.9 released on CRAN

April 1, 2013

In preparation for the upcoming release of R 3.0.0, a new release 0.9 of installr is now on CRAN. The package can be installed and loaded using `if (!require(installr)) { install.packages("installr"); require(installr) }`, which loads installr and installs it first if needed. The …

## R Tackles Big Garbage

April 1, 2013

April 1, 2013 – Although the capabilities of the R system for data analytics have been expanding with impressive speed, it has heretofore been missing important fundamental methods. A new function works with the popular plyr package to provide these missing …

## A pictorial history of US large cap correlation

April 1, 2013

How has the distribution of correlations changed over the last several years?

Previously: posts about correlation, boxplots explained.

Data: daily returns of 443 large cap US stocks from 2004 through 2012 were used. The sample correlations (almost 98,000 of them) during each year were created. If we were actually using the correlations, then …
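The figure of almost 98,000 correlations follows from counting the distinct pairs among 443 stocks. A short sketch of that arithmetic, with a tiny simulated return matrix (an assumption for illustration, not the post's data) showing how the pairwise values can be pulled from a correlation matrix:

```r
# Number of distinct pairs among 443 stocks: 443 * 442 / 2.
n_stocks <- 443
n_pairs <- choose(n_stocks, 2)
n_pairs

# Illustrative only: 5 simulated "stocks" with 250 daily returns each.
set.seed(42)
returns <- matrix(rnorm(250 * 5), nrow = 250, ncol = 5)
cors <- cor(returns)

# The lower triangle holds each pairwise correlation exactly once.
pairwise <- cors[lower.tri(cors)]
length(pairwise) == choose(5, 2)
```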

## The R-Podcast Episode 12: Using Version Control with R

April 1, 2013

This is not an April Fool’s joke … The R-Podcast is back once again! In this episode, I discuss the concept of version control and how you can get started with using the Git VCS right now with your R projects. Also I discuss a big batch of listener feedback, and highlight a couple of

## p-values are (possibly biased) estimates of the probability that the null hypothesis is true

March 31, 2013

Last week, I posted about statisticians’ constant battle against the belief that the p-value associated (for example) with a regression coefficient is equal to the probability that the null hypothesis is true, for a null hypothesis that beta is zero or negative. I argued that (despite our long pedagogical practice) there are, in fact, many

## To plot them is my real test

March 31, 2013

I almost couldn’t bring myself to post this, but it’s April Fools’ Day, so I’ll never have a better opportunity. This Gist shows how to scrape “stats” and .PNG images from, erm, Bulbapedia, run a simple dimensionality reduction on those “stats,” and plot all 151 original Pokemon. I don’t know much about Pokemon, so I can’t vouch for...

## R and the last comma

March 31, 2013

In R, every comma matters. When creating a vector, c(1, 2, 5) will do the right thing, but add one unfortunate comma and c(1, 2, 5,) will greet you with a deadly Error in c(1, 2, 5, ) : argument 4 is empty. Other languages like Perl are less strict when defining basic data structures: having a comma after...
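The behaviour described above can be reproduced safely by catching the error rather than letting it stop the session. This is a minimal sketch; the variable names are illustrative.

```r
# A well-formed vector.
ok <- c(1, 2, 5)

# A trailing comma leaves an empty fourth argument; c() raises an error,
# which tryCatch() captures as a condition object instead of aborting.
res <- tryCatch(c(1, 2, 5, ), error = function(e) e)

inherits(res, "error")
conditionMessage(res)
```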

## How do Dew and Fog Form? Nature at Work with Temperature, Vapour Pressure, and Partial Pressure

In the early morning, especially here in Canada, I often see dew – water droplets formed by the condensation of water vapour on outside surfaces, like windows, car roofs, and leaves of trees.  I also sometimes see fog – water droplets or ice crystals that are suspended in air and often blocking visibility at great

## Gary King and Stuart Shieber on Open Access

March 31, 2013

Harvard Professors Gary King and Stuart Shieber provide advice to graduate students about open access, dissertations, and journal publishing. They explain how freely available publications are essential to the scientific community, but also benefit your own career. King suggests a clever way of dealing with publisher copyright agreements to prevent locking up your work behind ...

## Checking for Normality with Quantile Ranges and the Standard Deviation


Introduction I was reading Michael Trosset’s “An Introduction to Statistical Inference and Its Applications with R”, and I learned a basic but interesting fact about the normal distribution’s interquartile range and standard deviation that I had not learned before.  This turns out to be a good way to check for normality in a data set.
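The excerpt does not spell out the fact it refers to, so the following is a sketch under the assumption that it is the standard result that, for a normal distribution, the interquartile range is about 1.35 standard deviations (exactly `2 * qnorm(0.75)` times sigma). The simulated samples are illustrative.

```r
# Theoretical IQR / SD ratio for any normal distribution.
ratio_normal <- 2 * qnorm(0.75)
ratio_normal

set.seed(123)
x_norm <- rnorm(10000)  # normal sample
x_exp <- rexp(10000)    # clearly non-normal (skewed) sample

# Close to ratio_normal for the normal sample...
IQR(x_norm) / sd(x_norm)

# ...and noticeably different for the exponential sample.
IQR(x_exp) / sd(x_exp)
```

A ratio far from this value suggests non-normality, although a ratio near it does not by itself establish that the data are normal.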