Spearman’s Rho

August 30, 2012
By

Spearman’s Rho Rank Correlation There are generally three types of correlation that a researcher may encounter: Pearson’s r, Kendall’s Tau, and Spearman’s Rho.  They each have their own uses and applications depending on the da...

Read more »

Three ways of visualizing the growth of Walmart

August 30, 2012
By
Three ways of visualizing the growth of Walmart

It's a wonderful thing when people make interesting data sets available to the public. When Thomas Jones wrote a paper in Econometrics about the growth of US retail giant Walmart, he made the data he collected about every Walmart store opening in history (location and date) available to the public. Since then, several people have used different techniques to...

Read more »

A Stan is Born

August 30, 2012
By

Stan 1.0.0 and RStan 1.0.0 It’s official. The Stan Development Team is happy to announce the first stable versions of Stan and RStan. What is (R)Stan? Stan is an open-source package for obtaining Bayesian inference using the No-U-Turn sampler, a variant of Hamiltonian Monte Carlo. It’s sort of like BUGS, but with a different language The post A...

Read more »

Making matrices with zeros and ones

August 30, 2012
By

So I was trying to figure out a fast way to make matrices with randomly allocated 0 or 1 in each cell of the matrix. I reached out on Twitter, and got many responses (thanks tweeps!). Here is the solution I came up with. See if you can tell why it...

Read more »

Another Great Google Summer of Code 2012 R Project

August 30, 2012
By
Another Great Google Summer of Code 2012 R Project

Tradeblotter announced the very nice features that will be added to the PerformanceAnalytics package as a result of the Google Summer of Code (GSOC) 2012 project: “…Matthieu commenced to produce dozens of new functions, extend several more existin...

Read more »

Visually weighted regression in R (à la Solomon Hsiang)

August 30, 2012
By
Visually weighted regression in R (à la Solomon Hsiang)

, and also the discussions on the Statistical Modeling, Causal

Read more »

F1 2012 Mid-Season Review

August 30, 2012
By
F1 2012 Mid-Season Review

Rather belatedly, I got around to posting a series of posts summarising the Formula One season to date: F1 2012 Mid-Season Review – Grid/Classification Analysis: for example, how do the drivers’ grid and final classifications compare? F1 2012 Mid-Season Review – Pit Stops: for example, how does pit stop performance across the teams compare? F1

Read more »

Late to the ggplot2 party

August 29, 2012
By

I have resisted learning the popular R graphics package, ggplot2. I dismissed ggplot2 as primarily useful for exploratory graphics and rationalized my avoidance of ggplot2 by assuming that it would require just as many (or more) lines of code as the R base package to whip the default plots into publication-quality figures. The few times

Read more »

Virginia: Comparison of Registered Voter Counts to Census Voting Age Population

August 29, 2012
By
Virginia: Comparison of Registered Voter Counts to Census Voting Age Population

By Earl F Glynn | Franklin Center A comparison of US Census voting age population data in Virginia to voter registration data shows only one locality, Surry County, with about 100% of the voting age population registered to vote. Six other localities have about 95% of their voting age population registered:  Craig County, Isle of

Read more »

Processing Data from a Statistica Worksheet Using R

August 29, 2012
By
Processing Data from a Statistica Worksheet Using R

Context: I work with data from non-profit organizations, and so a big concern in many of my analyses is if and how much people are donating from one year to the next.  One of the  things I normally like to do … Continue reading →

Read more »

R does CSI: Using R to nail break-in suspects

August 29, 2012
By
R does CSI: Using R to nail break-in suspects

You've probably heard (or seen in TV shows) how the unique pattern of rifling in a gunbarrel generates forensic evidence: microscopic scoring on the bullets left at the scene of the crime can be linked to the shooter by matching the marks to the firearm. What you might not know is that the same technique can be applied to...

Read more »

back from down under

August 29, 2012
By
back from down under

After a sunny weekend to unpack and unwind, I am now back to my normal schedule, on my way to Paris-Dauphine for an R (second-chance) exam. Except for confusing my turn signal for my wiper, thanks to two weeks of intensive driving in four Australian states!, things are thus back to “normal”, meaning that I

Read more »

Basic documentation for soilDB package (R) now available

August 29, 2012
By

A proper introduction to the soilDB package is now available here. Installation and basic usage are covered. More detailed, task-specific documentation on aqp and soilDB will be available soon. read more

Read more »

…Now With More Bacon (2008)!

August 29, 2012
By
…Now With More Bacon (2008)!

I’m sure that Carl Bacon sighs deeply when he reads such headlines, but it is clearly appropriate in this case. Perhaps you remember that I proposed a Google Summer of Code project for 2012 around a considerable code contribution to PerformanceAnalytics from Diethelm Wuertz at ETHZ. That code was focused on adding a large number

Read more »

Plotting model fits

August 29, 2012
By
Plotting model fits

We all know that it is important to plot your data and explore the data visually to make sure you understand it. The same is true for your model fits. First, you want to make sure that the model is fitting...

Read more »

R Script to Build Animation of Arctic Sea Ice Extent – Update 12/20/13

August 29, 2012
By
R Script to Build Animation of Arctic Sea Ice Extent – Update 12/20/13

In my previous post I showed an animation of Arctic Sea Ice Extent from the 1980’s through August, 2012 (link).  In this post, I show how to build this Arctic Sea ice Extent  animated chart. Source Data The Arctic Ice … Continue reading →

Read more »

Integrating R into a SAS shop

August 29, 2012
By

I work in an environment dominated by SAS, and I am looking to integrate R into our environment. Why would I want to do such a thing? First, I do not want to get rid of SAS. That would not only take away most of our investment in SAS training and hiring good quality SAS programmers, but...

Read more »

Generate simple HTML slides using deck.js and markdown

August 29, 2012
By

RStudio and knitr are an excellent conbination for generating dynamic reports. But in this blog, I will show you how to generate HTML-style presentaion using R only. OK, I confess that we still need something else: deck.js and markdown and R.utils. ...

Read more »

Facts About R Packages (2)

August 29, 2012
By

R Packages All Well maintained? There are so many R packages, can they all be trusted? or are they well maintained? To answer this question, we just need to take a look of their archive histories. If a package has many versions, we can take that as th...

Read more »

Facts About R Packages (1)

August 29, 2012
By

R Packages growth Curve Why R is so popular? There are a lot of reasons, such as: easy to learn and convenient to use, active community, open source, etc. Another important reason is the numerous contributed packages. Up to yesterday, there are 4033 R...

Read more »

Generate Quasi-Poisson Distribution Random Variable

August 29, 2012
By

Most of regression methods assume that response variables follow some exponential distribution families, e.g. Guassian, Poisson, Gamma, etc. However, this assumption was frequently violated in real world by, for example, zero-inflated overdispersion problem. A number of methods were developed to deal with such problem, and among them, Quasi-Poisson and Negative Binomial are the most popular methods perhaps due to that...

Read more »

googleVis — NASA’s exploration of Mars

August 29, 2012
By
googleVis — NASA’s exploration of Mars

After generating a few interactive charts with googleVis, I realized that it’s a great way to visualize numeric data, especially multi-dimentional data. Days ago, my colleague sent me a picture taken by Curiosity from Mars. He was crazy about it … Continue reading →

Read more »

Setting Up the Development Version of R

August 28, 2012
By

My coworkers at Fred Hutchinson regularly use the development version of R (i.e., R-devel) and have urged me to do the same. This post details how I have set up the development version of R on our Linux server, which I use remotely because it is much faster than my Mac. First, I downloaded the R-devel source into ~/local/, which...

Read more »

Newton-Raphson can compute an average

August 28, 2012
By
Newton-Raphson can compute an average

In our article How robust is logistic regression? we pointed out some basic yet deep limitations of the traditional full-step Newton-Raphson or Iteratively Reweighted Least Squares methods of solving logistic regression problems (such as in R‘s standard glm() implementation). In fact in the comments we exhibit a well posed data fitting problem that can not Related posts:

Read more »

Will Data Scientists Be Replaced by Tools?

August 28, 2012
By

The Quick-and-Dirty Summary I was recently asked to participate in a proposed SXSW panel that will debate the question, “Will Data Scientists Be Replaced by Tools?” This post describes my current thinking on that question as a way of (1) convincing you to go vote for the panel’s inclusion in this year’s SXSW and (2)

Read more »

m x n matrix with randomly assigned 0/1

August 28, 2012
By
m x n matrix with randomly assigned 0/1

Today Scott Chamberlain tweeted asking for a better/faster solution to building an m x n matrix with randomly assigned 0/1. He already had a working version: Now, I’m the first to acknowledge that I’ve never got the ‘apply’ family of … Continue reading →

Read more »

Arctic sea-ice at lowest levels since observations began

August 28, 2012
By
Arctic sea-ice at lowest levels since observations began

RealClimate.org used the R language and data from the National Snow and Ice Data Center to create this chart showing the extent of Arctic sea-ice in each year since satellite observations began in 1978, and the current extent of ice coverage (in red). Even though there are several weeks of annual melting yet to come, the area of ice...

Read more »

More on Exploring Correlations in R

August 28, 2012
By
More on Exploring Correlations in R

About a year ago I wrote a post about producing scatterplot matrices in R. These are handy for quickly getting a sense of the correlations that exist in your data. Recently someone asked me to pull out some relevant statistics (correlation coefficient ...

Read more »

R-Studio

August 28, 2012
By
R-Studio

A post over on Dang, another error (show me yours and I’ll show you mine) has a method of working with R which uses an IDE called Eclipse in conjunction with a plugin called StatET. Eclipse is one of a number of IDEs that I’m aware of (Tinn-R being another, but this Sciviews pages has

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.