2774 search results for "GIS"

Modeling Trick: Impact Coding of Categorical Variables with Many Levels

July 23, 2012

One of the shortcomings of regression (both linear and logistic) is that it doesn’t handle categorical variables with a very large number of possible values (for example, postal codes). You can get around this, of course, by going to another modeling technique, such as Naive Bayes; however, you lose some of the advantages of regression...

Third year wrap-up

July 23, 2012

July marks the end of three years of blogging for us. By our count, we've posted 121 examples across the first three years. We aim to be helpful and interesting. As always, it's hard to get a sense of our readership. At the time we wrote this, Feedbur...

London Olympics and a prediction for the 100m final

July 22, 2012

It is less than a week before the 2012 Olympic Games start in London. No surprise, therefore, that the papers are all over it, including a lot of data and statistics around the games. The Economist investigated the potential financial impact on spons...

Modeling Permanent and Gradual Process Changes with CDFs

July 20, 2012

Spencer Herath. Special thanks to Ben Ogorek. Background: I recently faced a process with a structural change resulting in an increase in the process mean. The jump to the new mean was not immediate; rather, there was a gradual increase in values over time. I had previously benefited from multi-staged process-behavior charts when encountering immediate process shifts, but now I needed a...

Best of Axys, R, d3.js, and HTML5

July 19, 2012

Axys, R, d3.js, and HTML5 all offer incredibly powerful tools for investment management and reporting, but they are not set up to work together, filling each other’s gaps and leveraging each other’s strengths. In my ideal scenario, Ax...

Plotting the Frequency of Twitter Hashtag Usage Over Time with R and ggplot2

July 17, 2012

The 20th annual ISMB meeting was held over the last week in Long Beach, CA. It was an incredible meeting with lots of interesting and relevant talks, and lots of folks were tweeting the conference, usually with at least a few people in each concurrent ...

Optical Art with R

July 16, 2012

Last week, in a post entitled "Bridget Riley exhibition in London", the author Markus Gesmann wrote an R script reproducing one of Riley's famous art pieces: Movement in Squares. This reminded me of my own first "brush" with Op art. It was in art class ye...

Sourcing an R Script from Dropbox

July 14, 2012

I was working on my R bootcamp materials and thought it would be handy to get the bootcamp computers set up by sourcing an R script that installs all the necessary non-core packages. The problem? How to deploy this script efficiently. A quick method w...

Smartphone operating system share mosaic plot

July 13, 2012

(This article was first published on Actuarially (Matt Malin), and kindly contributed to R-bloggers.) Author: Matt Malin. The increasing dominance of smartphones across the market is a very common topic on technology and news sites, with analysis of operating system share and phone types often shown in the media. Stumbling across this article...
