2603 search results for "GIS"

Two Talks on Data Science, Big Data and R

October 23, 2012
By

On Thursday next week (November 1), I'll be giving a new webinar on the topic of Big Data, Data Science and R. Titled "The Rise of Data Science in the Age of Big Data Analytics: Why Data Distillation and Machine Learning Aren’t Enough", this is a provocative look at why data scientists cannot be replaced by technology, and why...

Read more »

Benchmarking matrix creation

October 23, 2012
By
Benchmarking matrix creation

Sometimes it is useful to take a vector, or one column/row of a matrix, and build a new matrix of identical copies of that vector. There are lots of different ways to do this, but I just discovered a new, and very straightforward way to do this with m...

Read more »

It Takes 2 Lines of R Code to Discover Interesting Biology

October 23, 2012
By
It Takes 2 Lines of R Code to Discover Interesting Biology

The following biological phenomenon demonstrates just how elegant R code can be. In vertebrate genomes, a methyl group (-CH3) can be added to nucleotides. Such process of methylation is commonly associated with gene suppression. Most of the cytosines in the … Continue reading →

Read more »

googleVis 0.3.0/0.3.1 is released: It’s faster!

October 23, 2012
By
googleVis 0.3.0/0.3.1 is released: It’s faster!

Version 0.3.0 of the googleVis package for R has been released on CRAN on 20 October 2012. With this version we have been able to speed up the code considerably. The transformation of R data frames into JSON works significantly faster. The execution of the gvisMotionChart function in the World Bank demo is over 35 times...

Read more »

Is it meaningful to talk about a probability of “65.7%” that Obama will win the election?

October 22, 2012
By
Is it meaningful to talk about a probability of “65.7%” that Obama will win the election?

The other day we had a fun little discussion in the comments section of the sister blog about the appropriateness of stating forecast probabilities to the nearest tenth of a percentage point. It started when Josh Tucker posted this graph from Nate Silver: My first reaction was: this looks pretty but it’s hyper-precise. I’m a The post Is...

Read more »

Predict User’s Return Visit within a day part-3

October 22, 2012
By
Predict User’s Return Visit within a day part-3

Welcome to the last part of the series on predicting user’s revisit to the website. In the  first part of series, I generated the logistic regression model for prediction problem whether a user will come back on  website in next 24 hours. In the second part, I discussed about model improvement and seen the model accuracy.

Read more »

Distribution of colors by flag

October 22, 2012
By
Distribution of colors by flag

A story: We showed you how to use R to assess flag similarity and make a scatter plot of raster images. Dr. Wickham referred us to the set of 2400 flag icons made available by GoSquared, and then (probably jokingly) challenged us to replicate the cool...

Read more »

Predict User’s Return Visit within a day part-2

October 22, 2012
By
Predict User’s Return Visit within a day part-2

Welcome to the second part of the series on predicting user’s revisit to the website. In my earlier blog Logistic Regression with R, I discussed what is logistic regression. In the first part of the series, we applied logistic regression to available data set. The problem statement there was whether a user will return in

Read more »

Predict User’s Return Visit within a day part-1

October 22, 2012
By
Predict User’s Return Visit within a day part-1

In my earlier blog, I have discussed about what is logistic regression? And how logistic model is generated in R? Now we will apply that learning on a specific problem of prediction. In this post, I will create a basic model to predict whether a user will return on website in next 24 hours. This

Read more »

Carl Morris Symposium on Large-Scale Data Inference (2/3)

October 20, 2012
By
Carl Morris Symposium on Large-Scale Data Inference (2/3)

Continuing the summary of last week’s symposium on statistics and data visualization (see part 1 and part 3)… Here I describe Dianne Cook’s discussion of visual inference, and Rob Kass’ talk on statistics in cognitive neuroscience. [Edit: I've added a few … Continue reading →

Read more »