Bayes says “don’t worry” about Scotland’s Referendum

September 17, 2014
By
Bayes says “don’t worry” about Scotland’s Referendum

Just few hours before Scots head to the polls, there is not an overwhelming advantage of the anti-independence vote. Actually, the margin is shorter than last time I looked at it, but despite such a growing trend in favor of the "Yes" campaign in the last weeks, the "NO" side has an edge still. To … Read More...

Read more »

Maximal Information Coefficient (Part II)

September 17, 2014
By
Maximal Information Coefficient (Part II)

A while back, I wrote a post simply announcing a recent paper that described a new statistic called the "Maximal Information Coefficient" (MIC),...

Read more »

Changes to FSA — Size Structure

September 16, 2014
By
Changes to FSA — Size Structure

I have added a (very rough) first draft to the Size Structure chapter of the forthcoming Introductory Fisheries Science with R book on the book’s fishR webpage.  Accompanying this...

Read more »

PerformanceAnalytics update released to CRAN

September 16, 2014
By
PerformanceAnalytics update released to CRAN

Version number 1.4.3541 of PerformanceAnalytics was released on CRAN today. If you’ve been following along, you’ll note that we’re altering our version numbering system.  From here on out, we’ll...

Read more »

New members for R-core and R Foundation

September 16, 2014
By

The R Foundation for Statistical Computing, the Vienna-based non-profit organization that oversees the R Project, has just added several new "ordinary members". (Ordinary members participate in R Foundation meetings...

Read more »

R package to convert statistical analysis objects to tidy data frames

September 16, 2014
By

I talked a little bit about tidy data my recent post about dplyr, but you should really go check out Hadley’s paper on the subject. R expects inputs...

Read more »

3D Sine Wave

September 16, 2014
By
3D Sine Wave

Had a headache last night, so decided to take things easy and...

Read more »

Notes from the Kölner R meeting, 12 September 2014

September 16, 2014
By
Notes from the Kölner R meeting, 12 September 2014

Last Friday we had guests from Belgium and the Netherlands joining us in Cologne. Maarten-Jan Kallen from BeDataDriven came from The Hague to introduce us to Renjin, and...

Read more »

Using SQLite in R

September 16, 2014
By
Using SQLite in R

Working on big data requires a clean and robust approach on storing and accessing the data. SQLite is an all inclusive server-less database system in a single file. This...

Read more »

Nuts and Bolts of Quantstrat, Part II

September 16, 2014
By
Nuts and Bolts of Quantstrat, Part II

Last week, I covered the boilerplate code in quantstrat. This post will cover parameters and adding indicators to strategies in … Continue reading →

Read more »

how to provide a variance calculation on your public-use survey data file without disclosing sampling clusters or violating respondent confidentiality

September 16, 2014
By

this post and accompanying syntax would not have been possible without dan oberski.  read more, find out why.  thanks dan.dear survey administrator: someone sent you this link because you...

Read more »

Why Are We Still Teaching t-Tests?

September 15, 2014
By
Why Are We Still Teaching t-Tests?

My posting about the statistics profession losing ground to computer science drew many comments, not only here in Mad (Data) Scientist, but also in the co-posting at Revolution Analytics,...

Read more »

Interview with Romain Francois at useR! 2014

September 15, 2014
By

At the useR! 2014 conference, without a doubt one of the overriding themes was R’s...

Read more »

If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digitial monkeys have read Dracula

September 15, 2014
By
If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digitial monkeys have read Dracula

On the weekend, randomly after watching Catching Fire, I remember the problem of the typing monkeys (Infinite monkey theorem) in which basically could be defined as (Thanks to Wiki):#...

Read more »

Using Reddit’s JSON API to analyze post popularity

September 15, 2014
By
Using Reddit’s JSON API to analyze post popularity

Graduate student Clay McLeod decided to find out what makes a post on the social-sharing site Reddit popular. These are the questions he seeks to answer: What’s in a...

Read more »

Creating a map showing land covered by rising sea levels

September 15, 2014
By

I joined the Geekli.st climate Hackathon this weekend at the Hub Westminster (my favorite venue for Hackathons). While the organizers had lots of enthusiasm they had very little in...

Read more »

Mapping every IPv4 address

September 15, 2014
By
Mapping every IPv4 address

During July I was working with a commercial data source that provides extra data around IP addresses and it dawned on me: rather than pinging billions of IP addresses and...

Read more »

PCA / EOF for data with missing values – a comparison of accuracy

September 15, 2014
By
PCA / EOF for data with missing values – a comparison of accuracy

Not all Principal Component Analysis (PCA) (also called Empirical Orthogonal Function analysis, EOF) approaches are equal when it comes to dealing with a data...

Read more »

How do you say π^π^π?

September 15, 2014
By
How do you say π^π^π?

Well, not that you really probably want to know how to say such an absurdly large number. However for those of you who are interested (allowing for rounding) it...

Read more »

One datavis for you, ten for me

September 14, 2014
By
One datavis for you, ten for me

Over the years of my graduate studies I made a lot of plots. I mean tonnes. To get an extremely conservative estimate I grep’ed for every instance of “plot(”...

Read more »

Trying a prefmap

September 14, 2014
By
Trying a prefmap

Preference mapping is a key technique in sensory and consumer research. It links the sensory perception on products to the liking of products and hence provides clues to the...

Read more »

RDataMining Slides Series

September 14, 2014
By
RDataMining Slides Series

by Yanchang Zhao, RDataMining.com I have made a series of slides on R and data mining, based on my book titled R and Data Mining — Examples and Case...

Read more »

Newcastle R course, a write-up

September 13, 2014
By

I recently attended a week-long R course in Newcastle, taught by Colin Gillespie. It went from “An Introduction to R” to “Advanced Graphics” via a day each...

Read more »

The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

September 13, 2014
By
The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

"...a metaphor is an affair between a predicate with a past and an object that yields while protesting." Nelson Goodman (1976)It is, as if, data matrices were alive. The rows...

Read more »

Google uses R to calculate ROI on advertising campaigns

September 12, 2014
By
Google uses R to calculate ROI on advertising campaigns

Google has just released a new package for R: CausalImpact. Amongst many other things, this package allows Google to resolve the classical conundrum: how can we asses the impact...

Read more »

R: k-Means Clustering on an Image

September 12, 2014
By
R: k-Means Clustering on an Image

Enough with the theory we recently published, let's take a break and have fun on the application of Statistics used in Data Mining and Machine Learning, the k-Means Clustering.k-means...

Read more »

Conor Atom, a book for “children scientists” (an indiegogo campaign)

September 12, 2014
By
Conor Atom, a book for “children scientists” (an indiegogo campaign)

Mario Morales –a Colombian-American, Statistician-Bioinformatician, Member of the R community and a regular attendant of the UseR conference since 2007 has launched a book for Children called “Conor Atom, The child...

Read more »

Embedding RData files in Rmarkdown files for more reproducible analyses

September 12, 2014
By

For those of us interested in reproducible analysis, Rmarkdown is a great way of communicating our code to other researchers. Rstudio, in particular, makes it very easy...

Read more »

Read sas7bdat files in R with GGASoftware Parso library

September 12, 2014
By

... using the new R package sas7bdat.parso. The software company GGASoftware has extended the work of myself and others on the sas7bdat R package by developing a Java library...

Read more »