If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digitial monkeys have read Dracula

September 15, 2014
By
If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digitial monkeys have read Dracula

On the weekend, randomly after watching Catching Fire, I remember the problem of the typing monkeys (Infinite monkey theorem) in which basically could be defined as (Thanks to Wiki):# *******************#  INTRODUCTION# *******************The infi...

Read more »

Using Reddit’s JSON API to analyze post popularity

September 15, 2014
By
Using Reddit’s JSON API to analyze post popularity

Graduate student Clay McLeod decided to find out what makes a post on the social-sharing site Reddit popular. These are the questions he seeks to answer: What’s in a...

Read more »

Creating a map showing land covered by rising sea levels

September 15, 2014
By

I joined the Geekli.st climate Hackathon this weekend at the Hub Westminster (my favorite venue for Hackathons). While the organizers had lots of enthusiasm they had very little in...

Read more »

Mapping every IPv4 address

September 15, 2014
By
Mapping every IPv4 address

During July I was working with a commercial data source that provides extra data around IP addresses and it dawned on me: rather than pinging billions of IP addresses and...

Read more »

PCA / EOF for data with missing values – a comparison of accuracy

September 15, 2014
By
PCA / EOF for data with missing values – a comparison of accuracy

Not all Principal Component Analysis (PCA) (also called Empirical Orthogonal Function analysis, EOF) approaches are equal when it comes to dealing with a data...

Read more »

How do you say π^π^π?

September 15, 2014
By
How do you say π^π^π?

Well, not that you really probably want to know how to say such an absurdly large number. However for those of you who are interested (allowing for rounding) it...

Read more »

One datavis for you, ten for me

September 14, 2014
By
One datavis for you, ten for me

Over the years of my graduate studies I made a lot of plots. I mean tonnes. To get an extremely conservative estimate I grep’ed for every instance of “plot(”...

Read more »

Trying a prefmap

September 14, 2014
By
Trying a prefmap

Preference mapping is a key technique in sensory and consumer research. It links the sensory perception on products to the liking of products and hence provides clues to the...

Read more »

RDataMining Slides Series

September 14, 2014
By
RDataMining Slides Series

by Yanchang Zhao, RDataMining.com I have made a series of slides on R and data mining, based on my book titled R and Data Mining — Examples and Case...

Read more »

Newcastle R course, a write-up

September 13, 2014
By

I recently attended a week-long R course in Newcastle, taught by Colin Gillespie. It went from “An Introduction to R” to “Advanced Graphics” via a day each...

Read more »

The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

September 13, 2014
By
The Ecology of Data Matrices: A Metaphor for Simultaneous Clustering

"...a metaphor is an affair between a predicate with a past and an object that yields while protesting." Nelson Goodman (1976)It is, as if, data matrices were alive. The rows...

Read more »

Google uses R to calculate ROI on advertising campaigns

September 12, 2014
By
Google uses R to calculate ROI on advertising campaigns

Google has just released a new package for R: CausalImpact. Amongst many other things, this package allows Google to resolve the classical conundrum: how can we asses the impact...

Read more »

R: k-Means Clustering on an Image

September 12, 2014
By
R: k-Means Clustering on an Image

Enough with the theory we recently published, let's take a break and have fun on the application of Statistics used in Data Mining and Machine Learning, the k-Means Clustering.k-means...

Read more »

Conor Atom, a book for “children scientists” (an indiegogo campaign)

September 12, 2014
By
Conor Atom, a book for “children scientists” (an indiegogo campaign)

Mario Morales –a Colombian-American, Statistician-Bioinformatician, Member of the R community and a regular attendant of the UseR conference since 2007 has launched a book for Children called “Conor Atom, The child...

Read more »

Embedding RData files in Rmarkdown files for more reproducible analyses

September 12, 2014
By

For those of us interested in reproducible analysis, Rmarkdown is a great way of communicating our code to other researchers. Rstudio, in particular, makes it very easy...

Read more »

Read sas7bdat files in R with GGASoftware Parso library

September 12, 2014
By

... using the new R package sas7bdat.parso. The software company GGASoftware has extended the work of myself and others on the sas7bdat R package by developing a Java library...

Read more »

Princess Jasmine’s Trick

September 12, 2014
By
Princess Jasmine’s Trick

I’m history! No, I’m mythology! Nah, I don’t care what I am; I’m free hee! (Genie, when he is released from the magical oil lamp by Aladdin) A long...

Read more »

Using colorized PNG pictograms in R base plots

September 12, 2014
By
Using colorized PNG pictograms in R base plots

Today I stumbled across a figure in an explanation on multiple factor analysis which contained pictograms.   Figure 1 from Abdi & Valentin (2007), p. 8. I wanted to...

Read more »

shinyStore – Persistent Client-Side Storage in Shiny

September 11, 2014
By

We’re thrilled to announce the availability of shinyStore, an R package that enables HTML5 Web Storage from Shiny, an interactive web application framework for R. A live demo of...

Read more »

UVA / Charlottesville R Meetup

September 11, 2014
By
UVA / Charlottesville R Meetup

TL;DR? We started an R Users group, awesome community, huge turnout at first meeting, lots of potential.---I've sat through many hours of meetings where faculty lament the fact that...

Read more »

Martin Maechler Invited Talk at useR! 2014 – Good Practices in R Programming

September 11, 2014
By

Martin Maechler is a member of R-Core. This distinction puts him in the very apex...

Read more »

What makes a good academic conference?

September 11, 2014
By
What makes a good academic conference?

What makes a good academic conference? Here's what we like. The post What makes a good academic conference? appeared first on Decision Science News.

Read more »

Save your simulation study seeds

September 11, 2014
By
Save your simulation study seeds

Here in the Northern hemisphere, gardeners are gathering seeds from their prize-winning vegetables are storing them away for next year’s crop. Today at the 20th London Stata Users’ Group...

Read more »

pkgKitten 0.1.2: Still creating R Packages that purr

September 11, 2014
By

A brown bag release 0.1.2 of pkgKitten is now on CRAN, following yesterday's 0.1.1 upload Next time I'll try to remember that when I have parameters name and path,...

Read more »

R at Conferences this Fall

September 11, 2014
By

by Joseph Rickert The days are getting shorter here in California and the summer R conferences UseR!2014 and JSM are behind us, but there are still some very fine...

Read more »

R User Group in Birmingham, AL

September 11, 2014
By

If Birmingham, UK has one, then Birmingham, AL, USA should too. There is a big gaping hole in R user groups between Georgia and Texas that I think needs...

Read more »

“Probabilizing” uncertainty in the Brazilian Presidential Election

September 11, 2014
By
“Probabilizing” uncertainty in the Brazilian Presidential Election

The following figure shows the probability distributions of vote intentions for the main candidates after distributing the stock of undecided voters. As Marina (PSB) is getting back to her...

Read more »

Generalized Double Pareto Priors for Regression

September 10, 2014
By
Generalized Double Pareto Priors for Regression

This post is a review of the “GENERALIZED DOUBLE PARETO SHRINKAGE” Statistica Sinica (2012) paper by Armagan, Dunson and Lee. Consider the regression model (Y=Xbeta+varepsilon) where we put a...

Read more »

Visualizing Website Pathing With Sankey Charts

September 10, 2014
By
Visualizing Website Pathing With Sankey Charts

In my prior post on visualizing website structure using network graphs, I referenced that network graphs showed the pairwise relationships between two pages (in a bi-directional manner). However, if...

Read more »