# Data Analysis

### Processing Data from a Statistica Worksheet Using R

August 29, 2012 |

Context: I work with data from non-profit organizations, and so a big concern in many of my analyses is if and how much people are donating from one year to the next.  One of the  things I normally like to do … Continue reading → [Read more...]

### Sampling and the Analysis of Big Data

April 8, 2012 |

After my last post, I came across a few articles supporting the opinion that if you have a good reason to take random samples from a “big” dataset, you’re not committing some kind of sin: Big Data Blasphemy: Why Sample? … Continue reading → [Read more...]

### An unabashedly narcissistic data analysis of my own tweets. The…

April 2, 2012 |

pie( table( whence.i.tweet )) qplot( whence ) + coord_polar() pie( log( table( whence )))+RColorBrewer ggplot (see below) plot( density( tweets.len )) qplot(... stat="density") + geom_density qplot(...stat="bin") + geom_text(...) tweeple tweep... [Read more...]

### Interview with Kai Chew, CloudStat

December 31, 2011 |

Here is an interview with Kai Chew, Founder of Cloudstat. CloudStat is developing a cloud-based statistical platform to help researchers who want to make sense of data to do statistical analysis collaboratively with its high performance computing infra... [Read more...]

### CloudStat: Learn & Do R Language on the Cloud

November 19, 2011 |

Hi! My fellow useRs! I’m making a web-based R Language platform ( http://cloudst.at/ ) for my students. My aim is to decrease the learning curve of learning R and collaboration. With CloudStat, there is no more download, installation, update and mai... [Read more...]

### Interactive graphics for data analysis

September 1, 2011 |

I got a copy of Martin Theus and Simon Urbanek’s Interactive Graphics for Data Analysis a couple of years ago, whence it’s been sat on my bookshelf. Since I’ve recently become a self-proclaimed expert on interactive graphics I thought it was about time I read the thing. ... [Read more...]

### 1st Data Analysis Contest Using R

July 18, 2011 |

Emilio Torres Manzanera has just announced the 1st Data Analysis Contest Using R: “Nestoria (http://www.nestoria.com/) is a specialized web search engine platform in house prices. Nestoria and Lokku Labs aim to improve the understanding of the public of the value of its databases. The company aims to ... [Read more...]

### Software tools for data analysis – an overview

February 19, 2011 |

by Szilard Pafka Discussions on various software tools (C, C++, Perl, Python, Unix shell, R, Matlab, SAS, SPSS, Excel, databases, Hadoop etc.) used in data analysis. Szilard Pafka (founder and co-organizer of the Los Angeles R users group) presents an … Continue reading →

### Did what you write drive what I read?

September 16, 2010 |

GoogleReader allows you to track your activity, by representing the number of news items read and published by day and by hour. I use it quite a lot to stay up to date with the scientific literature (I subscribed to probably over 30 journals) and a bunch of other feeds. Stuff ... [Read more...]

### A quick analysis of the trends in the number of weddings in France (1975–2010)

August 15, 2010 |

I’m currently planning my wedding, and my fiancée and I were discussing wether there were more or less couples getting married over time. It turns out that this information is quite easy to get via INSEE, a french institute that (…)Read the rest of this entry » [Read more...]

### extrapolation and interpolation The most important lesson I…

May 25, 2010 |

extrapolation and interpolation The most important lesson I learned from this book:  regression is reliable for interpolation, but not for extrapolation.  Even further, your observations really need to cover the whole gamut of causal variables, inter... [Read more...]

### Exporting R output to MS-Word with R2wd (an example session)

May 6, 2010 |

Creating reports is one of the basic tasks in data analysis. R provides numerous functions and packages to export it’s (beautiful) output and help compile it into a report. In this post I will present one such (basic) solution for Windows OS users for exporting R output into Microsoft ... [Read more...]

### A Visual History Of Twitter’s Growth (Updated 2010-08-23)

March 22, 2010 |

Download "Getting Started with the Social Media Analytics Research Toolkit" (pdf, 1.25 megabytes) Download the Social Media Analytics Research Toolkit How The Chart Was Made Whenever a Twitter user posts a tweet, an object is created and entered into t... [Read more...]