# Posts Tagged ‘ Data Analysis ’

## Processing Data from a Statistica Worksheet Using R

August 29, 2012
By

Context: I work with data from non-profit organizations, and so a big concern in many of my analyses is if and how much people are donating from one year to the next.  One of the  things I normally like to do … Continue reading →

## Sampling and the Analysis of Big Data

April 8, 2012
By

After my last post, I came across a few articles supporting the opinion that if you have a good reason to take random samples from a “big” dataset, you’re not committing some kind of sin: Big Data Blasphemy: Why Sample? … Continue reading →

## An unabashedly narcissistic data analysis of my own tweets. The…

April 2, 2012
By

pie( table( whence.i.tweet )) qplot( whence ) + coord_polar() pie( log( table( whence )))+RColorBrewer ggplot (see below) plot( density( tweets.len )) qplot(... stat="density") + geom_density qplot(...stat="bin") + geom_text(...) tweeple tweep...

## Interview with Kai Chew, CloudStat

December 31, 2011
By

Here is an interview with Kai Chew, Founder of Cloudstat. CloudStat is developing a cloud-based statistical platform to help researchers who want to make sense of data to do statistical analysis collaboratively with its high performance computing infra...

## CloudStat: Learn & Do R Language on the Cloud

November 19, 2011
By

Hi! My fellow useRs! I’m making a web-based R Language platform ( http://cloudst.at/ ) for my students. My aim is to decrease the learning curve of learning R and collaboration. With CloudStat, there is no more download, installation, update and mai...

## Interactive graphics for data analysis

September 1, 2011
By

I got a copy of Martin Theus and Simon Urbanek’s Interactive Graphics for Data Analysis a couple of years ago, whence it’s been sat on my bookshelf. Since I’ve recently become a self-proclaimed expert on interactive graphics I thought it was about time I read the thing. Which is exactly what I did last weekend

## 1st Data Analysis Contest Using R

Emilio Torres Manzanera has just announced the 1st Data Analysis Contest Using R: “Nestoria (http://www.nestoria.com/) is a specialized web search engine platform in house prices. Nestoria and Lokku Labs aim to improve the understanding of the public of the value of its databases. The company aims to engage a few brilliant statisticians in the expectation

## Software tools for data analysis – an overview

February 19, 2011
By

by Szilard Pafka Discussions on various software tools (C, C++, Perl, Python, Unix shell, R, Matlab, SAS, SPSS, Excel, databases, Hadoop etc.) used in data analysis. Szilard Pafka (founder and co-organizer of the Los Angeles R users group) presents an … Continue reading →

## Did what you write drive what I read?

September 16, 2010
By

GoogleReader allows you to track your activity, by representing the number of news items read and published by day and by hour. I use it quite a lot to stay up to date with the scientific literature (I subscribed to probably over 30 journals) and a bunch of other feeds. Stuff tend to accumulate faster

## A quick analysis of the trends in the number of weddings in France (1975–2010)

August 15, 2010
By

I’m currently planning my wedding, and my fiancée and I were discussing wether there were more or less couples getting married over time. It turns out that this information is quite easy to get via INSEE, a french institute that (…)Read the rest of this entry »