Posts Tagged ‘Data Analysis’

Processing Data from a Statistica Worksheet Using R

August 29, 2012

Context: I work with data from non-profit organizations, so a big concern in many of my analyses is whether, and how much, people are donating from one year to the next. One of the things I normally like to do …

Sampling and the Analysis of Big Data

April 8, 2012

After my last post, I came across a few articles supporting the opinion that if you have a good reason to take random samples from a “big” dataset, you’re not committing some kind of sin: Big Data Blasphemy: Why Sample? …

An unabashedly narcissistic data analysis of my own tweets. The…

April 2, 2012

pie( table( whence.i.tweet )); qplot( whence ) + coord_polar(); pie( log( table( whence ))) + RColorBrewer; ggplot (see below); plot( density( tweets.len )); qplot(... stat="density") + geom_density; qplot(...stat="bin") + geom_text(...); tweeple tweep...

Interview with Kai Chew, CloudStat

December 31, 2011

Here is an interview with Kai Chew, founder of CloudStat. CloudStat is developing a cloud-based statistical platform that helps researchers who want to make sense of data do statistical analysis collaboratively with its high performance computing infra...

CloudStat: Learn & Do R Language on the Cloud

November 19, 2011

Hi, my fellow useRs! I’m making a web-based R language platform ( http://cloudst.at/ ) for my students. My aim is to reduce the learning curve for R and for collaboration. With CloudStat, there is no more download, installation, update and mai...

Interactive graphics for data analysis

September 1, 2011

I got a copy of Martin Theus and Simon Urbanek’s Interactive Graphics for Data Analysis a couple of years ago, and it has sat on my bookshelf ever since. Since I’ve recently become a self-proclaimed expert on interactive graphics, I thought it was about time I read the thing, which is exactly what I did last weekend

1st Data Analysis Contest Using R

Emilio Torres Manzanera has just announced the 1st Data Analysis Contest Using R: “Nestoria (http://www.nestoria.com/) is a specialized web search engine platform in house prices. Nestoria and Lokku Labs aim to improve the understanding of the public of the value of its databases. The company aims to engage a few brilliant statisticians in the expectation

Software tools for data analysis – an overview

February 19, 2011

By Szilard Pafka. Discussions of various software tools (C, C++, Perl, Python, Unix shell, R, Matlab, SAS, SPSS, Excel, databases, Hadoop etc.) used in data analysis. Szilard Pafka (founder and co-organizer of the Los Angeles R users group) presents an …

Did what you write drive what I read?

September 16, 2010

Google Reader allows you to track your activity by showing the number of news items read and published by day and by hour. I use it quite a lot to stay up to date with the scientific literature (I subscribe to probably over 30 journals) and a bunch of other feeds. Stuff tends to accumulate faster

A quick analysis of the trends in the number of weddings in France (1975–2010)

August 15, 2010

I’m currently planning my wedding, and my fiancée and I were discussing whether more or fewer couples are getting married over time. It turns out that this information is quite easy to get via INSEE, a French institute that (…)
