523 search results for "register"

Relearn boxplot and label the outliers

February 5, 2013
By
Relearn boxplot and label the outliers

Despite the fact that box plot is used almost every where and taught at undergraduate statistic classes, I recently had to re-learn the box plot in order to know how to label the outliers.This stackoverflow post was where I found how...

Read more »

Generating Labels for Supervised Text Classification using CAT and R

February 4, 2013
By
Generating Labels for Supervised Text Classification using CAT and R

The explosion in the availability of text has opened new opportunities to exploit text as data for research. As Justin Grimmer and Brandon Stewart discuss in the above paper, there are a number of approaches to reducing human text to … Continue reading →

Read more »

A package for agricultural statistic: FAOSTAT

February 3, 2013
By

After 8 years of using R, today I finally become a contributor to the community and released my first package, FAOSTAT.The package is designed to provide user with direct access to the FAOSTAT data base via R and to support the...

Read more »

Maize trade Part II: Comparison and analysis

February 3, 2013
By
Maize trade Part II: Comparison and analysis

Following my last post about the maize network, although interesting but is not very informative. What we are going to do today is to contrast the maize network with the wine trade network.The choice why we have chose wine will become clear after the...

Read more »

Taking Expectations to the Next Level

January 31, 2013
By
Taking Expectations to the Next Level

Higher Expectations I came across this post on Thursday and found it to be quite interesting. Clearly rental prices vary according to where you live. That isn't too surprising. I started thinking a bit more about it and thought that Boston and the nearby communities would have to...

Read more »

Maximize Your Expectations!

January 30, 2013
By
Maximize Your Expectations!

A Problem A major problem in secondary data analysis is that you didn't get to decide what data was collected. Lets say you were interested in how many times a student has read the Twilight books). Specifically, you want to know how effective the ads for...

Read more »

The "golden age" of a football player

January 28, 2013
By
The "golden age" of a football player

It's been some time since my last post on football. And we're talking about european soccer here. So I finally managed to write some functions which allow me to extract player stats from www.transfermarkt.de. The site tracks lots of stats in the world of soccer. For each player, there is information about the dominant foot, height, age, the estimated...

Read more »

R and foreign characters

January 25, 2013
By
R and foreign characters

Working with Russian characters can be mind-numbingly frustrating. This is true for R, as for other applications, so below I've written out the my top five tricks for making Russian inputs work in R; i believe they should be transferable to most other languages....

Read more »

Revolution Newsletter: January 2013

January 23, 2013
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full January edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Top Innovator, Big Data Technologies. Revolution Analytics is the proud recipient of the Top...

Read more »

Going Beyond Florence Nightingale’s Data Diagram: Did Flo Blow It with Wedges?

January 23, 2013
By
Going Beyond Florence Nightingale’s Data Diagram: Did Flo Blow It with Wedges?

In 2010, I wrote a short blog item about Florence Nightingale the statistician, solely because of its novelty value. I didn't even bother to look closely at the associated graphic she designed, but that's what I intend to do here. In this first installment, I reflect on her famous data visualization by reconstructing it...

Read more »