Monthly Archives: February 2011

Example 8.26: reading data with variable number of words in a field

February 22, 2011
By
Example 8.26: reading data with variable number of words in a field

A student came with a question about how to snag data from a PDF report for analysis. Once she'd copied things her text file looked like:1 Las Vegas, NV --- 53.3 --- --- 12 Sacramento, CA --- 42.3 --- --- 23 Miami, FL --- 41.8 --- --- 34 Tucson, AZ --...

Read more »

Multithreading in R (or other types of non-sequencial programming)

February 21, 2011
By
Multithreading in R (or other types of non-sequencial programming)

Considering forwarding tick data from MT4 to R requires less than 2ms, but that charting 4 different time-frame (1min, 15min, 30min and 1hour) at each tick-update may require more than 250ms (depending on the number of bars in history), I think it is f...

Read more »

Graphing – margins, titles, mtext, workspace

February 21, 2011
By
Graphing – margins, titles, mtext, workspace

This is a great post, very true, not enough of R’s graphics are well displayed online to really see how to achieve what the often ambiguous ‘help’ information suggests. http://research.stowers-institute.org/efg/R/Graphics/Basics/mar-oma/index.htm I particularly find “mtext(“lol”, outer=T)” to be particularly useful (requires “oma=c(2,2,2,2)” or similar). http://addictedtor.free.fr/graphiques/ This site is somewhat of the way there, but I’ve found

Read more »

Thor vs. Uncanny X-Men vs. Fantastic Four

February 21, 2011
By
Thor vs. Uncanny X-Men vs. Fantastic Four

Three of Marvel’s longest running comic book series’ are Thor, Uncanny X-Men, and Fantastic Four. Using data from 2010, I compare monthly comic book sales for each series. This data only pertains to monthly issues and not trade paperbacks. Furthermore, the series Amazing Spider Man was not considered because it was released twice a month.

Read more »

Use R to view and manipulate the File System

February 21, 2011
By

One of the best ways to learn how to code in R is to view sample scripts that people share. I recently came across this post where Michael uses R to scrape twitter and collect all sorts of great data … Continue reading →

Read more »

Dataset: Wisconsin Union Protester Tweets #wiunion

February 21, 2011
By
Dataset: Wisconsin Union Protester Tweets #wiunion

   I’ve been playing with Twitter data over the last week, archiving Algerian, Egyptian, Iranian, and Chinese tweets.  I thought I’d bring the story a little closer to home this time by archiving tweets from Wisconsin Union protesters on the … Continue reading →

Read more »

Interest Rates’ Influence on 1987

February 21, 2011
By
Interest Rates’ Influence on 1987

One aspect of 1987 that does not deserve enough attention is interest rates.  Higher interest rates constrain economic activity and compete with other investments.  As seen in the chart below, the US 10year Treasury rate climbed 40% from 7% t...

Read more »

Using R for Introductory Statistics, Chapter 5, hypergeometric distribution

February 21, 2011
By
Using R for Introductory Statistics, Chapter 5, hypergeometric distribution

This is a little digression from Chapter 5 of Using R for Introductory Statistics that led me to the hypergeometric distribution.Question 5.13 A sample of 100 people is drawn from a population of 600,000. If it is known that 40% of the population has a specific attribute, what is the probability that 35 or...

Read more »

Using R for Introductory Statistics, Chapter 5, hypergeometric distribution

February 21, 2011
By
Using R for Introductory Statistics, Chapter 5, hypergeometric distribution

This is a little digression from Chapter 5 of Using R for Introductory Statistics that led me to the hypergeometric distribution.Question 5.13 A sample of 100 people is drawn from a population of 600,000. If it is known that 40% of the population h...

Read more »

Who did HBGary contact the most?

February 21, 2011
By

Following on from Friday's post about the travails of internet security firm HBGary, R user Michael Bommarito has done an analysis of the leaked emails to find the top 20 most contacted email addresses and the top 20 most referenced internet domains. There are some interesting names on those lists, to be sure. Check them out at the link...

Read more »