Monthly Archives: May 2012

Are scatterplots too complex for lay folks?

May 23, 2012

Usually, I like to write about the solutions to problems I’ve had, but today I only have a problem to write about. This is the second research job I’ve had outside of academia, and in both cases I’ve met with …

Read more »

My On-Job Training Analytics

May 22, 2012

I have been working at the Provincial Statistics Office of Tawi-Tawi (Philippines) as part of my OJT (On-Job Training). One of the requirements of the training is at least 80 hours of service, so I decided to work from April 19 to M...

Read more »

My new forecasting textbook

May 22, 2012

After years of saying that I was going to write a book to replace Makridakis, Wheelwright and Hyndman (1998), I’m finally ready to make an announcement! My new book is Forecasting: principles and practice, co-authored with George Athanasopoulos. It is available online and free-of-charge. We have written about 2/3 of the book so far (all of which is already...

Read more »

The grade level of Congress speeches, analyzed with R

May 22, 2012

As widely reported by CNN, the Huffington Post, and Talking Points Memo, the sophistication of speeches by US politicians has declined in recent years, dropping from an 11th-grade level in 2005 to a 10th-grade level today. The reports draw on an analysis by the Sunlight Foundation, based on textual analysis of congressional speeches given since 1996 provided by the...

Read more »

knitr Performance Report–Attempt 3

May 22, 2012

Please see knitr Performance Report-Attempt 2 and knitr Performance Report-Attempt 1. Since the time of my last reporting post, RStudio, knitr, and Sweave have worked extremely hard to make document creation easier by becoming even more streamlined and ...

Read more »

read.odbc.ffdf & read.dbi.ffdf for fetching large corporate SQL data

If you are into large data, but not the enormously big data everyone is talking about, and you are tired of looking for a way to get tens of millions of records into R without running into RAM issues, a look at the packages ff, ffbase and ETLUtils might be the solution to your...

Read more »

Sunlight Foundation analyses complexity of congressional speech

May 22, 2012

Read more »

A complete Bayesian model for sensory profiling data

May 22, 2012

In this post I will try to add an important part to the sensory profiling model I have been building. This concerns the question: 'Are all panelists equally reproducible?' Obviously the answer is no; some are better than others. From this observation...

Read more »

Adding watermarks to plots

May 22, 2012

A question was raised today on the mailing list: Is there an easy way to add a watermark to a ggplot? There are several options, depending on the type of watermark and the required level of control over the output: add a text label using annotate (th...

Read more »

Correlations and positive-definiteness

May 22, 2012

On the way to another destination, I found some curious behavior with average correlations. The data: daily log returns from almost all of the constituents of the S&P 500 for the years 2006 through 2011. The behavior: Figure 1 shows the actual mean correlation among stocks for the set of years and the mean correlation with …

Read more »