Twin Cities R User Group Meeting Tonight!

February 16, 2011

TCRUG will be having a meeting TONIGHT (2/16) at 5:30 PM. We will meet in ROOM 29 in Willey Hall. Willey Hall is located on the West Bank of the Minneapolis campus. See the Google map at http://goo.gl/tnRnU. Erik Iverson will be giving a talk ...

Read more »

Mapping London’s Population Change 1801-2030

February 16, 2011

Buried in the London Datastore are the population estimates for each of the London Boroughs between 2001 and 2030. They predict a declining population for most boroughs with the exception of a few to the east. I was surprised by this general decline and also by the numbers involved; I expected larger changes from one year to ...

Read more »

Regional Variation in Law Enforcement Deaths – Part A

February 15, 2011

In recent months, there has been a series of high profile incidents in the United States where police officers were killed. While such events are unfortunate, the data suggests that it is extremely rare for an officer to be harmed or killed while on duty. In this post, I examine whether there are significant regional

Read more »

Mixed models – Part 2: lme lmer

February 15, 2011

Getting more into mixed models, I’ve been playing around with both nlme::lme and lme4::lmer. The post at http://tolstoy.newcastle.edu.au/R/e2/help/06/10/3345.html explains the differences quite well; from what I gather, they are largely performance-based when using crossed or partially crossed models. In the models I am tinkering with at the moment I am noticing differences in

Read more »

Boxplots and Beyond III: Violin Plots

This post is the third in a series of four on boxplots and closely related data visualization techniques for comparing subsets of a dataset, or comparing different datasets that we hope or expect to be similarly distributed.  The previous two post...

Read more »

ABC in London

February 15, 2011

After the very exciting and I think quite successful ABC in Paris meeting two years ago, Michael Stumpf from Imperial College London suggested a second edition in London along the same lines. Michael kindly associated me with the planning of this meeting. It is (logically) called ABC in London (or ABCiL) and will take place

Read more »

Statistical Graphics – Edward Tufte

February 15, 2011

The work of Edward Tufte is worth reading if you are interested in designing meaningful graphs and removing chart junk from your displays.

Read more »

Example 8.25: more latent class models (plus a graphical display)

February 15, 2011

In recent entries (here, here, here and here), we've been fitting a series of latent class models using SAS and R. One of the most commonly used and powerful packages for latent class model estimation is Mplus. In this entry, we demonstrate how to use...

Read more »

Rcpp 0.9.1

February 15, 2011

A new release 0.9.1 of Rcpp went to CRAN and Debian yesterday. This version contains mostly bug-fixes and rather few enhancements. The changes are mostly 'internal fixes' and not user-facing; they mostly address some issues in memory management w...

Read more »

R 2.12.2 scheduled for February 25

February 15, 2011

The next release of R is scheduled for February 25, and R 2.12.2 will likely be the final bug-fix release of the 2.12 series before R 2.13 is released in April. According to the NEWS file in the latest daily build, 2.12.2 will improve complex-arithmetic support on some rare platforms that don't support complex types in C99, and...

Read more »

Reaching 1000

February 14, 2011

This is the 1000th post on the ‘Og! Here are the entries that have had above 1000 views (not viewers) so far: In{s}a(ne)!! (5,353); “simply start over and build something better” (4,345); Julien on R shortcomings (1,966); Sudoku via simulated annealing (1,762); Of black swans and bleak prospects (1,462); Do we need an integrated Bayesian/likelihood

Read more »

Extracting all Crime Data for England and Wales using R and MYSQL

February 14, 2011

Last week I started creating some data extraction code for the new England and Wales crime maps website using the R software / language. Although there is an API, a more efficient way of accessing all of the data (and without causing stress to their API server) is to download the CSV files located here

Read more »

R-commander installation in openSUSE

February 14, 2011

Thanks to this post I was able to install R-commander in openSUSE. I've modified the recipe a bit and don't want to search for it next time. You have to perform several steps: Install R-base and R-base-devel packages from here. Install gfortran :~&...

Read more »

Modern Science and the Bayesian-Frequentist Controversy

February 14, 2011

The Bayesian-Frequentist debate reflects two different attitudes to the process of doing science, both quite legitimate. Bayesian statistics is well-suited to individual researchers, or a research group, trying to use all the information at its disposal to make the quickest possible progress. In pursuing progress, Bayesians tend to be aggressive and optimistic with their modeling

Read more »

Stack Exchange: Quantitative Finance in public beta

February 14, 2011

The Quantitative Finance Stack Exchange community entered public beta last week.  To quote the FAQ: The Quantitative Finance Stack Exchange is intended specifically for professionals and traders working in investment banking, and aca...

Read more »

OkCupid: Finding your Valentine with R

February 14, 2011

Free dating site OkCupid (which was recently acquired by match.com) collects a lot of data. With over 3 million members, many of whom have provided extensive information about their personal details including preferences, lifestyle, sexuality and hobbies via their dating profiles, they have a wealth of information upon which to identify trends about the love lives of a typical...

Read more »

R Tutorial Series: One-Way Repeated Measures ANOVA

February 14, 2011

Repeated measures data require a different analysis procedure than our typical one-way ANOVA and subsequently follow a different R process. This tutorial will demonstrate how to conduct one-way repeated measures ANOVA in R using the Anova(mod, idata, i...

Read more »

The Most Romantic Electro-Grunge Statistical Computing Song Ever Made

February 14, 2011

Warning message: This song contains highly suggestive coefficients and graphic depictions of exuberant R-core lovin’. “Plotting Ihaka” is based on Rotting Piñata by Sponge, and reflects a small measure of my boundless joy in the world of R. Despite being a firm proponent of muffins, I can confidently say that I would rather live in

Read more »

Transferring MT4 quotes in R

February 13, 2011

The more I get into this, the more I adjust my objectives. Currently, I want to make the most of the Quantmod and Blotter R packages, while eventually building an MT4 equivalent of the InteractiveBroker-R_API. As a first step, I wanted to chart ticks from MT4 in R using Quantmod’s chartSeries(). There are different

Read more »

Where’s your high-end sportswear made?

February 13, 2011

Sierra Trading Post mailed me their End-of-Winter 2011 catalog, which consists mostly of mid- to high-end sportswear products, but also things like snow skis, flashlights, and binoculars. STP sells closeouts, overstock, seconds, and items with cosmetic blemishes or irregularities. I spent an hour of relaxation time enumerating the various product brands and their places of

Read more »

R, the DJIA, and M1 Money Multiplier (MULT)

February 13, 2011

Description: My explorations with FRED (Federal Reserve Economic Data, the St. Louis Fed's database) in R have yielded some interesting plots, charts and graphs. And some questions… Charts with Explanations: Yes, I know, the axes are a mess, and the site’s style sheet doesn’t play well with the chart image. I’ll look into that “pretty-ing up” stuff...

Read more »

Another Bernoulli factory

February 13, 2011

The paper “Exact sampling for intractable probability distributions via a Bernoulli factory” by James Flegal and Radu Herbei got posted on arXiv without me noticing, presumably because it came out just between Larry Brown’s conference in Philadelphia and my skiing vacations! I became aware of it only yesterday and find it quite interesting in that

Read more »

Visualize NHL Play-by-Play using Tableau Public and R

February 13, 2011

Nothing like a little Sunday morning data hacking before a big game!  I have been wanting to play with the NHL play-by-play event files for some time now.  The JSON datasets provide a wealth of information about each event in the game including the location, as defined by the fields xcoord and ycoord. I am

Read more »

The cranky guide to trying R packages

February 13, 2011

This is a tutorial on how to try out a new package in R. The summary is: expect errors, search out errors and don’t start with the built-in examples or real data. Suppose you want to try out a novel statistical technique. A good fraction of the time R is your best bet for ...

Read more »

What is the optimal strategy to marry the best one?

February 13, 2011

Valentine's day is a nice opportunity to post on hot and sexy topics... Well, it's also an important day that I should not miss, probably as much as Saint Patrick's my wife's birthday. And as I mentioned last week (here), it is difficult to get the...

Read more »

sab-R-metrics: Displaying Line Plots and Time Series Data

February 13, 2011

It's been a while since I've had the chance to add anything here, but last time I left everyone with some scatter plots and some customization tools for your graphics. This week will be a little more brief than the last few tutorials and what I'd like to do is show you how to display line graphs for...

Read more »