## Color Schemes for R Bar Plots

March 1, 2009
By

A recurrent source of irritation for me is the absence of a good default behavior in R for choosing the color scheme for bar plots. A stacked bar plot looks only as good as the color scheme you use. In hope of finding a usable scheme that I could settl...

## Your flight is moving …

March 1, 2009
By

THE VALUE OF NOT FOLLOWING INSTRUCTIONS As Shane Frederick has noted, if you say “A bat and a ball cost $1.10. The bat costs$1 more than the ball. How much is the ball?”, you will notice that the vast majority of your friends will say “10 cents” instead of the correct “5 cents”, because

## Rcpp 0.6.4

March 1, 2009
By

A new maintenance version of Rcpp (now at 0.6.4) was just pushed to CRAN and has been uploaded to Debian. Rcpp is a set of utility classes that provide interfaces for transferring the major R data types to C++ and back which makes it easier to extend R with dynamically loadable code written in C or C++. This version changes how use...

## What is R?

March 1, 2009
By

Highlights R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS.   If you wish to download R, please choose your preferred CRAN mirror. Basic questions about R like how to download and install the software, or what the license terms are, are answered

## What is R?

March 1, 2009
By

Highlights R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS.   If you wish to download R, please choose your preferred CRAN mirror. Basic questions ...

## Project Euler Problem #22

March 1, 2009
By

Problem 22 on Project Euler proves a text file containing a large number of comma-delimited names and asks us to calculate the numeric sum of the alphabetical score for each name multiplied by the name’s position in the original list. This is mad...

## Plotting PDQ Output with R

February 27, 2009
By

One the nice things about PDQ-R (coming in release 5.0) is the ability to plot PDQ output directly in R. Here's a PDQ-R script, together with the corresponding graphical output, that I knocked up to show the effect on the throughput curve of adding mor...

## R in The Windy City

February 27, 2009
By

In honor of me moving to Chicago, the powers who abide have decided to hold the first annual “R/Finance conference for applied finance using R” conference in Chicago this year. The dates are April 24-25, 2009. R/Finance 2009: Applied Finance with R To those who made the decision on location, I’m pleased but slightly embarrassed that you

## Data Analysis Workflow… Part 1 of Infinity

February 26, 2009
By

One of the many things that I sit around pondering when I should be doing productive things is the idea of analytical workflow. I have only worked with one analytical guru who I felt really gave thought and structure to workflow and its impact on analyist productivity. When I talk about workflow I mean the

## Review of ‘Applied Econometrics in R’ in JSS

February 25, 2009
By

A short review of Kleiber and Zeileis' excellent Applied Econometrics with R is now out at the (online) Journal of Statistical Software.

## Absolutely great resource

February 25, 2009
By

A very nice resource which helped me a lot in kickstarting my Sweave efforts is Learning to Sweave in APA Style by clementi on scribd. This is a down to earth tutorial containing a lot of good tips! See for yourself:Learning to Sweave in APA Style ...

## Absolutely great resource

February 25, 2009
By

A very nice resource which helped me a lot in kickstarting my Sweave efforts is Learning to Sweave in APA Style by clementi on scribd. This is a down to earth tutorial containing a lot of good tips! See for yourself:Learning to Sweave in APA Style ...

## R/Finance conference in Chicago in April: Registration now open

February 23, 2009
By

Regarding the aforementioned R/Finance conference that will take place at the end of April here in Chicago, we announced earlier today that the conference website is now available. It provides information about the program, speakers and other details as well as a link to registration details. See you in Chicago in April!

## Sorry, you said you want a stats revolution?

February 23, 2009
By

ALL ABOUT REVOLUTION COMPUTING’S R DISTRIBUTION Decision Science News was intrigued by a company called REvolution Computing that got some attention of late for spinning their own mix of the R language for statistical computing and giving it away for free. So DSN asked to interview them to see what it’s all about Decision Science

## Project Euler Problem #15

February 22, 2009
By

Problem 15 on Project Euler asks us to find the number of distinct routes between the top left and bottom right corners in a 20×20 grid, with no backtracking allowed. I originally saw this type of problem tackled in the book Notes On Introductory ...

## PDQ-R Lives!

February 22, 2009
By

After some fiddling to get things linked correctly to the R binaries on my new Macbook, the first PDQ-R test model has run successfully! Here 'tiz ...This is an important step for PDQ development and is due entirely to the efforts of Phil Feller. Natur...

## R graphics: margins are way to large

February 22, 2009
By

For me R has a very nice and powerfull capabilities for graphics (for example see this gallery). However, I dislike the default setting for margins and placement of axis numbers and labels. Since I always forget the setting of parameters I prefer I am adding this post. For example:library(package="MASS")Sigma mu tmp plot(tmp, xlab="X variable (unit)", ylab="Y variable...

## R graphics: margins are way to large

February 22, 2009
By

For me R has a very nice and powerfull capabilities for graphics (for example see this gallery). However, I dislike the default setting for margins and placement of axis numbers and labels. Since I always forget the setting of parameters I prefer I am adding this post. For example: library(package="MASS")Sigma mu tmp plot(tmp, xlab="X variable (unit)", ylab="Y variable...

## Illinois long-term selection experiment for oil and protein in corn

February 22, 2009
By

Researchers at the University of Illinois are conducting one of the longest experiments in biology - Illinois long-term selection experiment for oil and protein in corn. The experiment started in 1896 and is still active! In esence they are selecting l...

## Project Euler Problem #13

February 21, 2009
By

Problem 13 on Project Euler asks us to sum 100 50-digit numbers and give the first 10 digits of the result. This is pretty easy. Note we are using R’s integer division operator %/% to discard the remainder of the large summed integer and just giv...

## Project Euler Problem #14

February 21, 2009
By

Problem 14 on the Project Euler site asks us to find the longest chain under 1 million created using the Collatz mapping. This is fairly straightforward, although performance again is not great: ## Problem 14 # Collatz conjecture problem14 <-&...

## Project Euler Problem #12

February 21, 2009
By

Problem 12 on the Project Euler site asks: What is the value of the first triangle number to have over five hundred divisors? A triangular number T(n) is defined as . The R code below consists of a solution, which involves the fact that the number of proper divisors of an integer n can be

## How Facebook and Google use R

February 21, 2009
By

How Facebook and Google use R Interesting read.  rpart comes in handy again.

## Registration for R/Finance 2009 is Open!

February 20, 2009
By

The conference website has details on:the agenda and speakers,travel accommodations,registration, andsponsors, who made the conference possible.Hope to see you there!

## People who love scatter plots & connecting dots

February 20, 2009
By

We hosted the first Dataviz Salon SF on Tuesday night, with lightning talks by boredom cop Shane Booth, dataviz wiz Lee Byron , computational journalist Brad Stenger, data wrangler Pete Skomoroch , and any/all data enthusiast Brendan O’Connor . I was going to blog all about it — but Tom Carden of Stamen Design already

## How Google and Facebook are using R

February 19, 2009
By

(March 26th Update: Video now available) Last night, I moderated our Bay Area R Users Group kick-off event with a panel discussion entitled “The R and Science of Predictive Analytics”, co-located with the Predictive Analytics World conference here in SF. The panel comprised of four recognized R users from industry: Bo Cowgill, Google Itamar Rosenn,

## R: Good practice – adding footnotes to graphics

February 17, 2009
By

In some statistical programs there is the option available to attach a footnote to the graphical output that is created. This footnote may contain the name of the script or the file that produced the graphic, the author’s name and the date of creation. In SAS for example there is a footnote command to achieve

## Pearson vs. Spearman Correlation Coefficients

February 17, 2009
By

One of the misuses of statistical terminology that annoys me most is the use of the word “correlation” to describe any variable that increases as another variable increases. This monotonic trend seems worth looking for, but it plainly is not what m...