Color Schemes for R Bar Plots

March 1, 2009
By
Color Schemes for R Bar Plots

A recurrent source of irritation for me is the absence of a good default behavior in R for choosing the color scheme for bar plots. A stacked bar plot looks only as good as the color scheme you use. In hope of finding a usable scheme that I could settl...

Read more »

Your flight is moving …

March 1, 2009
By
Your flight is moving …

THE VALUE OF NOT FOLLOWING INSTRUCTIONS As Shane Frederick has noted, if you say “A bat and a ball cost $1.10. The bat costs $1 more than the ball. How much is the ball?”, you will notice that the vast majority of your friends will say “10 cents” instead of the correct “5 cents”, because

Read more »

Rcpp 0.6.4

March 1, 2009
By

A new maintenance version of Rcpp (now at 0.6.4) was just pushed to CRAN and has been uploaded to Debian. Rcpp is a set of utility classes that provide interfaces for transferring the major R data types to C++ and back which makes it easier to extend R with dynamically loadable code written in C or C++. This version changes how use...

Read more »

What is R?

March 1, 2009
By

Highlights R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS.   If you wish to download R, please choose your preferred CRAN mirror. Basic questions about R like how to download and install the software, or what the license terms are, are answered

Read more »

What is R?

March 1, 2009
By
What is R?

Highlights R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS.   If you wish to download R, please choose your preferred CRAN mirror. Basic questions ...

Read more »

Project Euler Problem #22

March 1, 2009
By

Problem 22 on Project Euler proves a text file containing a large number of comma-delimited names and asks us to calculate the numeric sum of the alphabetical score for each name multiplied by the name’s position in the original list. This is mad...

Read more »

Plotting PDQ Output with R

February 27, 2009
By
Plotting PDQ Output with R

One the nice things about PDQ-R (coming in release 5.0) is the ability to plot PDQ output directly in R. Here's a PDQ-R script, together with the corresponding graphical output, that I knocked up to show the effect on the throughput curve of adding mor...

Read more »

R in The Windy City

February 27, 2009
By

In honor of me moving to Chicago, the powers who abide have decided to hold the first annual “R/Finance conference for applied finance using R” conference in Chicago this year. The dates are April 24-25, 2009. R/Finance 2009: Applied Finance with R To those who made the decision on location, I’m pleased but slightly embarrassed that you

Read more »

Data Analysis Workflow… Part 1 of Infinity

February 26, 2009
By

One of the many things that I sit around pondering when I should be doing productive things is the idea of analytical workflow. I have only worked with one analytical guru who I felt really gave thought and structure to workflow and its impact on analyist productivity. When I talk about workflow I mean the

Read more »

Review of ‘Applied Econometrics in R’ in JSS

February 25, 2009
By

A short review of Kleiber and Zeileis' excellent Applied Econometrics with R is now out at the (online) Journal of Statistical Software.

Read more »

Absolutely great resource

February 25, 2009
By

A very nice resource which helped me a lot in kickstarting my Sweave efforts is Learning to Sweave in APA Style by clementi on scribd. This is a down to earth tutorial containing a lot of good tips! See for yourself:Learning to Sweave in APA Style ...

Read more »

Absolutely great resource

February 25, 2009
By

A very nice resource which helped me a lot in kickstarting my Sweave efforts is Learning to Sweave in APA Style by clementi on scribd. This is a down to earth tutorial containing a lot of good tips! See for yourself:Learning to Sweave in APA Style ...

Read more »

R/Finance conference in Chicago in April: Registration now open

February 23, 2009
By

Regarding the aforementioned R/Finance conference that will take place at the end of April here in Chicago, we announced earlier today that the conference website is now available. It provides information about the program, speakers and other details as well as a link to registration details. See you in Chicago in April!

Read more »

Sorry, you said you want a stats revolution?

February 23, 2009
By
Sorry, you said you want a stats revolution?

ALL ABOUT REVOLUTION COMPUTING’S R DISTRIBUTION Decision Science News was intrigued by a company called REvolution Computing that got some attention of late for spinning their own mix of the R language for statistical computing and giving it away for free. So DSN asked to interview them to see what it’s all about Decision Science

Read more »

Project Euler Problem #15

February 22, 2009
By

Problem 15 on Project Euler asks us to find the number of distinct routes between the top left and bottom right corners in a 20×20 grid, with no backtracking allowed. I originally saw this type of problem tackled in the book Notes On Introductory ...

Read more »

PDQ-R Lives!

February 22, 2009
By
PDQ-R Lives!

After some fiddling to get things linked correctly to the R binaries on my new Macbook, the first PDQ-R test model has run successfully! Here 'tiz ...This is an important step for PDQ development and is due entirely to the efforts of Phil Feller. Natur...

Read more »

R graphics: margins are way to large

February 22, 2009
By
R graphics: margins are way to large

For me R has a very nice and powerfull capabilities for graphics (for example see this gallery). However, I dislike the default setting for margins and placement of axis numbers and labels. Since I always forget the setting of parameters I prefer I am adding this post. For example:library(package="MASS")Sigma mu tmp plot(tmp, xlab="X variable (unit)", ylab="Y variable...

Read more »

R graphics: margins are way to large

February 22, 2009
By
R graphics: margins are way to large

For me R has a very nice and powerfull capabilities for graphics (for example see this gallery). However, I dislike the default setting for margins and placement of axis numbers and labels. Since I always forget the setting of parameters I prefer I am adding this post. For example: library(package="MASS")Sigma mu tmp plot(tmp, xlab="X variable (unit)", ylab="Y variable...

Read more »

Illinois long-term selection experiment for oil and protein in corn

February 22, 2009
By
Illinois long-term selection experiment for oil and protein in corn

Researchers at the University of Illinois are conducting one of the longest experiments in biology - Illinois long-term selection experiment for oil and protein in corn. The experiment started in 1896 and is still active! In esence they are selecting l...

Read more »

Project Euler Problem #13

February 21, 2009
By

Problem 13 on Project Euler asks us to sum 100 50-digit numbers and give the first 10 digits of the result. This is pretty easy. Note we are using R’s integer division operator %/% to discard the remainder of the large summed integer and just giv...

Read more »

Project Euler Problem #14

February 21, 2009
By

Problem 14 on the Project Euler site asks us to find the longest chain under 1 million created using the Collatz mapping. This is fairly straightforward, although performance again is not great: ## Problem 14 # Collatz conjecture problem14 <-&...

Read more »

Project Euler Problem #12

February 21, 2009
By

Problem 12 on the Project Euler site asks: What is the value of the first triangle number to have over five hundred divisors? A triangular number T(n) is defined as . The R code below consists of a solution, which involves the fact that the number of proper divisors of an integer n can be

Read more »

How Facebook and Google use R

February 21, 2009
By
How Facebook and Google use R

How Facebook and Google use R Interesting read.  rpart comes in handy again.

Read more »

Registration for R/Finance 2009 is Open!

February 20, 2009
By
Registration for R/Finance 2009 is Open!

The conference website has details on:the agenda and speakers,travel accommodations,registration, andsponsors, who made the conference possible.Hope to see you there!

Read more »

People who love scatter plots & connecting dots

February 20, 2009
By
People who love scatter plots & connecting dots

We hosted the first Dataviz Salon SF on Tuesday night, with lightning talks by boredom cop Shane Booth, dataviz wiz Lee Byron , computational journalist Brad Stenger, data wrangler Pete Skomoroch , and any/all data enthusiast Brendan O’Connor . I was going to blog all about it — but Tom Carden of Stamen Design already

Read more »

How Google and Facebook are using R

February 19, 2009
By
How Google and Facebook are using R

(March 26th Update: Video now available) Last night, I moderated our Bay Area R Users Group kick-off event with a panel discussion entitled “The R and Science of Predictive Analytics”, co-located with the Predictive Analytics World conference here in SF. The panel comprised of four recognized R users from industry: Bo Cowgill, Google Itamar Rosenn,

Read more »

R: Good practice – adding footnotes to graphics

February 17, 2009
By
R: Good practice – adding footnotes to graphics

In some statistical programs there is the option available to attach a footnote to the graphical output that is created. This footnote may contain the name of the script or the file that produced the graphic, the author’s name and the date of creation. In SAS for example there is a footnote command to achieve

Read more »

Pearson vs. Spearman Correlation Coefficients

February 17, 2009
By
Pearson vs. Spearman Correlation Coefficients

One of the misuses of statistical terminology that annoys me most is the use of the word “correlation” to describe any variable that increases as another variable increases. This monotonic trend seems worth looking for, but it plainly is not what m...

Read more »

TTR_0.2 on CRAN

February 15, 2009
By
TTR_0.2 on CRAN

I am happy to announce a long-overdue update to the TTR package (version 0.2) is now on CRAN.This update represents a major milestone, as TTR useRs are no longer restricted to using matrix objects. TTR 0.2 uses xts internally, so all major time series...

Read more »