Color choosing in R made easy

May 21, 2010
By
Color choosing in R made easy

I don’t know about you, but when I want to make a graph in R, I handpick the colors, line widths etc… to produce awesome output. A lot of my time is spent on color choosing, I had to find a more convenient way of doing so. Earl F. Glynn’s “Chart of R colors”  posted [&hellip

Read more »

R 2.11.1 scheduled for May 31

May 21, 2010
By

As announced by the R Core Team, the next update to R will be 2.11.1 to be released on May 31. Despite being a minor-minor version increment, this release is expected to sport at least one new feature: BIC (in package stats4) will work with multiple fitted models, like AIC does. There will also be some improvements to the...

Read more »

Tip of the day: Keep the console active in R Productivity Environment

May 21, 2010
By

Jared Lander on Twitter asks: When I alt-tab into R Enterprise how do I make the #rstats console the active window by default. Now it goes to solution explorer. Revolution's crack Support engineer Stephen Weller offers this solution: The best way to do this is to right-click on the R-Console window and make it a 'Tabbed Document'....

Read more »

Random sudokus [p-values]

May 21, 2010
By
Random sudokus [p-values]

I reran the program checking the distribution of the digits over 9 “diagonals” (obtained by acceptable permutations of rows and column) and this test again results in mostly small p-values. Over a million iterations, and the nine (dependent) diagonals, four p-values were below 0.01, three were below 0.1, and two were above (0.21 and 0.42).

Read more »

highlight 0.1-8

May 21, 2010
By

I've pushed version 0.1-8 of highlight to CRAN. highlight is a syntax highlighter for R that renders R source code into some markup language, the package ships html and latex renderers but is flexible enough to handle other formats. Syntax highligh...

Read more »

ACM Transactions on Modeling and Computer Simulation

May 20, 2010
By
ACM Transactions on Modeling and Computer Simulation

Pierre Lecuyer is the new editor of the ACM Transactions on Modeling and Computer Simulation (TOMACS) and he has asked me to become an Area Editor for the new area of simulation in Statistics. I am quite excited by this new Æditor’s hat, since this is a cross-disciplinary journal: The ACM Transactions on Modeling and

Read more »

R: A random walk though OOP land.

May 20, 2010
By

If you are used to object oriented programing in a different language, the way R does things can seem a little strange and backwards. “proto” to the rescue. With this library you can simulate “normal” OOP. I found the examples for proto not so helpful, so to figure out how the package works I sent

Read more »

Load Testing Think Time Distributions

May 20, 2010
By
Load Testing Think Time Distributions

One of my gripes about some commercial load testing tools is that they only provide a think time distribution (Z) that is equivalent to uniform variates in the client-script. If you want some other distribution, you have to code it and debug it yoursel...

Read more »

Calling all T-shirt designers

May 20, 2010
By

Got design skills as well as R skills? The organizers of the useR! 2010 conference are looking for a design to be used on the conference T-shirts. Mango Solutions (who are sponsoring the T-shirt) will select the winner, but personally I'd give extra points for designs that use R itself -- it's been done before! R-statistics blog: useR-2010 is...

Read more »

Tutorial: Principal Components Analysis (PCA) in R

May 20, 2010
By

Found this tutorial by Emily Mankin on how to do principal components analysis (PCA) using R. Has a nice example with R code and several good references. The example starts by doing the PCA manually, then uses R's built in prcomp() function to do the s...

Read more »

RcppArmadillo 0.2.0 (and 0.2.1)

May 20, 2010
By

With the Rcpp 0.8.0 release on Monday, Romain, Doug and I were able to follow-up with a new RcppArmadillo release. RcppArmadillo uses Rcpp (and a few dozen lines of 'glue') to provide a transparent interface from R to Conrad Sanderson's impressive Arma...

Read more »

RcppArmadillo 0.2.0 (and 0.2.1)

With the Rcpp 0.8.0 release on Monday, Romain, Doug and I were able to follow-up with a new RcppArmadillo release. RcppArmadillo uses Rcpp (and a few dozen lines of 'glue') to provide a transparent interface from R to Conrad Sanderson's impressive Arm...

Read more »

Prediction in the cloud: turbulent

May 19, 2010
By

While Microsoft rolled out its Technical Computing Initiative -- promising new tools for distributed parallel computing on large data sets in the cloud -- with much fanfare earlier this week, Google made a rather more understated response. In a post to the developer-focused Google Code Blog, they quietly announced two new, but potentially disruptive, products. Google BigQuery promises super-fast...

Read more »

Introduction to Revolution R webinar tomorrow, May 20

May 19, 2010
By

A quick reminder that I'll be hosting a free webinar tomorrow. Mainly intended for those new to R and Revolution, An Introduction to Revolution R will give an overview and history of the R Project and the additional features of the Revolution R products. But even if you've used R before, there will be lots of useful tips and...

Read more »

useR-2010 is looking for a T-shirt design

May 19, 2010
By
useR-2010 is looking for a T-shirt design

Katharine Mullen has just published on the R mailing list a call for designeRs who might be willing to design a T-shirt aRt design for the shirt that will be given in useR 2010. I consider such contests as one of those good-for-the-community things, and hope regular useRs, R bloggers, and companies that are based on R – will...

Read more »

Updated R code and data for ARM

May 19, 2010
By

Patricia and I have cleaned up some of the R and Bugs code and collected the data for almost all the examples in ARM. See here for links to zip files with the code and data....

Read more »

Mining and Analyzing Online Social Graph Data

May 19, 2010
By

Drew Conway, PhD student in NYU's Department of Politics, provides an introduction to mining social graph data from the Internet that focuses on the technical, substantive and ethical concerns related to this type of analysis.

Read more »

Random [uniform?] sudokus [corrected]

May 19, 2010
By
Random [uniform?] sudokus [corrected]

As the discrepancy in the sum of the nine probabilities seemed too blatant to be attributed to numerical error given the problem scale, I went and checked my R code for the probabilities and found a choose(9,3) instead of a choose(6,3) in the last line… The fit between the true distribution and the

Read more »

RcppArmadillo 0.2.1

May 19, 2010
By

Armadillo Armadillo is a C++ linear algebra library aiming towards a good balance between speed and ease of use. Integer, floating point and complex numbers are supported, as well as a subset of trigonometric and statistics functions. Various matr...

Read more »

Random [uniform?] sudokus

May 19, 2010
By
Random [uniform?] sudokus

A longer run of the R code of yesterday with a million sudokus produced the following qqplot. It does look ok but no perfect. Actually, it looks very much like the graph of yesterday, although based on a 100-fold increase in the number of simulations. Now, if I test the adequation with a basic chi-square

Read more »

LSPM Joint Probability Tables

May 18, 2010
By
LSPM Joint Probability Tables

I've received several requests for methods to create joint probability tables for use in LSPM's portfolio optimization functions.  Rather than continue to email this example to individuals who ask, I post it here in hopes they find it via a Google...

Read more »

MLB Baseball Pitching Matchups ~ downloading pitch f/x data using the XML package in R [updatedx6]

May 18, 2010
By
MLB Baseball Pitching Matchups ~ downloading pitch f/x data using the XML package in R [updatedx6]

Update x6 (Jul 27): so I guess people want pitch counts. The data @ MLB seems to only give the pitch count of the end result and the strikes/balls/outs of the particular pitch. Of course you can combine them to get the pitch count. Stupid WordPress comments strip out necessary HTML to properly display code,

Read more »

robot (SPX) DNA Management Techniques

May 18, 2010
By
robot (SPX) DNA Management Techniques

Yes, this is related to trading, but no, it is not my thesis on why the Euro is going to parity. Instead, it is sort of a workshop for robot(SPX) developers on how to organize their digital DNA. As you begin to use programming as a money extraction tool on the markets, you'll soon find...

Read more »

Confusing slice sampler

May 18, 2010
By
Confusing slice sampler

Most embarrassingly, Liaosa Xu from Virginia Tech sent the following email almost a month ago and I forgot to reply: I have a question regarding your example 7.11 in your book Introducing Monte Carlo Methods with R.  To further decompose the uniform simulation by sampling a and b step by step, how you determine the

Read more »

R: Dueling normals

May 18, 2010
By
R: Dueling normals

More playing around with R. To create the graph above, I sampled 100 times from two different normal distributions, then plotted the ratio of times that the first distribution beat the second one on the y-axis. The second distribution always had a mean of 0, the mean of first distribution went from 0 to 4,

Read more »

Parallel Computing with R for Life Sciences

May 18, 2010
By

I hadn't heard of the CloudAsia 2010 conference before, but from the programme the workshop Master Class on HPC Application For Life Sciences looked like it was interesting. One workshop session in particular caught my eye: Practical Parallel Computing in R by Xie Chao and Tan Tin Wee (from the National University of Singapore). The workshop notes (PDF) provide...

Read more »

Prototype: Web-Friendly Visualizations in R

May 18, 2010
By

Developing web-friendly data visualizations is not very difficult, though as far as I know, a package that allows one to do this directly in R does not exist (e-mail me if you know of one). As someone who has been developing lots of data-oriented software tools, it's always nice to post visualizations online. To facilitate

Read more »

JAGS 2.1.0 and rjags 2.1.0 are released

May 17, 2010
By
JAGS 2.1.0 and rjags 2.1.0 are released

JAGS 2.1.0 is now available from Sourceforge.  You will find the source as well as binary packages for Windows and Mac OS X. Binary packages for Debian are available through the usual Debian channels, and packages for RPM-based Linux distributions … Continue reading →

Read more »

House Mountain Hike

May 17, 2010
By
House Mountain Hike

My wife Mary and my Dad Wesley and I took a hike this weekend (5/14/10) to the House Mountain state recreation area in Knox county, Tennessee. The hike was about 3.8 miles with a total elevation gain of around 1000 feet (940.23ft by GPS). The plot below gives the elevation profile over the course of

Read more »