Helpful statistical references

April 7, 2009
By

In a previous article I provided a list of R programming resources. As a complement to that post, I’ve compiled below a list of statistically oriented websites that colleagues and I have found useful. For the most part, these sites focus on statistics and quantitative research methods rather than programming. This first grouping lists sites...

Read more »

PDQ 5.0 is on the Launch Pad

April 4, 2009
By

PDQ (Pretty Damn Quick) major release 5.0 is on the launch pad at Cape SourceForge. Because of a potential collision with the North Korean ICBM/satellite launch, we won't be filling the main liquid-hydrogen tank until next week (we don't want PDQ blame...

Read more »

Modern Microprocessor MIPS

April 2, 2009
By

The question of how modern microprocessors compare with mainframe processors of yore arises from time to time. The vernacular rate metric that has persisted for a long time (long in the history of computers, that is) is MIPS. Whether you approve of M...

Read more »

Rcpp 0.6.5

April 1, 2009
By

A minor new maintenance release 0.6.5 of Rcpp just went off to CRAN and Debian. This version corrects a small oversight for the OS X build, and adds the LGPL as file COPYING to the sources.

Read more »

London UseR Group Talk – Slides

April 1, 2009
By

The inaugural London UseR event was a great success, with a lot of interesting people and a very constructive networking atmosphere! I gave a (slightly disjointed) talk on concurrency and the bigmemory package in R (more on that later this year at UseR...

Read more »

SNA with R: Loading your network data

April 1, 2009
By

We are interested in Social Network Analysis using the statistical analysis and computing platform R. As usual with R, the documentation is pretty bad, so this series collects our notes as we learn more about the available packages and how they work. We use here the statnet group of packages, which seems to be the...

Read more »

R tips: Swapping columns in a matrix

March 31, 2009
By

Swapping two columns in a matrix is really easy: m <- m.

Read more »
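
For the column-swapping tip above, here is a minimal sketch of one common way to do it in base R. The example matrix and the choice of columns 1 and 3 are assumptions made for illustration, and this is not necessarily the exact one-liner from the linked post.

```r
# Hypothetical 3 x 3 example matrix; columns 1 and 3 are an arbitrary choice.
m <- matrix(1:9, nrow = 3)

# Work on a copy so the original stays available for comparison.
swapped <- m

# The right-hand side is evaluated first, so both columns can be
# assigned to each other's position in a single indexed assignment.
swapped[, c(1, 3)] <- swapped[, c(3, 1)]

print(m)
print(swapped)
```

The same pattern works for rows by moving the index vector in front of the comma.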

Enhanced tidy.source() (Preserve Some Comments)

March 31, 2009
By

After a few hours’ work, I modified the function tidy.source() in the animation package so that it can preserve complete comment lines. See the tidy.source() wiki page for an example.

tidy.source <- function(source = "clipboard", keep.comment = TRUE, keep.blank.line = FALSE, begin.comment, end.comment, ...) {
    # parse and deparse the code
    tidy.block = function(block.text)

Read more »

Multiple plot in a single image using ImageMagick

March 31, 2009
By

Sometimes you need to add several plots/images either by row or by column to a single page/sheet. If you generate all your plots with R base graphics, you can easily accomplish the task using the par() function, e.g., using par(mfrow=c(2,2)) and then dr...

Read more »
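
The par(mfrow=c(2,2)) approach mentioned in the excerpt above can be sketched roughly as follows. The simulated data, the four plot types, and the temporary PDF output file are assumptions chosen for illustration. The ImageMagick route described in the linked post is not shown here.

```r
# Write to a temporary PDF so the sketch also runs without a display.
out_file <- tempfile(fileext = ".pdf")
pdf(out_file, width = 7, height = 7)

# Split the device into a 2 x 2 grid, filled row by row.
# par() returns the previous settings, which are restored afterwards.
old_par <- par(mfrow = c(2, 2))

set.seed(1)
x <- rnorm(100)  # made-up data

hist(x, main = "Histogram")
plot(density(x), main = "Density")
boxplot(x, main = "Boxplot")
qqnorm(x, main = "Normal Q-Q")

par(old_par)
dev.off()
```

Using par(mfcol = c(2, 2)) instead would fill the grid column by column.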

How accurate or reliable are R calculations?

March 28, 2009
By

On the REvolutions Blog there is a nice post addressing the often-raised concern of “how good or reliable R is”. At my university R is hardly used. Lecturers have sometimes asked me whether the calculations done by R and its packages are accurate. The linked post addresses this matter and tries to clarify

Read more »

R: Zip fastener for two data frames / combining rows or columns of two dataframes in an alternating manner

March 27, 2009
By

Sometimes I find it useful to merge two data frames like the following ones

  X1 X2 X3 X4     Y1 Y2 Y3 Y4
1  o  o  o  o      X  X  X  X
2  o  o  o  o      X  X  X  X
3  o  o  o  o      X  X  X  X

by using zip feeding either

Read more »
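
A rough sketch of the column-wise "zip" idea from the entry above, assuming two data frames with the same number of rows and columns. The helper name zip_columns is hypothetical and is not taken from the linked post, which may use a different implementation.

```r
# Hypothetical helper: interleave the columns of two data frames
# (a1, b1, a2, b2, ...). Assumes equal dimensions.
zip_columns <- function(a, b) {
  stopifnot(ncol(a) == ncol(b), nrow(a) == nrow(b))
  n <- ncol(a)
  combined <- cbind(a, b)
  # rbind() stacks the two index vectors; as.vector() reads the result
  # column by column, giving the order 1, n + 1, 2, n + 2, ...
  new_order <- as.vector(rbind(seq_len(n), n + seq_len(n)))
  combined[, new_order, drop = FALSE]
}

# Example data modelled on the two frames in the excerpt.
x <- as.data.frame(matrix("o", nrow = 3, ncol = 4), stringsAsFactors = FALSE)
names(x) <- paste0("X", 1:4)
y <- as.data.frame(matrix("X", nrow = 3, ncol = 4), stringsAsFactors = FALSE)
names(y) <- paste0("Y", 1:4)

zipped <- zip_columns(x, y)
print(zipped)
```

Alternating rows instead of columns could follow the same index trick applied to rbind(a, b), provided both frames share column names.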

R tips: Eliminating the “save workspace image” prompt on exit

March 26, 2009
By

When using R, the statistical analysis and computing platform, I find it really annoying that it always prompts to save the workspace when I exit. This is how I turn it off.

Read more »
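
The excerpt above does not say which method the author uses, so the following is only one possible approach: the save argument of base R's q() suppresses the prompt for that call. The wrapper name quit_no_save is hypothetical.

```r
# Hypothetical convenience wrapper: quit without saving the workspace
# and without being asked. q(save = "no") is part of base R.
quit_no_save <- function() {
  q(save = "no")
}

# Typical use at the end of an interactive session (commented out here,
# because calling it would terminate the running R process):
# quit_no_save()
```

Another option is to start R with the --no-save command-line flag, which sets the same behaviour for the whole session.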

R tips: Keep your packages up-to-date

March 25, 2009
By

In this entry in a small series of tips for the use of the R statistical analysis and computing tool, we look at how to keep your add-on packages up-to-date.

Read more »
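
As a hedged illustration of the update tip above, base R's old.packages() and update.packages() can be combined as below. The wrapper name refresh_packages and the default repository URL are assumptions, and the linked post may recommend a different workflow.

```r
# Hypothetical wrapper around the standard utils functions.
# The CRAN mirror URL is an assumption; substitute your preferred mirror.
refresh_packages <- function(repos = "https://cloud.r-project.org") {
  # old.packages() returns NULL when nothing is outdated, otherwise
  # a matrix with one row per outdated package.
  outdated <- old.packages(repos = repos)
  if (is.null(outdated)) {
    message("All installed packages are up to date.")
    return(invisible(character(0)))
  }
  # ask = FALSE updates everything without prompting for each package.
  update.packages(repos = repos, ask = FALSE)
  invisible(rownames(outdated))
}

# Needs network access, so the call is left commented out:
# refresh_packages()
```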

Alternative implementations using ggplot2

March 25, 2009
By

Here and here, you can find alternative implementations of two plots (1, 2) I created some time ago using R base graphics. The author recreates the plots taking advantage of the excellent ggplot2 package.

Read more »

Inference for R

March 24, 2009
By

CREATE AUTOMATICALLY UPDATED R CHARTS AND TABLES INSIDE WORD & EXCEL

Decision Science News’ imagination has been recently captured by an innovative product called Inference for R. (R as in the open-source language for statistical computation.) To use it, you simply insert some code into your Microsoft Office documents. The Inference product connects to the

Read more »

Comparison of different circle graphs

March 24, 2009
By

See them in my Picasa here and get the corrplot package here. Thanks to Bob O'Hara for his advice :) I found that people's tastes differ, so the input parameters col (fill color) and bg (background color) were added in the new edition. What is more, you can now order your variables usin...

Read more »

Streaming Hadoop Data Into R Scripts

March 23, 2009
By

Along the lines of Mongo Measurement Requires Mongo Management, the HadoopStreaming package on CRAN provides utilities for applying R scripts to Hadoop streaming. Hadoop is used on Amazon's EC2.

Read more »

American Immigration Trends

March 22, 2009
By

The New York Times has a beautiful visualization of immigration trends in the United States since 1880. I highly recommend spending a few minutes playing with the interactive display.

Read more »

India Census 2001 – Part 1

March 22, 2009
By

For the last few weeks I have been trying to get the 2001 Indian census data. Alas, the census website is under construction. Fortunately, the Internet rewind button works, and the literacy data was still online there. The raw data is available here. I cleaned up the data so that it is easy to

Read more »

Play Sliding Puzzles on R

March 22, 2009
By

The code is shared on my Google Docs. See it here.

Read more »

Progress bar in R

March 21, 2009
By

Nice summary on how to use progress bars in R. I am posting this here in order to have a note for later searches.

Read more »

Dianne Reeves at Dominican

March 16, 2009
By

Yesterday afternoon, we had another chance to see Dianne Reeves (wikipedia). This time, it almost felt like she came to us as she was headlining at the annual trustee benefit concert at Dominican University, a small college about a mile from our place. And as in 2007 and 2003, she did not disappoint. Great voice, great stage presence. Highly recommended.

Read more »

R: Monitoring the function progress with a progress bar

March 16, 2009
By

Every once in a while I have to write a function that contains a loop doing thousands or millions of calculations. To make sure that the function does not get stuck in an endless loop, or just to fulfill the human need for control, it is useful to monitor the progress. So first I tried the

Read more »
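
The excerpt above is cut off before it names the approach the author tried. As one possible illustration, the text progress bar that ships with base R (txtProgressBar in the utils package) can monitor a loop like this. The loop body and the iteration count are placeholders.

```r
n <- 50       # assumed number of iterations
total <- 0

# style = 3 draws a bar with a percentage counter on the console.
pb <- txtProgressBar(min = 0, max = n, style = 3)

for (i in seq_len(n)) {
  total <- total + i          # stand-in for the real calculation
  setTxtProgressBar(pb, i)    # advance the bar to the current step
}

close(pb)  # finish the bar and move the cursor to a new line
```

For very tight loops, updating the bar only every few hundred iterations keeps the overhead low.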

Identify Data Points in Off-Screen R Graphics Devices

March 16, 2009
By

Today Ruya Gokhan Kocer asked me how to use the R function identify() in off-screen graphics devices. Actually it’s pretty easy as long as we obtain the list returned by identify(pos = TRUE). For example,

# open a windows device
x11()
x = rnorm(20)
y = rnorm(20)
plot(x, y)
# identify 5 points
id = identify(x, y, n = 5, pos =

Read more »
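
Since the excerpt above is truncated, here is a hedged sketch of how the list returned by identify(pos = TRUE) might be reused in an off-screen device. identify() documents two components in that list: ind (indices of the chosen points) and pos (label positions). Because identify() needs an interactive device, the sketch substitutes a hand-made list of the same shape. The index values, the PDF device, and the file name are assumptions, not the author's code.

```r
set.seed(42)
x <- rnorm(20)
y <- rnorm(20)

# In an interactive session this would come from:
#   id <- identify(x, y, n = 5, pos = TRUE)
# Here a hand-made stand-in with the same structure is used instead.
id <- list(ind = c(3L, 7L, 11L, 15L, 19L),
           pos = c(1L, 2L, 3L, 4L, 1L))

# Redraw the plot in an off-screen device and place the labels
# where identify() reported them (1 = below, 2 = left, 3 = above, 4 = right).
out_file <- tempfile(fileext = ".pdf")
pdf(out_file)
plot(x, y)
text(x[id$ind], y[id$ind], labels = id$ind, pos = id$pos)
dev.off()
```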