Shrinking R’s PDF output

June 17, 2010

R is great for graphics, but I've found that the PDFs R produces when drawing large plots can be extremely large. This is especially common when using spplot() to plot a large raster. I've made a 15-page PDF full of rasters that was hundreds of MB in size. Obviously I don't need all the detail (every pixel of...

Read more »

Calling Ruby, Perl or Python from R

June 16, 2010

If you want to interact with other programming languages from R, there are various packages and bindings available. These packages provide a pretty high degree of integration between the languages and allow you to pass objects back and forth seamlessl...

Read more »

Conferenza a Padova

June 16, 2010

Today and tomorrow, I am attending the annual Italian statistical society meeting. While I very much appreciate the invitation, as well as the opportunity to walk through Padova and Venezia for a short (and alas rainy!) hour on the way there (leaving home at 8am, walking in Venezia at noon!), I am rather skeptical of

Read more »

Mary, Chloe, and Miriam at breakfast

June 16, 2010

Read more »

R-help follow-up: truncated exponential

June 16, 2010

I recently posted the message below with regard to sampling from the truncated exponential distribution. I left out the derivation of the CDF (mostly because text math is ugly), so I’ve included it here. There is also a short JSS article about truncated distributions in R. This problem in particular can likely be found in

Read more »
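
The excerpt above mentions sampling from a truncated exponential via its CDF. As a minimal sketch only, assuming right truncation at a known upper bound (the excerpt does not say which form of truncation the post uses), inverse-CDF sampling could look like the following. The names rtexp, rate, and upper are illustrative and do not come from the original post.

```r
# Sketch: inverse-CDF sampling from an exponential distribution truncated to (0, upper).
# Assumption: right truncation only; rtexp, rate, and upper are hypothetical names.
rtexp <- function(n, rate = 1, upper = Inf) {
  stopifnot(rate > 0, upper > 0)
  u <- runif(n)                     # uniform draws on (0, 1)
  mass <- 1 - exp(-rate * upper)    # P(X <= upper) for the untruncated exponential
  -log(1 - u * mass) / rate         # invert F(x) = (1 - exp(-rate * x)) / mass
}

set.seed(1)
x <- rtexp(1000, rate = 2, upper = 1.5)
range(x)   # every draw lies between 0 and 1.5
```

With upper = Inf the truncation mass is 1 and the function reduces to the usual inverse-CDF draw for an untruncated exponential.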

R Sapply Problem

Could any R expert please enlighten me? I have a problem with sapply (or lapply) that has given me a headache for over two hours. Because "for" loops are very slow in R, we should try our best to avoid them and use vectorization instead. sapply is designed for this; for example, instead of: for (i in 1:10) {z <-...

Read more »
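
Since the loop in the excerpt above is cut off, here is a small illustrative comparison rather than the author's actual code. The function square_plus_one is a hypothetical stand-in for whatever the loop body computes. Note that sapply still calls the function once per element, so it mostly tidies the code; the speed gain usually comes from the fully vectorized form.

```r
# Hypothetical per-element computation; not taken from the original post.
square_plus_one <- function(i) i^2 + 1

# Loop version: preallocate the result, then fill it element by element.
z_loop <- numeric(10)
for (i in 1:10) {
  z_loop[i] <- square_plus_one(i)
}

# sapply version: apply the function to each element and simplify to a vector.
z_sapply <- sapply(1:10, square_plus_one)

# Fully vectorized version: arithmetic operators already work on whole vectors.
z_vec <- (1:10)^2 + 1

identical(z_loop, z_sapply)   # TRUE
identical(z_loop, z_vec)      # TRUE
```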

R Commander – hypothesis testing

June 16, 2010

The R Commander GUI can be used to perform classical hypothesis testing. There are menu options for the variants of the t-test as well as tests on proportions or equality of variances for two samples of data. The “Statistics” menu provides access to various hypothesis tests via the “Means” sub-menu, including: Single sample

Read more »
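
For readers working at the console instead of the GUI, the kinds of tests named in the excerpt above are also available as functions in base R's stats package. The sketch below uses simulated data and made-up counts; whether R Commander generates exactly these calls is an assumption here, not something the excerpt states.

```r
set.seed(42)
x <- rnorm(30, mean = 0.5)   # simulated sample, not data from the original post
y <- rnorm(30, mean = 0)     # second simulated sample

one_sample <- t.test(x, mu = 0)    # single-sample t-test against a mean of 0
two_sample <- t.test(x, y)         # two-sample t-test (Welch version by default)
variances  <- var.test(x, y)       # F test for equality of two variances
proportion <- prop.test(x = 18, n = 30, p = 0.5)   # single-sample proportion test; counts are made up

one_sample$p.value   # each result is an "htest" object with a p.value component
```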

The distribution of online data usage

June 16, 2010

AT&T has recently announced it will no longer offer unlimited data plans for new iPhone users in the US, and now some carriers in the UK have followed suit. In each case, the providers claim that only a very small number of users actually use enough data to warrant an unlimited plan, and most users use relatively little and...

Read more »

Arctic Sea Ice Extent Update – June, 17, 2010: Update 2

June 15, 2010

See my Arctic Update Page for daily updates on Arctic Sea Ice Extent. We are at the halfway point in June 2010, and the Arctic Sea Ice Extent is melting at a record-breaking pace. Please note I have adjusted …

Read more »

Date and Time in R

June 15, 2010

The following are a few date and time functions that I needed to figure out early on when working with R. We will start when we are... the current system date. Sys.Date() Notice that this function returns a Date object. class(Date) A string in this format i...

Read more »
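
As a small illustration of the functions mentioned in the excerpt above, the following sketch uses only base R. The specific dates are example values chosen here, not taken from the original post.

```r
today <- Sys.Date()          # current system date
class(today)                 # "Date"

d <- as.Date("2010-06-15")   # ISO "YYYY-MM-DD" strings parse without a format argument
format(d, "%Y/%m/%d")        # "2010/06/15"

# Subtracting two Date objects gives a difftime; as.numeric() extracts the day count.
as.numeric(as.Date("2010-06-17") - d)   # 2
```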

Welcome guest blogger, Joseph Rickert

June 15, 2010

I'm about to head out for a two-week holiday, so I'll be off the grid for a little while. But I have queued up some (hopefully!) interesting stories to auto-post while I'm away, so there'll still be plenty to read every weekday as usual here on the blog. Also joining us for the next couple of weeks is guest...

Read more »

Updated SoilWeb for the iPhone + Alpha Android Version

June 15, 2010

Major updates to the SoilWeb iPhone Application.

Read more »

Statistical Analysis and Visualization of the Drug War in Mexico

June 15, 2010

On December 11, 2006 Felipe Calderon, as the first significant act of his presidency, sent the army to his home state of Michoacan. He claimed that it was to regain control of territories lost to the drug cartels, and indeed, a new cartel had started operating in Michoacan. But the fact that he won the election by the slim margin of...

Read more »

RcppArmadillo 0.2.3

June 15, 2010

The minor bug-fix release 0.2.3 of RcppArmadillo went to CRAN this morning. It adds a tiny bit of configuration to permit Sun Studio / suncc to successfully build the package. There is no code change, and no configuration change for the other platform...

Read more »

New R User Group in Kassel, Germany

June 15, 2010

There's yet another local R user group launching this week, this time in Kassel, in central Germany. Their first meeting is on June 24. If my rusty German is holding up, it looks like there will be some good discussion on editors and GUIs, how to find packages and functions, and R community resources. There's been lots of activity in...

Read more »

R for Matlab Users

My favorite software is Matlab, but partly because R is free, more and more people and companies choose to use R as a major working language. Nothing wrong with that; I am at the moment converting some of my Kalman Filter Matlab code to R. One bothersome issue is that each software has its own...

Read more »

Faster R through better BLAS

June 15, 2010

Can we make our analysis using the R statistical computing and analysis platform run faster? Usually the answer is yes, and the best way is to improve your algorithm and variable selection. But recently David Smith was suggesting that a big benefit of their (commercial) version of R...

Read more »

Clustergram: visualization and diagnostics for cluster analysis (R code)

June 15, 2010

About Clustergrams In 2002, Matthias Schonlau published in “The Stata Journal” an article named “The Clustergram: A graph for visualizing hierarchical and . As explained in the abstract: In hierarchical cluster analysis dendrogram graphs are used to visualize how clusters are formed. I propose an alternative graph named “clustergram” to examine how cluster members are assigned to clusters as...

Read more »

plagiarism exposed!

June 14, 2010

Last morn, I had the surprise of receiving the following email: This is to inform you that the following abstract has been submitted to the 3rd International Conference of the ERCIM WG on COMPUTING & STATISTICS (ERCIM’10) Ab#: 114 Title: Goodness of Fit Via Mixtures of Beta distributions Keywords: nonparametric estimation, posterior conditional predictive p-value.

Read more »

In case you missed it: May Roundup

June 14, 2010

In case you missed them, here are some articles from last month of particular interest to R users. We linked to 13 videos for learning R, from the basics ("What is R?") to more advanced topics. We noted the release of R 2.11.1. We announced that Revolution Analytics makes its software available free of charge to the academic community....

Read more »

StatEt in Ubuntu 10.04

June 14, 2010

I wanted a “lightweight” version of Eclipse to run R from Ubuntu. (I installed eclipse-pde using apt-get. It worked fine.) Once it was running, I installed StatEt via the “Install new software” feature from http://download.walware.de/eclipse-3.5. While it was downloading, I …

Read more »

New R User Groups in Sydney, South Asia

June 14, 2010

For R users in the Southern Hemisphere on the other side of the Prime Meridian, there are two new entries in the Local R User Group Directory: Sydney and South Asia. The Sydney group wins the prize for the best group acronym ever: SURF. The Sydney Users of R Forum will have their welcome meeting on July 6 July...

Read more »

Repulsive dots pattern, the difference of distance

June 14, 2010

What if you wanted to randomly place objects into a field, and the more objects you had, the more they rejected newcomers placed nearby? To find out, I set up a simulation. The code, shown at the end, isn’t all that interesting, and the plots shown below aren’t all that special. I think there is one

Read more »

Example 7.41: hazard function plotting

June 14, 2010

As we continue with our series on survival analysis, we demonstrate how to plot estimated (smoothed) hazard functions. R: We will utilize the routines available in the muhaz package. Background information on the methods can be found in K.R. Hess, D.M....

Read more »

June 20, online Registration deadline for useR! 2010

June 13, 2010

useR! 2010 is coming. I am going to give two talks there (I will write more about that soon), but in the meantime, please note that the online registration deadline is approaching. This was published on the R-help mailing list today: ————- The final registration deadline for the R User Conference is June 20, 2010, one week...

Read more »

R Commander – data manipulation and summaries

June 13, 2010

Previously we considered the R Commander interface as a simple GUI for the R statistical software system. Here we will look at how to undertake data manipulation and create basic statistical summaries of data sets. The R Commander GUI has two menus, “Data” and “Statistics”, that are used for manipulating data sets and

Read more »