Blog Archives

A Matrix Powers Package, and Some General Edifying Material on R

August 16, 2014
By
A Matrix Powers Package, and Some General Edifying Material on R

Here I will introduce matpow, a package to flexibly and conveniently compute matrix powers.  But even if you are not interested in matrices, I think many of you will find that this post contains much general material on R that you’ll find useful.  Indeed, most of this post will be about general R issues, not

Read more »

New freqparcoord Example

August 5, 2014
By
New freqparcoord Example

In my JSM talk this morning, I spoke about work done by Yingkang Xie and myself, on a novel approach to the parallel coordinates method of visualization.  I’ve made several posts to this blog in the past on freqparcoord, our implemention of our method. My talk this morning used some recently-available NYC taxi data.  You

Read more »

Code Snippet: Extracting a Subsample from a Large File

August 1, 2014
By
Code Snippet:  Extracting a Subsample from a Large File

Last week a reader of the r-help mailing list posted a query titled “Importing random subsets of a data file.”  With a very large file, it is often much easier and faster–and really, just as good–to just work with a much smaller subset of the data. Fellow readers then posted rather sophisticated solutions, such as storing

Read more »

A Handy Trick for Remote Graphics

July 22, 2014
By
A Handy Trick for Remote Graphics

I often create plots that require quite a bit of computation.  Ideally I would run this on what I’ll call Machine A, which is a very fast machine, but I am often far away, on Machine B.  So, I’d like to run my computation on B but display it on A. For the platforms I

Read more »

Rth: a Flexible Parallel Computation Package for R

June 17, 2014
By
Rth:  a Flexible Parallel Computation Package for R

I’ve been mentioning here that I’ll be discussing a new package, Rth, developed by me and Drew Schmidt, the latter of pbdR fame.  It’s now ready for use!  In this post, I’ll explain what goals Rth has, and how to use it. Platform Flexibility The key feature of Rth is in the word flexible in

Read more »

Rth: a Flexible Parallel Computation Package for R

June 17, 2014
By
Rth:  a Flexible Parallel Computation Package for R

I’ve been mentioning here that I’ll be discussing a new package, Rth, developed by me and Drew Schmidt, the latter of pbdR fame.  It’s now ready for use!  In this post, I’ll explain what goals Rth has, and how to use it. Platform Flexibility The key feature of Rth is in the word flexible in

Read more »

R beats Python! R beats Julia! Anyone else wanna challenge R?

May 21, 2014
By
R beats Python!  R beats Julia!  Anyone else wanna challenge R?

Before I left for China a few weeks ago, I said my next post would be on our Rth parallel R package. It’s not quite ready yet, so today I’ll post one of the topics I spoke on last night at the Berkeley R Language Beginners Study Group. Thanks to the group for inviting me,

Read more »

R beats Python! R beats Julia! Anyone else wanna challenge R?

May 21, 2014
By
R beats Python!  R beats Julia!  Anyone else wanna challenge R?

Before I left for China a few weeks ago, I said my next post would be on our Rth parallel R package. It’s not quite ready yet, so today I’ll post one of the topics I spoke on last night at the Berkeley R Language Beginners Study Group. Thanks to the group for inviting me,

Read more »

What Can Go Wrong: My Favorite Example

April 28, 2014
By
What Can Go Wrong:  My Favorite Example

I’m one of many who bemoan the fact that statistics is typically thought of as — alas, even taught as — a set of formula plugging methods. One enters one’s data, turns the key, and the proper answers pop out. This of course is not the case at all, and arguably statistics is as much

Read more »

What Can Go Wrong: My Favorite Example

April 28, 2014
By
What Can Go Wrong:  My Favorite Example

I’m one of many who bemoan the fact that statistics is typically thought of as — alas, even taught as — a set of formula plugging methods. One enters one’s data, turns the key, and the proper answers pop out. This of course is not the case at all, and arguably statistics is as much

Read more »