Course: Analysing Palaeolimnological Data with R

April 8, 2012
By

16th — 20th August 2012, University Marine Biological Station Millport, Isle of Cumbrae, Scotland. Steve Juggins and I will be running a 4-day residential R course to coincide with the International Paleolimnology Symposium 2012 this August. The course will be held at the University Marine Biological Station Millport on the Isle of Cumbrae, within easy...

Read more »

Using bigmemory for a distance matrix

April 7, 2012
By
Using bigmemory for a distance matrix

The process of working on metadata and temperature series gives rise to several situations where I need to calculate the distance from every station to every other station. With a small number of stations this can be done easily on the fly with the result stored in a matrix. The matrix has rows and columns

Read more »

Newton’s Method In R

Here is a toy example of implementing Newton’s method in R. I found some old code that I had written a few years ago when illustrating the difference between convergence properties of various root-finding algorithms, and this example shows a … Continue reading →

Read more »

What are the distributions on the positive k-dimensional quadrant with parametrizable covariance matrix? (solved)

April 7, 2012
By
What are the distributions on the positive k-dimensional quadrant with parametrizable covariance matrix? (solved)

Paulo (from the Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil) has posted an answer to my earlier question both as a comment on the ‘Og and as a solution on StackOverflow (with a much more readable LaTeX output). His solution is based on the observation that the multidimensional log-normal distribution still allows

Read more »

Writing reproducibly in the open with knitr

April 7, 2012
By
Writing reproducibly in the open with knitr

Sweave is something of a gold standard in reproducible research. It creates a dynamic document, written in a mix of LaTeX and R code where the results of the analysis (numbers, figures, tables) are automatically generated from the code and inserted into the resulting pdf document, making them easy to update if the data or

Read more »

An R programmer looks at Julia

April 7, 2012
By

In January of this year I first saw mention of the Julia language in the release notes for LLVM. I mentioned this to Dirk Eddelbuettel and later we got in contact with Viral Shahregarding a Debian package for Julia. There are many aspects of Julia tha...

Read more »

Writing reproducibly in the open with knitr

April 7, 2012
By
Writing reproducibly in the open with knitr

Sweave is something of a gold standard in reproducible research. It creates a dynamic document, written in a mix of LaTeX and R code where the results of the analysis (numbers, figures, tables) are automatically generated from the code and inserted in...

Read more »

Temperature Change in Ireland

April 7, 2012
By
Temperature Change in Ireland

Has Ireland gotten any warmer? Ask any punter on the street and they will happily inform you of wild swings, trends and dips. “Back when I was a child”, “when I was younger”, or “years ago” are the usual refrains. What’s the evidence? To answer this, I will use the temperature data from my previous

Read more »

Install R 2.15 and further versions in Debian Squeeze

April 6, 2012
By

The last Friday, March 30th, the last stable version of R, the version 2.15.0 was released.So, to install it in Debian Squeeze, or in another Distro powered by Debian (I actually use CrunchBang Linux), just follow the same instructions described here f...

Read more »

The race for speed at the data layer

April 6, 2012
By

The competition amongst database vendors to create the fastest, most powerful "data layer" — the hardware and software to provide storage for Big Data with high-performance data processing — is clearly heating up. The Netezza appliance has been so successful that IBM has been racing to keep up with demand. SAP is also seeing success with its HANA in-memory...

Read more »

RNA-Seq Methods & March Twitter Roundup

April 6, 2012
By

There were lots of interesting developments this month that didn't work their way into a full blog post. Here is an incomplete list of what I've been tweeting about over the last few weeks. But first I want to draw your attention to the latest manuscri...

Read more »

R-Bloggers’ Web-Presence

April 6, 2012
By

We love them, we hate them: RANKINGS!Rankings are an inevitable tool to keep the human rat race going. In this regard I'll pick up my last two posts (HERE & HERE) and have some fun with it by using it to analyse R-Bloggers' web presence. I will use...

Read more »

Nonconvexity, and playing indoor paintball

April 6, 2012
By
Nonconvexity, and playing indoor paintball

Following the two previous posts (here and there), on the number of people that don't get wet while playing with water pistols, consider now an indoor version, in a non-convex room (i.e. player behind wall are now, somehow, protected). In the previ...

Read more »

Dynamite plots in R

April 6, 2012
By
Dynamite plots in R

For some time I've contemplated creating a function for creating the dynamite plots beloved by many of the applied sciences. There's a lot of criticism regarding their utility, and there are several ways that present data in a more intelligible way. A search on the subject brings up pages with such emotive titles as "Dynamite plots: unmitigated evil?"...

Read more »

The 50 most used R packages

April 5, 2012
By
The 50 most used R packages

Ask anyone what makes R a great language, one argument that often comes back is its very active community. Proof is the impressive number of packages contributed by developers from all horizons and backgrounds. The CRAN website alone lists 3,725 p...

Read more »

Compete in the Data Science Hackathon, April 28

April 5, 2012
By

All around the world at noon GMT on April 28, data scientists around the world will compete in the world's first one-day International Data Science Hackathon, organized by Data Science London. Participants will receive a data set at the beginning of the event, and work in teams of 3-5 over the following 24 hours to create the best predictive...

Read more »

An intro to R

April 5, 2012
By
An intro to R

A few weeks back I gave a talk at the local Berkeley R meetup group. The idea was to help people not make the same mistakes I made when I first started out learning R. It was the first time I made an entire presentation with Deck.js and I generated the syntax highlighted R code

Read more »

Use file.choose to customize output filenames in R functions

April 5, 2012
By

In this post, I want to address the following issue: several data files with a common trame have to be dealt with by an R function. The function should export files (such as images or data files or any other file type). I explain how to create filenames such that the function automatically exports files in the same directory...

Read more »

useR! 2012 Deadlines Approaching: Registration, Hotels, Student Scholarships

April 5, 2012
By
useR! 2012 Deadlines Approaching: Registration, Hotels,  Student Scholarships

Forwarded from Frank Harrell: DEADLINES FAST APPROACHING – 8th Annual International R User Conference useR! 2012, Nashville, Tennessee USA Registration Deadlines: Early Registration: Passed Regular Registration: Mar 1- May 12 Late Registration: May 13 – June 4 On-Site Registration: June 12 – June 15 Please note: Nashville is offering several large entertainment events the month

Read more »

Gaussian process regression with R

April 5, 2012
By
Gaussian process regression with R

I’m currently working my way through Rasmussen and Williams’s book on Gaussian processes. It’s another one of those topics that seems to crop up a lot these days, particularly around control strategies for energy systems, and thought I should be able to at...

Read more »

Basics of Working With Data in R

April 5, 2012
By

(This article was first published on R Video Tutorials - Stats Make Me Cry, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: R Video Tutorials - Stats Make Me Cry. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2,...

Read more »

Basics of Working With Data in R

April 5, 2012
By

Read more »

Where hiding if you don’t want to get wet ?

April 5, 2012
By
Where hiding if you don’t want to get wet ?

Following the previous post, two additional remarks. Following a comment by @cosi, I have investigated quickly a binomial fit to the distribution of the number of people not getting wet, with a fixed number of players on the field. It looks like it...

Read more »

Melt

April 5, 2012
By

There are many situations where data is presented in a format that is not ready to dive straight to exploratory data analysis or to use a desired statistical method. The reshape2 package for R provides useful functionality to avoid having to hack data around in a spreadsheet prior to import into R. The melt function

Read more »

A Little Web Scraping Exercise with XML-Package

April 5, 2012
By

Some months ago I posted an example of how to get the links of the contributing blogs on the R-Blogger site. I used readLines() and did some string processing using regular expressions.With package XML this can be drastically shortened - see this:# get...

Read more »

R Structure Explained

April 4, 2012
By

This post by Suraj Gupta explains it all. This is the first time I have seen a  concise and accessible explanation of the R environment structure and why it matters.   Addendum: This one by Digithead is also pretty good

Read more »

R Structure Explained

April 4, 2012
By

This post by Suraj Gupta explains it all. This is the firs time I have seen a  concise and accessible explanation of the R environment structure and why it matters.   Addendum: This one by Digithead is also pretty good

Read more »

R, I Love You

April 4, 2012
By

It is easier to critique than it is to create. I write this post with much gratitude for R, the R community and particularly R-Core who are paid $0 to bring us R. I’d like to offer an idea and I’m wondering if people are interested in ral...

Read more »

Data Science Undefined

April 4, 2012
By

One of the favorite bar room discussions of statisticians, machine learners, and computer scientists is – what is data science? (And I don’t care whether it happens in a bar or not, it’s a “bar room” discussion by virtue of...

Read more »