What a headline.. It's about combining boolean vectors in R.

What a headline.. It's about combining boolean vectors in R.

When I was a kid, I went through an 80s music phase…well, some things never change. “People just love to play with words…” Know that song? Anyway… One of the biggest pains of text mining and NLP is colloquialism — language that is only appropriate in casual language and not in formal speech or writing. Words such as informal contractions...

Saptarshi Guha, who we profiled yesterday, is at the Hadoop World conference in New York City today. At 4PM, Saptarshi will give a presentation on RHIPE, his link between R and Hadoop. Saptarashi was interviewed yesterday by Alex Handy of the SD Times, where he talked about his background and his motivation to create RHIPE. Saptarshi was sponsored by...

In case you missed them, here are some articles from August of particular interest to R users. We presented a profile of Hadley Wickham, author of many popular R packages including ggplot2 and reshape. We riffed the design of the new Twitter website into a discussion on calculating the Golden Mean with R. Several readers contributed 1-liners based on...

It is very common to cluster genes based on their expression profiles, and also very common to integrate Gene Ontology to observe the distribution of biological processes, molecular functions and cellular components for a given gene list. But, what if the two in combination? The Gene Ontology distributions across a variety of gene clusters may give us a new...

It is very common to cluster genes based on their expression profiles, and also very common to integrate Gene Ontology to observe the distribution of biological processes, molecular functions and cellular components for a given gene list. But, what if the two in combination? The Gene Ontology distributions across a variety of gene clusters may give us a...

It's the Monday of the Columbus Day weekend here, so I must have been running a Chicago Marathon yesterday. Indeed -- the 34th annual Chicago Marathon took place yesterday but everything was about its 10/10/10 date. The symmetric set of numbers was i...

With Pierre Jacob, my PhD student, and Murray Smith, from National Institute of Water and Atmospheric Research, Wellington, who actually started us on this project at the last and latest Valencia meeting, we have completed a paper on using parallel computing in independent Metropolis-Hastings algorithms. The paper is arXived and the abstract goes as follows:

"The R-Files" is an occasional series from Revolution Analytics, where we profile prominent members of the R Community. Name: Saptarshi Guha Background: Ph.D. in Statistics, Purdue University Nationality: India Years Using R: 6 Known for: Developing RHIPE package for R + Hadoop integration At just 31 years old, Saptarshi Guha has emerged as a cutting-edge contributor to the R...

Canadian R user Leo Guelman contacted me last week to ask if there was an R User Group in Toronto. There wasn't one last week, but there's one now: Leo has taken the initiative to start a new useR group. The Greater Toronto Area (GTA) R User's Group is now active on meetup.com, and taking suggestions for their first...

About 10 months ago, I was looking for a plugin to enable me to highlight R code on my self hosted WordPress blog. The solution I came up with then was to use the wp-syntax plugin (with the need for some modifications). Today I was informed of (what I believe is) a better WordPress plugin

Tal Galili’s blog post mentioned that WP-Syntax can highlight R codes. I downloaded the modified version of WP-Syntax in his blog site. The plugin throw an error when activated. I did not try the original version hosted in WordPress. I found that WP-CodeBox use GeShi for syntax highlighting as WP-Syntax does.I used this plugin when I posted perl codes, but...

Tal Galili’s blog post mentioned that WP-Syntax can highlight R codes. I downloaded the modified version of WP-Syntax in his blog site. The plugin throw an error when activated. I did not try the original version hosted in Wordpress. I found that WP-CodeBox use GeShi for syntax highlighting as WP-Syntax does.I used this plugin when I posted perl codes,...

Update: Some are not aware that GISS has switched to using Nightlights in the ROW. According to their updates they have moved to nightlights for the ROW. The station inventories can be found here The station I examine below is listed like this in the new giss inventory 20551495001 TURPAN 42.92 91.00 24 384R -9MVDEno-9x-9HOT

The puzzle in Le Monde this week is called the “square five” (sic!): Two players each have twenty-five cards with five times each of the digits 1,2,3,4,5. They alternate putting one card on top of the pile, except that they can instead take an arbitrary number of consecutive cards from the top of the pile

At midnight this morning, Kaggle began accepting submissions for the data hacking contest that we announced on Thursday. Hopefully you’ve used the last few days to build predictions for the test data set. Once you submit your predictions, you’ll be able to see your position on the leaderboard. Good luck!

The R Recommendation Engine contest is now live on Kaggle. Please head over there and start submitting your predictions for the test data set. Once you do, you can check the leaderboard to see how your algorithm compares with other people’s work. We know that there’s still plenty of progress that can be made, because

It is possible to animate a R plot and this plot can also be exported to a LaTeX document with all its animations intact. Adding this animated plot to a PowerPoint slide or save it as a flash movie file (that you can upload to YouTube) is also possible. In this article I will try to show how these...

My first introduction with LaTeX was not very pleasant. I got tired and frustrated by writing so many codes for producing a simple document; but a few days ago I could write a function for producing an animated plot in R and also could export it to my LaTeX document with all its animations intact (using animation package of...

The R package dcemriS4 is a collection of functions, with examples and documentation, that allows one to perform voxel-wise quantitative analysis of dynamic contrast-enhanced MRI (DCE-MRI) or diffusion-weighted imaging (DWI) data. The primary app...

The R package dcemriS4 is a collection of functions, with examples and documentation, that allows one to perform voxel-wise quantitative analysis of dynamic contrast-enhanced MRI (DCE-MRI) or diffusion-weighted imaging (DWI) data. The primary app...

The R package oro.nifti contains functions for the input/output and visualization of medical imaging data that follow either the ANALYZE or NIfTI formats. This package is part of the Rigorous Analytics bundle.The latest version of oro.nifti (0....