## Rcpp 0.9.0 announcement

December 25, 2010
The text below went out as a post to the r-packages list a few days ago, but I thought it would make sense to post it on the blog too. So with a little html markup... Summary Version 0.9.0 of the Rcpp package is now on CRAN and its mirrors. Thi...

## one-dimensional integrals

December 25, 2010
The fundamental idea of numerical integration is to estimate the area of the region in the xy-plane bounded by the graph of a function f(x). The integral is estimated by dividing the x-range into small intervals, then adding up the small approximations to give a total approximation.
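As a minimal sketch of that idea (not the code from the original post), the midpoint rule below splits the range into `n` equal intervals and sums the rectangle areas. The function name `midpoint_integral` and the test integrand are assumptions chosen for illustration.

```r
# Midpoint rule: split [a, b] into n equal intervals and add up the
# areas of rectangles whose height is f at each interval's midpoint.
midpoint_integral <- function(f, a, b, n = 1000) {
  h <- (b - a) / n                      # width of each small interval
  mids <- a + (seq_len(n) - 0.5) * h    # midpoint of each interval
  sum(f(mids)) * h                      # total of the small approximations
}

# Illustrative integrand: x^2 on [0, 1], whose exact integral is 1/3.
approx <- midpoint_integral(function(x) x^2, 0, 1)

# Base R's adaptive integrator, for comparison.
reference <- integrate(function(x) x^2, 0, 1)$value
```

Increasing `n` shrinks the intervals and usually tightens the estimate, at the cost of more function evaluations.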

## Chromosome bias in R, my notebook

December 23, 2010
My goal is to develop a means of detecting chromosome bias from a human BAM file. Because I've been working with proprietary and novel plant genomes for the last three years, I haven't had the chance to use any of the awesome UCSC-based annotational features that have been introduced and refined in Bioconductor until now. I've returned to biomedical research...

## Did you feel that?

December 23, 2010
There was a small earthquake in northern England on Tuesday. Barry Rowlingson felt the quake (it rattled the photographs on his wall), but didn't know how big of a quake it was because he didn't know how close he was to the epicentre. The British Geological Survey hadn't yet announced the quake, but did give access to seismograph readings,...

## Citizen Data Journalism: Mexico Homicides

December 23, 2010
I've recently praised some mainstream media outlets like the New York Times and New Scientist for leading the charge on data journalism. But you don't need to be a large organization to find news in data. With open data sources, and open-source data analysis tools, individuals can make newsworthy discoveries. Diego Valle-Jones has been investigating the impact of the...

## R function to convert degrees to radians

December 23, 2010
I would have never imagined that I would have to go back to high school concepts and do strange trigonometric calculations. However, it happened to me that I needed to convert GPS coordinates of a large data set to radians. It’s a trivial task, if you know how to do it. The function takes as
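The excerpt is cut off before the function appears, so the following is only a sketch of what such a conversion might look like. The name `deg2rad` is an assumption, not necessarily the one used in the post.

```r
# Convert angles from degrees to radians (vectorised over its input).
# 180 degrees correspond to pi radians, so multiply by pi / 180.
deg2rad <- function(deg) {
  deg * pi / 180
}

# Hypothetical GPS coordinates in decimal degrees.
lat_deg <- c(0, 45, 90)
lat_rad <- deg2rad(lat_deg)
```

Because R arithmetic is vectorised, the same function can be applied to a whole column of a large data set in one call.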

## Project Euler — Problem 187

December 23, 2010
http://projecteuler.net/index.php?section=problems&id=187 A composite is a number containing at least two prime factors. For example, 15 = 3 × 5; 9 = 3 × 3; 12 = 2 × 2 × 3. There are ten composites below thirty containing precisely two, not necessarily distinct, prime factors: 4, 6, 9, 10, 14, 15, 21, 22, 25, 26. Read...
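The small example in the problem statement can be checked with a brute-force sketch like the one below. It is not the solution from the linked post, the helper name `count_prime_factors` is made up for illustration, and trial division is only sensible for small bounds like this one.

```r
# Count the prime factors of n with multiplicity, by trial division.
count_prime_factors <- function(n) {
  count <- 0
  d <- 2
  while (d * d <= n) {
    while (n %% d == 0) {   # divide out every copy of the factor d
      n <- n / d
      count <- count + 1
    }
    d <- d + 1
  }
  if (n > 1) count <- count + 1   # whatever remains is itself prime
  count
}

# Numbers below thirty with precisely two prime factors.
candidates <- 2:29
semiprimes <- candidates[sapply(candidates, count_prime_factors) == 2]
```

Here `semiprimes` holds the ten values listed in the problem statement, from 4 up to 26.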

## Some problems with the Mexican mortality database

December 22, 2010
I've written before about how I couldn't find the Acteal Massacre in the homicide database available at the INEGI. So I decided to check if the deaths that occurred as a consequence of the massacre were misclassified as other types of violent death, and indeed they were. In the municipality of Chenalhó,...

## A plea for consistent style!

December 22, 2010
As we get close to the end of the year, it's time to look back over the past year and think of resolutions for 2011 and beyond. One that's often on my mind relates to ways to structure my code to make it clearer to others (as well as to myself when I l...

## Forbes: R is a name you need to know in 2011

December 22, 2010
The December 20 issue of Forbes magazine, on newsstands now, includes a column about R on page 128 as part of the "Name You Need to Know in 2011" feature. It's basically an excerpt from this blog post by Steve McNally and its comments, and includes quotes from Norman Nie of Revolution Analytics, Bill Alpert of Barron's, and Brandon...

## A Special Graphics Device in R: the Null Device

December 22, 2010
It is well-known that R has several graphics devices — either the screen devices (X11(), windows(), …) or the off-screen devices (pdf(), png(), …). We can query the default graphics device in options(): getOption('device') In a non-interactive session, the default device is pdf(). This is why Sweave has to create a file named Rplots.pdf no
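A short sketch of the query mentioned above, using only base R. What it reports depends on how the session was started and on the front end in use, so no particular output is assumed here.

```r
# Look up the default graphics device for this session.
dev <- getOption("device")

# The option may be a device name (character) or a function,
# depending on the environment R is running in.
if (is.character(dev)) {
  message("Default device: ", dev)
} else {
  message("Default device is supplied as a function")
}

# TRUE in an interactive session, FALSE when run as a script.
is_interactive <- interactive()
```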

## Oil – Natural Gas Cointegration – turning point?

December 22, 2010

## Data-driven arterial input functions

December 22, 2010
I haven't devoted any time/effort/code to the extraction and parameter estimation of arterial input functions (AIFs) for DCE-MRI in either the vignette or the submission to JSS. Frankly, I wanted to communicate the main features of the R package dcemriS4, and data-driven AIFs just haven't made the list... up 'til now. So let's assume one wants to fit...

## Stacked histogram with ggplot2

December 22, 2010
With ggplot2 it is possible to divide the bars of a histogram into different categories.
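The original example is not part of this excerpt, so here is a minimal sketch of one way to do it, assuming the ggplot2 package is installed. The built-in `iris` data set and the bin width are illustrative choices, not taken from the post.

```r
library(ggplot2)

# Map the category (Species) to the fill colour; each bar of the
# histogram is then split into one stacked segment per category.
p <- ggplot(iris, aes(x = Sepal.Length, fill = Species)) +
  geom_histogram(binwidth = 0.25, position = "stack")

# print(p) draws the plot on the current graphics device.
```

Stacking is the default position for `geom_histogram()`, so `position = "stack"` is spelled out here only to make the intent explicit.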

## RcppExamples 0.1.2

December 22, 2010
A new version of our RcppExamples package is now on CRAN. RcppExamples contains a few illustrations of how to use Rcpp. It grew out of documentation for the classic API (now in its own package RcppClassic) and we added more functions documenting...

## CrossValidated Journal Club

December 21, 2010
Journal Clubs are a great way to learn new research ideas and to keep up with the literature. The idea is that a group of people get together every week or so to discuss a paper of joint interest. This can happen within your own research group or department, or virtually online. There is now

## Questions on the parallel Rao-Blackwellisation

December 21, 2010
Pierre Jacob and I got this email from a student about our parallel Rao-Blackwellisation paper. Here are some parts of the questions and our answer: Although I understand how the strategy proposed in the paper helps in variance reduction, I do not understand why you set b=1 (mentioned in Section 3.2) and why it plays

## How Orbitz uses Hadoop and R to optimize hotel search

December 21, 2010
Positional bias — the tendency for users to preferentially select results in the first few positions of a search — is a big issue for all kinds of search engines. But for online travel site Orbitz the stakes are higher than for a traditional Web search engine: if a customer chooses the first-listed hotel in a search for accommodations,...

## My favorite R packages (installed with one command)

December 21, 2010
I just started a new job (working on social search awesomeness at Bing) and so I had to set up my “dev” environment with all of my usual tools (R, Python, vim, etc.). One thing that made this a bit easier is my habit of keeping an R script around that installs all of my common packages
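A script of that kind might look roughly like the sketch below. The package list, the helper names, and the choice of CRAN mirror are all assumptions for illustration, since the author's actual script is not shown in this excerpt.

```r
# Hypothetical list of favourite packages; edit to taste.
wanted <- c("ggplot2", "plyr", "reshape")

# Return the packages from `pkgs` that are not yet installed.
missing_packages <- function(pkgs) {
  setdiff(pkgs, rownames(installed.packages()))
}

# Install whatever is missing, in a single call to install.packages().
install_missing <- function(pkgs, repos = "https://cloud.r-project.org") {
  to_install <- missing_packages(pkgs)
  if (length(to_install) > 0) {
    install.packages(to_install, repos = repos)
  }
  invisible(to_install)
}

# On a fresh machine, run:
# install_missing(wanted)
```

Checking what is already installed first keeps the script cheap to re-run on a machine that is partly set up.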

## NppToR 2.5.2 Improves startup

December 21, 2010
I’ve been getting lots of feedback that there are problems starting NppToR with some of the latest versions. I took to the task of looking at that yesterday on the train home. I have made improvements to the way NppToR finds the RHome directory, not relying entirely on the Windows registry. I also removed the

## R programming books

December 21, 2010
My sabbatical is rapidly coming to an end, and I have to start thinking more and more about teaching. Glancing over my module description for the introductory computational statistics course I teach, I noticed that it’s a bit light on recommended/background reading. In fact it has only two books: A first course in statistical programming

## A Very Data Christmas

December 21, 2010
This week Google announced its Ngram Viewer, which allows you to explore the use of words in thousands of texts over time, going back two hundred years. Given the relatively long time period covered by this massive data set, it is fun to explore how language has changed over time. Some texts, however, seem to transcend time.