R Beginner’s Guide Book Update: Statistical Analysis with R Released

November 8, 2010
By
R Beginner’s Guide Book Update: Statistical Analysis with R Released

In the final days of October, my beginner's guide to R was released. The book's official title is Statistical Analysis with R and it can be found on the Packt Publishing website. The primary focus of Statistical Analysis with R is helping new users bec...

Read more »

A R wrapper for Google Prediction API

November 8, 2010
By
A R wrapper for Google Prediction API

Since I got the chance to access to both Google Storage for Developers and Google Prediction API (more details here and here), I decided to create a simple wrapper (just 4 basic functions until now) to be capable to play with the Google Prediction API ...

Read more »

A R wrapper for Google Prediction API

November 8, 2010
By
A R wrapper for Google Prediction API

Since I got the chance to access to both Google Storage for Developers and Google Prediction API (more details here and here), I decided to create a simple wrapper (just 4 basic functions until now) to be capable to play with the Google Prediction API ...

Read more »

Le Monde puzzle [43]

November 7, 2010
By
Le Monde puzzle [43]

Here is the puzzle in Le Monde I missed last week: Given a country with 6 airports and a local company with three destinations from each of the six airports, is it possible to find a circular trip with three intermediate stops from one of the airports? From all of the airports? One more airport

Read more »

Wetbulb Temperature

November 7, 2010
By
Wetbulb Temperature

This google map display is just one of 230 GHCN stations that is located in the water. After finding  instances of this phenomena over and over, it seemed an easy thing to find and analyze all such cases in GHCN. The issue matters for a two reasons: In my temperature analysis program I use a

Read more »

Updating meteorological forecasts, part 1

November 7, 2010
By
Updating meteorological forecasts, part 1

As Mark Twain said "the art of prophecy is very difficult, especially about the future" (well, actually I am not sure Mark Twain was the  first one to say so, but if you're interested by that sentence, you can look here). I have been rather su...

Read more »

R is a cool image editor!

November 7, 2010
By
R is a cool image editor!

Here I present some functions I wrote to recreate some of the most common image effect available in all image editor.They require the library rimage.To load the image, use:y <- read.jpeg("path")To display the image, use:plot(y)Original imageSepia tonergb2sepia <- function(img){ iRed <- img*255 iGreen <- img*255 iBlue <- img*255  oRed <- iRed * .393...

Read more »

R is a cool image editor!

November 7, 2010
By
R is a cool image editor!

Here I present some functions I wrote to recreate some of the most common image effect available in all image editor.They require the library rimage.To load the image, use:y <- read.jpeg("path") To display the image, use:plot(y) Original image Sepia tone rgb2sepia <- function(img){ iRed <- img*255 iGreen <- img*255 iBlue <- img*255  oRed <- iRed * .393...

Read more »

Installing R packages

November 6, 2010
By
Installing R packages

Part of the reason R has become so popular is the vast array of packages available at the cran and bioconductor repositories. In the last few years, the number of packages has grown exponentially! This is a short post giving steps on how to actually install R packages. Let’s suppose you want to install the

Read more »

Livin’ la Vida Poisson

November 5, 2010
By
Livin’ la Vida Poisson

Yes, I did just mix English, Spanish and French. And no, I living the “fishy” life, popular opinion to the contrary. Here’s the story. As someone who spends the majority of his time working online, with no oversight, I notice that I tend to drift a lot. I don’t play solitaire, or farm for virtual

Read more »

The bms Function Explained

November 5, 2010
By

This text 'pedagocially' explains how the bms function works code-wise and is intended for people who prefer to program customized adjustments of the bms package. bms is the workhorse function to do the sampling part of Bayesian Model Averaging in the...

Read more »

Because it’s Friday: Epidemiology in 1632

November 5, 2010
By
Because it’s Friday: Epidemiology in 1632

I first got interested in epidemiology when I saw the famous John Snow chart (in a Tufte book, I think?) which pinpointed the pump which caused the 1854 cholera outbreak in London. For some reason I'd gotten the impression that this was essentially the birth of epidemiology as a discipline, but it's actually been around a lot longer than...

Read more »

ACM Data Mining Camp 3

November 5, 2010
By

The San Francisco Bay Area chapter of the ACM is will hold its third data mining camp next Saturday (November 13) at the Ebay campus in San José. Like the previous camps, this will be a one-day "unconference"-style event, with an agenda developed ad-hoc on the day according to the interests of the attendee. With data scientists from the...

Read more »

2010 On Track to Being 2nd Warmest Year Since 1880

November 5, 2010
By
2010 On Track to Being 2nd Warmest Year Since 1880

In this post I present a 5 panel trend charts which show the year-to-date  anomaly trends for the 5 major global temperature anomaly series and a table that shows how  2010 YTD ranks over the entire record for each series. … Continue reading &#...

Read more »

Pretty R code in the blog

November 5, 2010
By
Pretty R code in the blog

David Smith (alias @revodavid, see also on the Revolutions blog, here) pointed out that my R code was not easy to read (not only due to my computing skills, but mainly because of the typography I use). He suggested that I use the Pretty R tool (her...

Read more »

New England R Users Group Meeting

November 5, 2010
By
New England R Users Group Meeting

Attended and thoroughly enjoyed Tuesday night’s New England R Users Group. We meet monthly in the Boston area to discuss the various ways in which people use and interact with the R programming language. Not surprisingly, we have a variety of industries represented. One of us is using R to recognize patterns in tissue samples

Read more »

Splines: opening the (black) box…

November 4, 2010
By
Splines: opening the (black) box…

Splines in regression is something which looks like a black box (or maybe like some dishes you get when you travel away from home: it tastes good, but you don't what's inside... even if you might have some clues, you never know for sure*). With spl...

Read more »

CrossValidated launched!

November 4, 2010
By
CrossValidated launched!

The CrossValidated Q&A site is now out of beta and the new design and site name is live. New design The new design looks great, thanks to Jin Yang, our designer-in-residence. Note the normal density icon for accepted answers and the site icon depicting a 5-fold cross-validation (light green for the test set and dark

Read more »

Dress your R code for the Web with Pretty R

November 4, 2010
By
Dress your R code for the Web with Pretty R

If you have some R code to include in a document, especially a Web-based document like a blog post, the new "Pretty R" feature on inside-R.org can help you make it look its best. Given some raw R code, it will create a HTML version of the code, adding syntax highlighting elements and links. Functions, strings, comments and literals...

Read more »

R is Hot: Part 5

November 4, 2010
By

This the final installment of a five-part article series. You can download the complete article from the Revolution Analytics website. Building a Business The value of R to business is borne out by the experiences of John Lucker and his team of advanced analytics professionals at Deloitte Consulting LLP. John is a Deloitte Consulting Principal and leads the firm’s...

Read more »

The dead of Juarez

November 4, 2010
By
The dead of Juarez

Ciudad Juárez became the most violent city in Mexico as a consequence of the vicious war between the Juárez Cartel and the Sinaloa Cartel. So far more than 6,500 people have been murdered in this city since the war started. With the Juárez Cartel significantly weakened, 2010 is set to be the most violent year yet... Here are some statistics...

Read more »

The dead of Juarez

November 4, 2010
By
The dead of Juarez

Ciudad Juárez became the most violent city in Mexico as a consequence of the vicious war between the Juárez Cartel and the Sinaloa Cartel. So far more than 6,500 people have been murdered in this city since the war started. With the Juárez Cartel significantly weakened, 2010 is set to be the most violent year yet... Here are some statistics...

Read more »

The Answer Depends on the Question

November 3, 2010
By

To quote from the preface to the first edition in Jeffreys (1961): ‘It is sometimes considered a paradox that the answer depends not only on the observations but on the question; it should be a platitude.’1 Generalized Linear Models : P. ...

Read more »

iPhone App Store Acceptance Time / Download Results

November 3, 2010
By
iPhone App Store Acceptance Time / Download Results

   Complaints about the iPhone App Store are not uncommon among developers.  The submission process is frustrating at best, you can expect arbitrary rejections, and Apple's policies have not always been particularly open or welcoming. &...

Read more »

iPhone App Store Acceptance Time / Download Results

November 3, 2010
By
iPhone App Store Acceptance Time / Download Results

   Complaints about the iPhone App Store are not uncommon among developers.  The submission process is frustrating at best, you can expect arbitrary rejections, and Apple's policies have not always been particularly open or welcoming. &...

Read more »

RMongo: Accessing MongoDB in R

November 3, 2010
By

I recently created RMongo, a database access layer to MongoDB in R as an R package. To install RMongo, download it from https://github.com/quid/RMongo/downloads Run: R CMD install RMongo_0.0.17.tar.gz I tried to mimic the RMySQL commands in RMongo. Bel...

Read more »

Keeping up with election results, with R

November 3, 2010
By
Keeping up with election results, with R

Yesterday's US election is pretty much over now: most of the results are in, the pundits have offered their political analysis, and there's even been a bit of mathematical analysis of the results, too. But last night as the results were flowing in, R user Brock Tibert just wanted to track the results of the Massachusetts governor's race. The...

Read more »

My residuals look weird… aren’t they ?

November 3, 2010
By
My residuals look weird… aren’t they ?

Since I got the same question twice, let us look at it quickly....  Some students show me a graph (from a Poisson regression) which looks like that, and they asked "isn't it weird ?", i.e."residuals are null or positive... this is not what we...

Read more »

Looping through a set of graphics in odfWeave

November 3, 2010
By

At proMENTE social research we often use the odfWeave and Sweave packages for the amazing statistics program R for automating the production of graphics and reports. odfWeave and Sweave are for the OpenOffice and lyx (www.lyx.org) word processors respectively. One problem with this approach arises when you have to produce and insert into your document

Read more »