Issues of R Client Library For The Google Prediction API

December 16, 2010
By

The Google Prediction API is a black-box system for building predictive models, it provides pattern-matching and machine learning capabilities. So Google algorithms automatically creates a model from the training models given a set of training data and makes prediction under this model given a set of explanatory variables, read http://code.google.com/apis/predict/docs/getting-started.html for an overview. I am eager...

Read more »

R 2.12.1 is out

December 16, 2010
By

As promised, the latest patch to R is out with the release of R 2.12.1, as announced today by the R Core Team. If you build R yourself, sources are available now at your local CRAN mirror, and binaries for Windows, Mac and Linux will be available in the next few days. There are a few new features: The...

Read more »

Tennis and risk management

December 16, 2010
By
Tennis and risk management

As mentioned already here, while we were going to Québec City for the workshop, we had interesting discussions in the car, and Maciej mentioned an article recently published in The Actuary, Hence, I wanted to discuss (extremely) rare event probabi...

Read more »

Where to find good data sets

December 16, 2010
By

O'Reilly Media has been a big advocate of Open Data and believes that is where a lot of computing is going to be headed in the future.  I think they are definitely on to something.  Yet the future could be now.  There is a lot of opportu...

Read more »

Where to find good data sets

December 16, 2010
By

O'Reilly Media has been a big advocate of Open Data and believes that is where a lot of computing is going to be headed in the future.  I think they are definitely on to something.  Yet the future could be now.  There is a lot of opportu...

Read more »

Area plots unmasked

December 15, 2010
By
Area plots unmasked

RESULTS OF THE GREAT AREA PLOT QUIZ If you are the type of reader who remembers things from last week, you may remember the great area plot quiz we had running. This week, we are excited to announce that the results are in. The plot above shows answers to the four questions. The correct answers

Read more »

Value of an R object in an expression

December 15, 2010
By
Value of an R object in an expression

Just wanted to create an expression, existing of some mathematical annotation and a value of an R object. Wasn't that intuitive!

Read more »

Data Driven Journalism

December 15, 2010
By

Last night at the Bay Area UseR Group meeting, Peter Aldhous, San Francisco Bureau Chief of New Scientist Magazine, gave an inspiring presentation about Data Driven Journalism. Even though the newspaper industry is faltering as a business model, there's a beacon of light: journalists can be the driving force behind bringing the meaning in the huge data sets that...

Read more »

Databases (SQL, noSQL); Interfacing R with Excel

December 15, 2010
By
Databases (SQL, noSQL); Interfacing R with Excel

Los Angeles R users group Dec. 14 2010 meeting (see meetup info here): 1. A SQL primer for R users – Neal Fultz Video and slides will be available soon 2. R Database Access – Shrikrishna Bhogaonker 3. NoSQL data … Continue reading →

Read more »

I really need to find hot (and sexy) topics

December 15, 2010
By
I really need to find hot (and sexy) topics

50 days ago (here), I was supposed to be very optimistic about the probability that I could reach a million viewed pages on that blog (over a bit more than two years). Unfortunately, the wind has changed and today, the probability is quite low... ...

Read more »

Customizing Maps in R: spplot() and latticeExtra functions

December 15, 2010
By
Customizing Maps in R: spplot() and latticeExtra functions

I recently noticed the new latticeExtra page on R-forge, which contains many very interesting demos of new lattice-related functionality. There are strong opinions about the "best" graphics system in R (base graphics, grid graphics, lattice, ggplot, et...

Read more »

Examples for infochimps Package, and Intro Slides

December 15, 2010
By
Examples for infochimps Package, and Intro Slides

Tomorrow I am headed out of town for a few weeks, so posting will be lighter than the usual lightness. My first stop is Chicago, and tomorrow night I will be speaking about my R package for the infochimps API at the Chicago R Users Group. If you are in the Chicagoland area and

Read more »

Why Use R?

December 14, 2010
By

I use R very frequently and take for granted much that it has to offer.  I forget how R is different from similar tools, so I have trouble communicating the benefits of using R.  The goal of this post is to highlight R's main strengths, but first... my story.How I got started with RI was introduced...

Read more »

Why Use R?

December 14, 2010
By

I use R very frequently and take for granted much that it has to offer.  I forget how R is different from similar tools, so I have trouble communicating the benefits of using R.  The goal of this post is to highlight R's main strengths, but first... my story.How I got started with RI was introduced...

Read more »

Facebook’s Social Network Graph

December 14, 2010
By
Facebook’s Social Network Graph

Paul Butler, an intern on Facebook’s data infrastructure engineering team, was interested in visualizing the "locality of friendship". Luckily, he has some great data to work with: Facebook's social network of the friendships between its 500 million members. But visualizing that much data can be a challenge in its own right -- it takes skill to draw meaning from...

Read more »

Logical operators in R

December 14, 2010
By
Logical operators in R

In R, the operators “|” and “&” indicate the logical operations OR and AND. For example, to test if x equals 1 and y equals 2 we do the following: > x = 1; y = 2 > (x == 1) & (y == 2) TRUE However, if you are used to programming in

Read more »

My first Reproducible Research Compendium

December 14, 2010
By

I have just completed my first Reproducible Research Compendium “Analysis of the combined survey datasets from the American Red Cross Tsunami Recovery Program Psycho-Social Project (adult community respondents)”. It is basically all the reports and data from all the work I did on evaluation psychosocial projects for the American Red Cross, bundled up. But one

Read more »

My first Reproducible Research Compendium

December 14, 2010
By

I have just completed my first Reproducible Research Compendium "Analysis of the combined survey datasets from the American Red Cross Tsunami Recovery Program Psycho-Social Project (adult community respondents)". It is basically all the repo...

Read more »

Is it that stupid to make extremely long term forecast when studying mortality ?

December 14, 2010
By
Is it that stupid to make extremely long term forecast when studying mortality ?

I received recently a comment by FCA (here) who raised an important question, about forecast in dynamic mortality models. (S)he mentioned that from his(her) point of view, the econometric models I considered were "good to predict for the next, say,...

Read more »

RcppDE 0.1.0

December 13, 2010
By

A new package RcppDE has been uploaded in a first version 0.1.0 to CRAN. It provides differential evolution optimisation---a variant of stochastic optimisation that is similar to genetic algorithms but particularly suitable for the floating-point repr...

Read more »

Adap’skiii [latest]

December 13, 2010
By
Adap’skiii [latest]

Just to point out there still is room for more participants to the Adap’skiii workshop! We have now reached 60 participants for this Utah workshop and would welcome more, quite obviously! All participants are also free to present a poster on the evening of the 4th, in the bar. Filed under: pictures, R, Statistics, Travel,

Read more »

Machine Learning and Data Mining with R

December 13, 2010
By

The San Francisco Bay Area ACM runs several courses on data mining and machine learning with R. Machine Learning 101 deals primarily with supervised learning problems, and Machine Learning 102 covers unsupervised learning and fault detection. Machine Learning 101 & 102 were most recently presented by Mike Bowles & Tricia Hoffman in September, and the lecture notes and class...

Read more »

Example 8.18: A Monte Carlo experiment

December 13, 2010
By
Example 8.18: A Monte Carlo experiment

In recent weeks, we've explored methods to fit logistic regression models when a state of quasi-complete separation exists. We considered Firth's penalized likelihood approach, exact logistic regression, and Bayesian models using Markov chain Monte Ca...

Read more »

Video of Reproducible Research with R: Melbourne R Users 1st Dec 2010

December 13, 2010
By

As previously mentionedI gave a talk at Melbourne R Users Group titled "Reproducible Research and R Workflow". It covered technologies including LaTeX, Sweave, R, make, Eclipse, and git. This post shares the video. I'd like to thank Pedro Olaya for fi...

Read more »

Video of Reproducible Research with R: Melbourne R Users 1st Dec 2010

December 13, 2010
By

As previously mentionedI gave a talk at Melbourne R Users Group titled "Reproducible Research and R Workflow". It covered technologies including LaTeX, Sweave, R, make, Eclipse, and git. This post shares the video. I'd like to thank Pedro Olaya for ...

Read more »

Some quibbles about “The R Book” by Michael Crawley

December 13, 2010
By
Some quibbles about “The R Book” by Michael Crawley

A friend recently bought The R Book and I said I would tell him of problems that I’ve noticed with it.  You can eavesdrop. Page 4 The word “library” is used instead of “package”.  This (common)  error substantially raises the blood pressure of some people — probably to an unwarranted extent. An R package is … Continue reading...

Read more »

Ghcn V3 Metadata improvements

December 12, 2010
By
Ghcn V3 Metadata improvements

The Global Historical Climate Network  (GHCN) is in it’s beta stage. On of the stated goals of the project is to improve the metadata that is provided for the station data.  Over the past few months several independent volunteers have been focusing on the issue of station metadata, each with their own focus. Ron Broberg

Read more »

Using R for Introductory Statistics, Chapter 4

December 12, 2010
By
Using R for Introductory Statistics, Chapter 4

Chapter 4 of Using R for Introductory Statistics gets us started working with multivariate data. The question is: what are the relationships among the variables? One way to go about answering it is by pairwise comparison of variables. Another technique is to divide the data into categories by the values of some variables and...

Read more »

Using R for Introductory Statistics, Chapter 4

December 12, 2010
By
Using R for Introductory Statistics, Chapter 4

Chapter 4 of Using R for Introductory Statistics gets us started working with multivariate data. The question is: what are the relationships among the variables? One way to go about answering it is by pairwise comparison of variables. Another techniq...

Read more »