A Word Cloud with Spatial Meaning

December 7, 2011
By
A Word Cloud with Spatial Meaning

..Some time ago I did a word cloud for representing a Google Scholar search result. Tal Galili pointed me at a post by Drew Conway that expanded on the topic of word clouds lacking spatial meaning. In fact the spatial ordering of words in a word cloud ...

Read more »

UseR! 2011 slides are now available

December 7, 2011
By

I have just realized that UseR! 2011 presentation slides are now available from the conference web site.Unfortunately, no big surprise this year. Or maybe this is good news as it means that I have all the important stuff in my RSS Reader. And by the way, this blog is now listed on www.r-bloggers.com.To be fair there was...

Read more »

Principal Components Analysis with "R" (Part: 001)

December 7, 2011
By
Principal Components Analysis with "R" (Part: 001)

This is the first "post" of my new adventure with a software that I consider very interesting and that give to people the oportunity to work with Chemometrics ("R" is free).To follow these examples, yo can download the following article:"Multivariate S...

Read more »

R is not C

December 7, 2011
By

I keep trying to write R code like it was C code. It is a habit I’m trying to break myself of. For example, the other day I need to construct a model matrix of 1′s and 0′s in the … Continue reading →

Read more »

Call for Proposals for The R Series from Chapman & Hall/CRC

December 7, 2011
By

(This is a guest post based on Calver, Rob e-mail to the R-help mailing list) We are delighted to announce that our new series of books on R is up and running, with two books already published and another nine forthcoming (including three set to publish in 2012). We are keen to receive proposals for books covering all aspects of...

Read more »

Improved Moving Average Code is available for download!

December 7, 2011
By
Improved Moving Average Code is available for download!

Quantitative Finance, Technical Trading & Analysis. Fotis Papailias, Dimitrios Thomakos Fotis Quantitative Finance & Technical Trading Improved Moving Average Code is available for download!Hi everyone, in the last few days we have received great feedback from you. Due to increasing demand we have made a short version of the original code available for personal use. Please let us...

Read more »

World Since June 2008

December 6, 2011
By
World Since June 2008

For a client meeting, I struggled with how best to illustrate world markets since June 2008.  I used R to produce this, but I’m still not completely satisfied. Anyone have suggestions to improve? From TimelyPortfolio What I thought was inte...

Read more »

Using R — Basic error Handing with tryCatch()

December 6, 2011
By

The R language definition section on Exception Handling describes a very few basics about exceptions in R but is of little use to anyone trying to write robust code that can recover gracefully in the face of errors. In fact, …   read mo...

Read more »

My oh my

December 6, 2011
By
My oh my

Noted without comment, visit Biostatistics Ryan Gosling !!! for more gems like the one above.

Read more »

Judging panel for Applications of R in Business contest

December 6, 2011
By

Now that the final submissions are in for the Applications of R in Business contest, it's time to announce the judging panel who will select the winners for $20,000 in cash prizes from Revolution Analytics. They are: Edd Dumbill, technologist and writer for O'Reilly Media David Menninger, VP & Research Director at Ventana Research Steve Miller, technology writer and...

Read more »

Displaying german stock performance with R using ggplot2

December 6, 2011
By
Displaying german stock performance with R using ggplot2

I cannot follow stock market developments daily, so I was looking for a quick overview of what had happened in the last week. What would be of interest for me is  “How did German stocks perform over the last 5 days, compared to the last 20 trading days and the last 250 trading days”. R

Read more »

An example RNA-Seq Quality Control and Analysis Workflow

December 6, 2011
By

I found the slides below on the education page from Bioinformatics & Research Computing at the Whitehead Institute. The first set (PDF) gives an overview of the methods and software available for quality assessment of microarray and RNA-seq experim...

Read more »

Visualization of Prosper.com’s Loan Data Part I of II – Compare and Contrast with Lending Club

December 6, 2011
By
Visualization of Prosper.com’s Loan Data Part I of II – Compare and Contrast with Lending Club

Due to the positive feedback received on this post I thought I would re-create the analysis on another peer-to-peer lending dataset, courtesy of Prosper.com. You can access the Prosper Marketplace data via an API or by simply downloading XML files that are updated nightly http://www.prosper.com/tools/. If you are going to follow the route I took

Read more »

Example 9.17: (much) better pairs plots

December 6, 2011
By
Example 9.17: (much) better pairs plots

Pairs plots (section 5.1.17) are a useful way of displaying the pairwise relations between variables in a dataset. But the default display is unsatisfactory when the variables aren't all continuous. In this entry, we discuss ways to improve these dis...

Read more »

The new visualization package for genome data in Bioconductor: ggbio

December 6, 2011
By
The new visualization package for genome data in Bioconductor: ggbio

It’s been a while since I’ve been waiting for the release of a visualization package in Bioconductor. Back in 2008 I was really impressed by the power of GenomeGraphs and I have used it in multiple occasions. Yet from both the Bioconductor Developer Meeting in Heidelberg 2010 and BioC2011 I’ve been waiting for the release of...

Read more »

RcppArmadillo 0.2.32

December 5, 2011
By

Conrad Sanderson made the official 2.4.0 release of Armadillo last evening. This was wrapped into RcppArmadillo release 0.2.32 which arrived on CRAN earlier today. The NEWS entries summarising the changes since the 2.2.* series, we already saw mos...

Read more »

Multi-Asset Backtest : Rotational Trading Strategies

December 5, 2011
By
Multi-Asset Backtest : Rotational Trading Strategies

I want to discuss the implementation of Rotational Trading Strategies using the backtesting library in the Systematic Investor Toolbox.The Rotational Trading strategy switches investment allocations throughout the time, betting on few top ranked assets. For example, the ranking can be based on relative strength or momentum. A few examples of the Rotational Trading Strategies (or

Read more »

My Favorite Graphs

December 5, 2011
By
My Favorite Graphs

The important criterion for a graph is not simply how fast we can see a result; rather it is whether through the use of the graph we can see something that would have been harder to see otherwise or that could not have been seen at all. – William Cleveland, The Elements of Graphing Data, Related posts:

Read more »

Using R — Standalone Scripts & Error Messages

December 5, 2011
By

Open-source R is an amazing tool for statistical analysis and data visualization. Serious R gurus have found ways to do just about anything entirely within the R environment. Nevertheless, there are many of us who wish to plug R into …  ...

Read more »

Loading OSU’s VGPM ocean productivity data in R

December 5, 2011
By

Oregon State University makes a set of ocean productivity data derived from satellite data available for download and use by researchers. The Ocean Productivity website explains the available data and how it was derived. I have put together a few R fun...

Read more »

PCA file calculation with "R".

December 5, 2011
By
PCA file calculation with "R".

X es la matriz centrada (X is the centered matrix). Xcov es la matriz de covarianzas de X (Xcov is the covariance matrix of X).Con la función "eigen" calculamos los "eigenvectors" y "eigenvalues" de Xcov.(With the function "eigen" we calculate the "ei...

Read more »

Decimal to Binary in "R"

December 5, 2011
By
Decimal to Binary in "R"

Lately...I've been learning "R"...that weird programming language aimed for Statistics and Statistical programming...and I really like it...so as usual, I needed to create my own Decimal to Binary application -;)binary bsum bexp while (p_number > 0) { digit p_number bsum ...

Read more »

Vote Compass: visualizing Canadian poll results with R

December 5, 2011
By
Vote Compass: visualizing Canadian poll results with R

Vote Compass is an online "electoral literacy application, whose goal is to encourage engagement with and stimulate discussion around the policy platforms of Canada's political parties. In the lead-up to the 2011 Canadian election, Vote Compass collected the results of an on-line 10-minute survey from more than 2 million participants, and used the results to align voters with the...

Read more »

International Open Data Hackathon

December 5, 2011
By
International Open Data Hackathon

This past Saturday, I hung out at the Seattle branch of the International Open Data Hackathon. The event was hosted at the Pioneer Square office of Socrata, a small company that helps governments provide public open data. A pair of data analysts from Tableau were showing off a visualization for the Washington...

Read more »

A pure R poker hand evaluator

December 5, 2011
By
A pure R poker hand evaluator

There's already a lot of great posts out there about poker hand evaluators, so I'll keep this short.  Kenneth J. Shackleton recently released a very slick 5-card and 7-card poker hand evaluator called SpecialK.  This evaluator is li...

Read more »

From datasets to algorithms in R

December 5, 2011
By

Many statistical algorithms are taught and implemented in terms of linear algebra. Statistical packages often borrow heavily from optimized linear algebra libraries such as LINPACK, LAPACK, or BLAS. When implementing these algorithms in systems such as Octave or MATLAB, it is up to you to translate the data from the use case terms (factors, categories, numerical variables)...

Read more »

R-bloggers

December 5, 2011
By

For a long time, I have relied on R-bloggers for new, interesting, arcane, and all around useful information related to R and statistics. Now my R-related material is appearing there. If you use the R package at all, R-bloggers should be in your feed a...

Read more »

The Art of R Programming – my two cents

December 5, 2011
By
The Art of R Programming – my two cents

What makes this book different from other books about R is stated clearly by the author Norman Matloff in the introduction: "This book is not a compendium of the myriad types of statistical methods that are available in the wonderful R package. It r...

Read more »

The volatility mystery continues

December 5, 2011
By
The volatility mystery continues

How do volatility estimates based on monthly versus daily returns differ? Previously The post “The mystery of volatility estimates from daily versus monthly returns” and its offspring “Another look at autocorrelation in the S&P 500″ discussed what appears to be an anomaly in the estimation of volatility from daily versus monthly data. In recent times … Continue reading...

Read more »