December 2011 issue of the R Journal: An overview

December 20, 2011
By
December 2011 issue of the R Journal: An overview

The December 2011 issue of the R Journal is now available for download. Three times a year, the open-access journal of the R project publishes peer-reviewed articles on research and applications of R and R packages. As of the latest issue, all articles are published under a Creative Commons license, making them accessible for translation, academic and commercial uses...

Read more »

Assessing Model Fit Through Simulation

December 20, 2011
By
Assessing Model Fit Through Simulation

In the area of political science where I am active (european politics) it is very common to simply estimate a model, and start drawing inferences immediately. This is a shame as drawing inferences from a model without assessing how well it fits the dat...

Read more »

Fast paced food-web plotting action

December 20, 2011
By
Fast paced food-web plotting action

A simple foodwebOne of my interests is in food web topology and food web dynamics. My research in experimental ponds has left me with 2430 food webs to make sense of, and one way to facilitate understanding that many networks, and really any food web, is through visualization. There are...

Read more »

Pairs Trading Issues

December 20, 2011
By
Pairs Trading Issues

(This article was first published on Eran Raviv » R, and kindly contributed to R-bloggers) A few words for those of you who are not familiar with the “pairs trading” concept. First you should understand that the movement of every stock is dominated not by the companies performance but by the general market movement. This is the origin of...

Read more »

Tutorial on SPARQL Package for R

December 19, 2011
By
Tutorial on SPARQL Package for R

Tools are major enablers of Linked Science. One crucial aspect is how to access and analyze data, and especially how to get only that part of data which is of interest for a given research question.  Linked Data solves the … Continue reading →

Read more »

Rotational Trading Strategies: borrowing ideas from Engineering Returns

December 19, 2011
By
Rotational Trading Strategies: borrowing ideas from Engineering Returns

Frank Hassler at Engineering Returns blog wrote an excellent article Rotational Trading: how to reduce trades and improve returns. The article presents four methods to reduce trades: Trade less frequently. I.e. weekly instead of daily rebalancing. Different criteria for enter / exit a trade. Smooth the rank over the last couple of bars. Combination of

Read more »

Visualizing ChaLearn Gestures Test Data

December 19, 2011
By
Visualizing ChaLearn Gestures Test Data

The colored paths are labeled training data, just like in my last post on this.The title gives the "answer" for a test video:Could you tell from just this what the sequence of gestures was?Not perfectly, but way better than chance.See a couple more exa...

Read more »

RTextTools v1.3.2 Released

RTextTools was updated to version 1.3.2 today, adding support for n-gram token analysis, a faster maximum entropy algorithm, and numerous bug fixes. The source code has been synced with the Google Code repository, so please feel free to check out a copy and add your own features!With the core feature set of RTextTools finalized, the next major releas

Read more »

Blog Statistics with StatCounter & R

December 19, 2011
By
Blog Statistics with StatCounter & R

If you're interested in analysing your blog's statistics this can easily be done with a web-service like StatCounter (free, only registration needed, quite extensive service) and with R.After implementing the StatCounter script in the html code of a we...

Read more »

IRIS Flower Data Set (R-003)

December 19, 2011
By
IRIS Flower Data Set (R-003)

Centramos la matriz con el comando, generando a partir de A una nueva matriz que llamamos "Acentered"Acentered=scale(A,center=T)Ahora con la función "eigen":Esta es otra forma de proceder con el cálculo de los componentes principales (eigenvectors y ...

Read more »

Portfolio Optimization in R, Part 2

December 19, 2011
By
Portfolio Optimization in R, Part 2

In the previous post, we built the efficient frontier of a portfolio of bonds. The next logical step is to find the super efficient (or market) portfolio holdings.  If you are unfamiliar with the concept, take a second and read the section section...

Read more »

Submit a paper to the R/Finance conference

December 19, 2011
By

For anybody using the R language to analyze financial data, the R/Finance conference is the conference of the year. If you have something to share about applied finance with R, the call for papers is now open. The details are below, and the deadline for submissions is January 31, 2012. R/Finance 2012: Applied Finance with R May 11 and...

Read more »

Maximal Information Coefficient (MIC)

December 19, 2011
By
Maximal Information Coefficient (MIC)

Pearson r correlation coefficients for various distributions of paired data (Credit: Denis Boigelot, Wikimedia Commons)A paper published this week in Science outlines a new statistic called the maximal information coefficient (MIC), which is able to equally describe the correlation between paired variables regardless of linear or nonlinear relationship. In...

Read more »

The R Journal (Volume 3/2, December 2011) is out

December 19, 2011
By

The new R journal for December 2011 is out! You can Download the complete issue from here, while refereed articles may be downloaded individually using the links below: Table of Contents Editorial 3   Contributed Research Articles   Creating and Deploying an Application with (R)Excel and R  Thomas Baier, Erich Neuwirth and Michele De Meo 5 glm2: Fitting Generalized Linear Models...

Read more »

Christmas Gift to the R Community: The R Journal!

December 19, 2011
By
Christmas Gift to the R Community: The R Journal!

The R Journal Volume 3/2 is available! Get it from here.

Read more »

Spatial Data with R

December 19, 2011
By
Spatial Data with R

On September 14th 2011 Dr Alec Stephenson gave a talk on exploring spatial data with R (see Meetup page). The video of the talk is now available online. The talk provides a non-mathematical and entirely equation-free talk on visualizing and … Continue reading →

Read more »

data.frame objects in R (via “R in Action”)

December 18, 2011
By
data.frame objects in R (via “R in Action”)

The followings introductory post is intended for new users of R.  It deals with R data frames: what they are, and how to create, view, and update them. This is a guest article by Dr. Robert I. Kabacoff, the founder of (one of) the first online R tutorials websites: Quick-R.  Kabacoff has recently published the book ”R Read more...

Read more »

Portfolio Optimization in R, Part 1

December 17, 2011
By
Portfolio Optimization in R, Part 1

I briefly mentioned in my last post; that I was fooling around with portfolio optimization in R.  This post will the first in a series on the topic of portfolio optimization. Please note, nothing I am about to say should be taken as advice for investing.  These results are based on prior observed returns and the future...

Read more »

Function to Collect Geographic Coordinates for IP-Addresses

December 17, 2011
By
Function to Collect Geographic Coordinates for IP-Addresses

I added the function IPtoXY to theBioBucket-Archives which collects geographic coordinates for IP-addresses.It uses a web-service at http://www.datasciencetoolkit.org// and works with the base R-packages. # System time to collect coordinates of 100 IP-...

Read more »

IRIS Flower Data Set (R-002)

December 17, 2011
By
IRIS Flower Data Set (R-002)

Ver  primero: IRIS Flower Data Set (R-001)See first:        IRIS Flower Data Set (R-001)El comando "summary" nos ayuda a comprender la importancia de cada componente principal:Los "eigenvalues" son las desviacion...

Read more »

IRIS Flower Data Set (R-001)

December 17, 2011
By
IRIS Flower Data Set (R-001)

IRIS Flower Data SetEste es el Link a Wikipedia donde podéis encontrar los datos que utilizó Fisher en su trabajo de 1936. Ya hemos trabajado con estos datos en Excel y los continuaremos usando en nuevas entradas.En este link, podemos ver las fotos de las flores (IRIS en castellano son lírios).Represento como LS (longitud del sépalo), AS...

Read more »

Ripley on model selection, and some links on exploratory model analysis

December 17, 2011
By

This is really fun. I love how Ripley thinks, with just about every concept considered in broad generality while being connected to real-data examples. He’s a great statistical storyteller as well. . . . and Wickham on exploratory model analysis I came across Ripley’s slides in a reference from Hadley Wickham’s article on exploratory model The post Ripley...

Read more »

cRazydays 2012 with ggplot2

December 17, 2011
By
cRazydays 2012 with ggplot2

Season’s Greetings Hi, dear R-bloggers and its readers. Here in Japan it’s very cold now. The end

Read more »

knitr: nice alternative for Sweave

December 17, 2011
By

I recently discovered knitr for dynamic report generation in R. It seems like a very powerful alternative to Sweave. Particularly, I am interested in png graphic device support (it supports more than 20 graphic devices) and R code formatting.Check it o...

Read more »

semi-automatic ABC

December 17, 2011
By
semi-automatic ABC

The talk of Wednesday afternoon Ordinary Meeting of the Royal Statistical Society went on quite well, I think. I would have expected a few people (in general) and some specific people (in particular) but this being the last week of term the schedule was not the best of times. Paul Fearnhead gave the talk, insisting

Read more »

ai-class.com vs ml-class.com

December 16, 2011
By
ai-class.com vs ml-class.com

For those who did not know, Stanford university offered free off charge 3 courses at beginning of the autumn. It is kind of shocking – US based institution offers education for free! Take any socialism oriented country and one of the promises is education for free. But it seems, that the argument loosing the power – Stanford,

Read more »

Poor, Poor Hillary

December 16, 2011
By
Poor, Poor Hillary

This will be the last baby name related post but this came out of part two web scrapping post last month. I was looking for the fastest rising Names. I flip the logic and looked for the fastest declining names in relative popularity. Out of that exerci...

Read more »

Lattice Explore Bonds

December 16, 2011
By
Lattice Explore Bonds

Since my fifth most popular post has been Bond Market as a Casino Game Part 1, I thought I would use Vanguard Total US Bond Market mutual fund (VBMFX) monthly returns to build our skills in the lattice R package and help visualize the unbelievable run ...

Read more »

A quick primer on split-apply-combine problems

December 16, 2011
By
A quick primer on split-apply-combine problems

I’ve just answered my hundred billionth question on Stack Overflow that goes something like I want to calculate some statistic for lots of different groups. Although these questions provide a steady stream of easy points, its such a common and basic data analysis concept that I thought it would be useful to have a document

Read more »