Rcpp Workshop slides

April 29, 2011
By

Dirk and I gave a full day Rcpp workshop yesterday in Chicago before the R in Finance conference. The pdfs of the slides are available here: part 1 (intro), part 2 (details), part 3 (modules) and part 4 (applications)

Read more »

Rcpp Workshop slides

April 29, 2011
By

Dirk and I gave a full day Rcpp workshop yesterday in Chicago before the R in Finance conference. The pdfs of the slides are available here: part 1 (intro), part 2 (details), part 3 (modules) and part 4 (applications)

Read more »

Parallelizing and cross-validating feature selection in R

April 29, 2011
By
Parallelizing and cross-validating feature selection in R

This is an example piece of code for the Overfitting competition at kaggle.com. This method has an AUC score of ~.91, which is currently good enough for about 38th place on the leaderboard. If you read the completion forums closely, you will find code...

Read more »

Gartner: Revolution Analytics a "Cool Vendor" for BI

April 29, 2011
By

Leading analyst firm Gartner has just published its "Cool Vendors in Analytics and Business Intelligence" report for 2011 (download it here if you have a Gartner subscription). In the report, Revolution Analytics is named a Gartner Cool Vendor, and recognizes the company as "innovative, impactful and intriguing": Driven in part by the rise of big data, business intelligence (BI)...

Read more »

RStudio is good for you

April 29, 2011
By
RStudio is good for you

I was recently introduced to RStudio, a new integrated development environment for R, it is just amazing! It is free, and open, compatible with PC/Mac/Linux OSs. You can also choose to run it in the cloud, and access it from your favorite web browser. As you can see, the window divides into four in a

Read more »

Example 8.36: Quadratic equation with real roots

April 29, 2011
By
Example 8.36: Quadratic equation with real roots

We often simulate data in SAS or R to confirm analytical results. For example, consider the following problem from the excellent text by Rice:Let U1, U2, and U3 be independent random variables uniform on . What is the probability that the roots...

Read more »

Slides from Rcpp workshop / master class yesterday

April 29, 2011
By

Romain and I just posted our slides from yesterday's Rcpp workshop and class (preceding the now-ongoing R/Finance conference). You can access the slides via my presentation page, or directly from here as Part 1 (Introduction), Part 2 (Details), Part ...

Read more »

Forming Formulas

April 29, 2011
By
Forming Formulas

One of the first functions a new R user learns how to use is the lm() command, which involves stating the model formula.lm(y~x1+x2, data=mydata)After a while, this just becomes a natural way to say "I want a regression of y on x1 and x2 using mydata." ...

Read more »

Forming Formulas

April 29, 2011
By
Forming Formulas

One of the first functions a new R user learns how to use is the lm() command, which involves stating the model formula.lm(y~x1+x2, data=mydata)After a while, this just becomes a natural way to say "I want a regression of y on x1 and x2 using mydata." ...

Read more »

RStudio

April 29, 2011
By

As has been discussed on various blogs the RStudio interface to R has been released. It is definitely worth checking out as it has the potential to improve the user experience for R.

Read more »

ggplot2 – First impressions

April 29, 2011
By

I was reading various R blogs and saw very nice looking plots created with ggplot2 package. Especially this blog was useful because of link to a very interesting book about ggplot2. I want to display and update the latest co-integrated pairs every day ...

Read more »

Easy way to get yield curve : what you need is only "FRBData" package !

April 28, 2011
By
Easy way to get yield curve : what you need is only "FRBData" package !

I made FRBData package and registerd it on CRAN.This package allow you to download financial data from FRB's website.This website provide many economical data such as consumer credit, money stock.This article show you how to use this package.(But, it has only a function about interest rate now. I will create other functions to download other macro-economical data in next version.)First,...

Read more »

Easy way to get yield curve : what you need is only "FRBData" package !

April 28, 2011
By
Easy way to get yield curve : what you need is only "FRBData" package !

I made FRBData package and registerd it on CRAN.This package allow you to download financial data from FRB's website.This website provide many economical data such as consumer credit, money stock.This article show you how to use this package.(But, it has only a function about interest rate now. I will create other functions to download other macro-economical data in next version.) First,...

Read more »

Processing nested lists

April 28, 2011
By
Processing nested lists

So perhaps you have all figured this out already, but I was excited to figure out how to finally neatly get all the data frames, lists, vectors, etc. out of a nested list. It is as easy as nesting calls to the apply family of functions, in the case bel...

Read more »

Slightly Different Use of Ralph Vince’s Leverage Space Trading Model

April 28, 2011
By
Slightly Different Use of Ralph Vince’s Leverage Space Trading Model

In honor of the press release Dow Jones Indexes To Develop, Co-Brand Index Family With LSP Partners two days ago, I thought I would show another slightly different use of Ralph Vince’s The Leverage Space Trading Model. Using the R LSPM package, we c...

Read more »

Fit Sigmoid Curve with Confidence Intervals

April 28, 2011
By
Fit Sigmoid Curve with Confidence Intervals

Sigmoid curve fitting for transpiration measurements from porometer at different water potentials (pressure):Read more »

Read more »

Google Summer of Code Student Project Wins Statistical Software Award

April 28, 2011
By

We are happy to report that one of the Google Summer of Code students from last year, Ian Fellows, has been awarded the John M. Chambers Statistical Software Award for his work with the R Project. This award will be presented at the annual Joint Statistical Meeting in August.Ian’s project involved extending...

Read more »

Google Summer of Code Student Project Wins Statistical Software Award

April 28, 2011
By

We are happy to report that one of the Google Summer of Code students from last year, Ian Fellows, has been awarded the John M. Chambers Statistical Software Award for his work with the R Project. This award will be presented at the annual Joint Statistical Meeting in August. Ian’s project involved extending...

Read more »

Importing RDF input in R for analysis

April 28, 2011
By

August last year I asked on BioStar about how to import RDF into the R statistical package and at the time nothing seemed existing. Over the past few weeks I ported code I wrote for Bioclipse to create the rrdf package for R, which is now available fro...

Read more »

Can you do better than cap-weighted equity benchmarks?

April 28, 2011
By

We're on our way to Chicago for the annual conference for R users in Finance, R/Finance 2011. Revolution Analytics is proud to once again sponsor this conference, and during the sponsor lunch session at noon on Saturday, we're honoured to have Guy Yollin show how the big-data capabilities of Revolution R Enterprise can be used for quantitative finance. Here...

Read more »

Data Aggregation in R: plyr, sqldf and data.table

April 28, 2011
By
Data Aggregation in R: plyr, sqldf and data.table

I’ve also previously put up a couple of posts about aggregating data in R. In this post, I’m going to be trying some other alternative methods for aggregating the dataset. Before I begin, I’d like to thank Matthew Dowle for highlighting these to me. It’s a bit daunting at first, deciding which method of aggregating data is

Read more »

Day #30-31 errorbars here, errorbars there

April 28, 2011
By

Today I have been playing with the errorbars from knime. To recreate the plot from http://flyordie.sin.khk.be/2011/04/20/day-27-a-lot-of-graphics-in-one-place/ I had to be able to create 2 y-axis, and multiple plots on 1 graph. At the end of the day I ...

Read more »

Job Search Part 5: It’s Policy Time!

April 27, 2011
By
Job Search Part 5: It’s Policy Time!

This is the last post of this special mini-series on the job search and matching theory of unemployment. I will probably be extremely distracted for the next few months, including a month-long vacation in Europe to shake the horrors of undergrad off me...

Read more »

“Inside” Functors — Multiple Arguments

April 27, 2011
By
“Inside” Functors — Multiple Arguments

Again for HTML reasons this has been taken to http://strugglingthroughproblems.blogspot.com/2011/04/inside-functors-multiple-arguments.html

Read more »

A test of Ledoit-Wolf versus a factor model

April 27, 2011
By
A test of Ledoit-Wolf versus a factor model

Statistical factor models and Ledoit-Wolf shrinkage are competing methods for estimating variance matrices of returns.  So which is better?  This adds a data point for answering that question. Previously There are past blog posts on: the idea of variance matrices factor models of variance The data in this post are from the blog posts: “Weight … Continue reading...

Read more »

How to make 3-D graphics from SAS data

April 27, 2011
By
How to make 3-D graphics from SAS data

The blog SAS Analysis shows how to create 3-D images from SAS data ... using R: Some SAS programmers like to use SAS/IML to call R’s functions . However, it seems that SAS/IML fails to work with the latest versions of R since 2.12 . Others tend to play tricks to call R into SAS’s data step...

Read more »

DST is a b!tch, be careful with POSIX in a stupid timezone

April 27, 2011
By
DST is a b!tch, be careful with POSIX in a stupid timezone

At the department we have been analyzing some transaction data for some time. We got a new dataset with lots of transactions. Once you need interpurchase (IPT) times, posix is quite useful, as you can easily difference transactions to generate IPTs.So ...

Read more »

VideoLectures.net Recommender System Competition

April 27, 2011
By
VideoLectures.net Recommender System Competition

A Guest post to R-bloggers by Bart Blaszczyk. * * * * * * * * This week a new data competition for the best recommendation system begins. Similar in a form to the famous Netflix Prize, asks data scientists, algorithm geeks and statisticians to devise the most accurate algorithm that suggests in personalized way what movies may be of interest for visitors...

Read more »

Day #28 ggplot2 in knime

April 27, 2011
By

If you haven’t read yesterday’s post, I advise you to do so, because this is the fix of yesterday. Day #27: A lot of graphics in one place I found out how to use ggplot2 in knime. Say, for example, your code is this: library(ggplot2) myplot...

Read more »