Workshop on Mixed and Multilevel Modelling with R in Toronto

February 7, 2012
By

Summer Program In Data Analysis (SPIDA): May 24th – June 1st, 2012 In its thirteenth season this year, ISR’s Summer Program in Data Analysis focuses on linear models, beginning with “standard” regression through generalized linear models, and extending to mixed or multilevel models, linear and non-linear and generalized, which incorporate two or more hierarchical levels of data or longitudinal...

Read more »

What’s new in futile.paradigm 2.0.4

February 6, 2012
By
What’s new in futile.paradigm 2.0.4

Well this certainly took a while but the latest installment of my functional dispatching library for R is finally released …Continue reading »

Read more »

how to create a variable with r

February 6, 2012
By

(This article was first published on twotorials by anthony damico, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: twotorials by anthony damico. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

how to do simple arithmetic in r

February 6, 2012
By

(This article was first published on twotorials by anthony damico, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: twotorials by anthony damico. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

More Beautiful Growth of $1 Chart

February 6, 2012
By
More Beautiful Growth of $1 Chart

With all my recent focus on reporting and visualization, you might think that I have the investments all figured out.  Unfortunately, that is not the case, and I will resume more standard investment and systems posts soon.  I did want to shar...

Read more »

The anatomy of a Twitter conversation, visualized with R

February 6, 2012
By
The anatomy of a Twitter conversation, visualized with R

If you're a Twitter user like me, you're probably familiar with the way that conversations can easily by tracked by following the #hashtag that participants include in the tweets to label the topic. But what causes some topics to take off, and others to die on the vine? Does the use of retweets (copying another users tweet to your...

Read more »

General Bayesian estimation using MHadaptive

February 6, 2012
By
General Bayesian estimation using MHadaptive

If you can write the likelihood function for your model, MHadaptive will take care of the rest (ie. all that MCMC business). I wrote this R package to simplify the estimation of posterior distributions of arbitrary models. Here’s how it works: 1) Define your model (ie the likelihood * prior). In this example, lets build

Read more »

Using apply() to create a unique id

February 6, 2012
By
Using apply() to create a unique id

Suppose you have a data set with two identifiers. For example, maybe you're studying the relationships among firms in an industry and you have a way to link the firms to one another. Each firm has an id, but the unique unit in your data set is a pair...

Read more »

An R script for estimating future inflation via the Treasury market

February 6, 2012
By

One factor that is critical for any financial planning is estimating what future inflation will be. For example, if you’re saving money in an instrument that gains 3% per year, and inflation is estimated to be 4% per year, well then you’re losing m...

Read more »

Visualising Activity Around a Twitter Hashtag or Search Term Using R

February 6, 2012
By
Visualising Activity Around a Twitter Hashtag or Search Term Using R

I think one of valid criticisms around a lot of the visualisations I post here and on my various #f1datajunkie blogs is that I often don’t post any explanatory context around the visualisations. This is partly a result of the way I use my blog posts in a selfish way to document the evolution of

Read more »

The US market will absolutely positively definitely go up in 2012

February 6, 2012
By
The US market will absolutely positively definitely go up in 2012

The Super Bowl tells us so. The Super Bowl Indicator The championship of American football decides the direction of the US stock market for  the year.  If a “National” team wins, the market goes up; if an “American” team wins, the market goes down. Yesterday the Giants, a National team, beat the Patriots. The birth … Continue reading...

Read more »

googleVis 0.2.14 is released

February 5, 2012
By
googleVis 0.2.14 is released

Version 0.2.14 of the googleVis package was released on CRAN today.ChangesThe help files have been checked against changes of the Google Visualisation API, typos in the vignette have been ironed out (thanks to Pat Burns for pointing them out), a new se...

Read more »

Comparing correlations update

February 5, 2012
By

I have just published R code for calculating CIs for differences between correlations on the Serious stats book blog. This covers independent correlations (taken from chapter 6 of the book) and dependent correlations (new R code written as a suppl...

Read more »

Comparing correlations: independent and dependent (overlapping or non-overlapping)

February 5, 2012
By
Comparing correlations: independent and dependent (overlapping or non-overlapping)

In Chapter 6 (correlation and covariance) I consider how to construct a confidence interval (CI) for the difference between two independent correlations.  The standard approach uses the Fisher z transformation to deal with boundary effects (the squashing of the distribution and increasing asymmetry as r approaches -1 or 1). As zr is approximately normally distributed

Read more »

Rstudio and asreml working together in a mac

February 5, 2012
By

December and January were crazy months, with a lot of travel and suddenly I found myself in February working in four parallel projects involving quantitative genetics data analyses. (I’ll write about some of them very soon) Anyhow, as I have … Continue reading →

Read more »

RStudio Server part 2: pros of using RStudio server for a remote connection

February 5, 2012
By

After playing around with R studio server for a while, I decided to write a followup to my previous blog post. I want to go over a few of the strong points of using RStudio server to access a remote… See more ›

Read more »

rjags

February 5, 2012
By

Running 64 bit R, JAGS and rjags on EC2 Winbugs and Jags free Item Response Theory from the dot matrix plots of proprietary software and open up a multicoloured world of posterior predictive model checking. Fitting IRT models using brute force is not ...

Read more »

Influential People in the "Big Data" Field

February 4, 2012
By

Yesterday, Haydn Shaughnnessy wrote a piece for Forbes titled, Who are the Top 20 Influencers in Big Data?Fans of R will be delighted to see David Smith of Revolution Analytics up there at number 2!Congratulations!© 2012, David E. Giles

Read more »

Measuring associations between non-numeric variables

Measuring associations between non-numeric variables

It is often useful to know how strongly or weakly two variables are associated: do they vary together or are they essentially unrelated?  In the case of numerical variables, the best-known measure of association is the product-moment correlation coefficient introduced by Karl Pearson at the end of the nineteenth century.  For variables that are ordered but not necessarily numeric...

Read more »

Multiple Factor Model – Building Fundamental Factors

February 4, 2012
By
Multiple Factor Model – Building Fundamental Factors

This is the second post in the series about Multiple Factor Models. I will build on the code presented in the prior post, Multiple Factor Model – Fundamental Data, and I will show how to build Fundamental factors described in the CSFB Alpha Factor Framework. For details of the CSFB Alpha Factor Framework please read

Read more »

Implementing Circles example

February 4, 2012
By
Implementing Circles example

This week I reimplemented part of Conic Sections 1 model from NetLogo. In the model turtles seek to to be in target distance from center.My code takes only one center point, so only circles can be obtained. Apart from turtle location plot giv...

Read more »

Berlin’s children

February 4, 2012
By
Berlin’s children

Few years ago, a newspaper claimed the block I live in — Prenzlauer Berg in Berlin — is the most fertile region in Europe. It was a hoax, as this (German) newspaper article points out. (The article has become quite famous because it coined the term Bionade Biedermeier to describe the life style in this area.)However,...

Read more »

Coming R meetings in Paris

February 4, 2012
By
Coming R meetings in Paris

If you live in Paris and are interested in R, there will be two meetings for you this week. First a Semin-R session, organized at the Muséum National d’Histoire Naturelle on Tuesday 7 Feb (too bad, the Museum is closed on Tuesdays). Presentations will be about colors, phylogenies and maps, while I will speak about

Read more »

"R": PLS Regression (Gasoline) – 003

February 3, 2012
By
"R": PLS Regression  (Gasoline) – 003

The gasoline data set has the spectra of 60 samples acquired by diffuse reflectance from 900 to 1700 nm. We saw how to plot the spectra in the previous post.Now, following the tutorial of Bjorn-Helge Mevik published in "R-News Volume 6/3, August 2006", we will do the PLS regression:gas1 <- plsr(octane~NIR, ncomp = 10,data = gasoline, validation...

Read more »

Accelerating analytics at MSU with Revolution R Enterprise

February 3, 2012
By

Erik Sigur, Information Technologist for the Department of Statistics and Probability at Michigan State University, writes at ReadWriteWeb about using Revolution R Enterprise to provide high-performance computation in R to the researchers in his department: Our search for a more effective version of R ultimately brought us to a product called Revolution R Enterprise by Revolution Analytics, which provides...

Read more »

Monty Hall by simulation in R

February 3, 2012
By
Monty Hall by simulation in R

(Almost) every introductory course in probability introduces conditional probability using the famous Monte Hall problem. In a nutshell, the problem is one of deciding on a best strategy in a simple game. In the game, the contestant is asked to select one of three doors. Behind one of the doors is a great prize (free

Read more »

Forbes: Top 20 influencers in Big Data

February 3, 2012
By

Haydn Shaughnessy at The Forbes blog provides a list of the "Top 20 Influencers in Big Data", and I'm humbled to report that yours truly is listed there at #2. It's an instantaneous ranking based on the social-media tracking tool Traakr, but it's still great to be listed alongside writers for SiliconAngle, GigaOM, and KDNuggets (and even Mashable!). I...

Read more »

New R User Groups in Austin, Adelaide

February 3, 2012
By

It's awesome to see so many local R user groups kicking off in 2011! Yet another is the Austin R User Group in Austin, Texas. They've already held their first informal get-together, and the first formal meeting on February 23 will be devoted to data management techniques in R. Props to Sandy Donlon for organizing the group! And I'm...

Read more »

Why don’t we hear more about Adrian Dantley on ESPN? This graph makes me think he was as good an offensive player as Michael Jordan.

February 3, 2012
By
Why don’t we hear more about Adrian Dantley on ESPN? This graph makes me think he was as good an offensive player as Michael Jordan.

In my last post I complained about efficiency not being discussed enough by NBA announcers and commentators. I pointed out that some of the best scorers have relatively low FG% or TS%. However, via the comments it was pointed out that top scorers need ...

Read more »