Mapping US Radiation Levels in R

May 8, 2012
By
Mapping US Radiation Levels in R

I have posted previously about the open data available on Socrata (https://opendata.socrata.com/), and I was looking at the site again today when I stumbled upon a listing of levels of various radioactive isotopes by US city and state. The data is available at https://opendata.socrata.com/Government/Sorted-RadNet-Laboratory-Analysis/w9fb-tgv6 . You will need to click export, and then download it as a csv. ...

Read more »

Learn formatR in Two Minutes

May 8, 2012
By

Anthony made a video tutorial on how to use the formatR package, which I think is pretty cool: I wish I could speak English as fast as him...

Read more »

Loading and/or Installing Packages Programmatically

May 7, 2012
By

In R, the traditional way to load packages can sometimes lead to situations where several lines of code need to be written just to load packages. These lines can cause errors if the packages are not installed, and can also be hard to maintain, particularly during deployment. Fortunately, there is a way to create a function in R that...

Read more »

Cross Sectional Correlation

May 7, 2012
By
Cross Sectional Correlation

Diversification is hard to find nowadays because financial markets are becoming increasingly correlated. I found a good visually presentation of Cross Sectional Correlation of stocks in the S&P 500 index in the Trading correlation by D. Varadi and C. Rittenhouse article. Let’s compute and plot the average correlation among stocks in the S&P 500 index

Read more »

The hockeystick revisited

May 7, 2012
By
The hockeystick revisited

Previous posts: Correlation of temperature proxies with observations The “best” proxies for temperature reconstruction Okay, I couldn’t resist. I wanted to provide some more in depth analysis of temperature proxies, but I just went ahead and did my own little reconstruction of Northern hemisphere annual average temperatures over the past millenium using McShane et al.‘s

Read more »

relevant, revised, & resubmitted

May 7, 2012
By
relevant, revised, & resubmitted

We have now completed our revision of the paper Relevant statistics for Bayesian model choice, written with Judith Rousseau, Jean-Michel Marin, and Natesh Pillai. It has been resubmitted to Series B and reposted on arXiv. The major change in the paper is the inclusion of a check about the relevance of a given summary statistics,

Read more »

useR! 2012 – DEADLINE FAST APPROACHING!

May 7, 2012
By
useR! 2012 – DEADLINE FAST APPROACHING!

DEADLINE FAST APPROACHING – 8th Annual International R User Conference useR! 2012, Nashville, Tennessee USA Registration Deadlines: Early Registration: Passed Regular Registration: Mar 1- May 12 Late Registration: May 13 – June 4 On-Site Registration: June 12 – June 15 Please note: Nashville is offering several large entertainment events the month of June, and hotels are quickly selling out....

Read more »

Thursday: Tweet-chat on Multiple Sclerosis research

May 7, 2012
By

The story about the great work that SUNY Buffalo has been doing to find a cure for Multiple Sclerosis with Revolution R Enterprise and IBM Netezza has generated a lot of attention, with stories in Forbes, InformationWeek and eWeek (amongst others). To continue the discussion, IBM has put together a panel for a "Tweet Chat" on Thursday (May 10)...

Read more »

Multiplicative effects in sensory panel data

May 7, 2012
By
Multiplicative effects in sensory panel data

In a previous post I used JAGS to build the Bayesian equivalent of a two-way ANOVA. Effects were determined of products, panelists and their interaction. In this post this model will be rebuild to provide a more simplified and advanced model. The inter...

Read more »

A sociologist converts from Stata to R

May 7, 2012
By

Ph.D candidate in sociology Ethan Fosse just switched from Stata to doing 100% of his analysis with R. His reasons? If you want to do Bayesian analysis or graph modeled coefficients (or work with complex data structures more generally), then R is much easier than Stata due to the object-oriented programming environment. It's unbelievably liberating to be able to...

Read more »

“Introduction to R” public course

May 7, 2012
By

Milano R net, in collaboration with Quantide, organizes a public course of introduction to R. When: June 7-8, 2012 Where: Hotel Michelangelo; Milano (near Central Railway Station) Further details

Read more »

Example 9.30: addressing multiple comparisons

May 7, 2012
By
Example 9.30: addressing multiple comparisons

We've been more sensitive to accounting for multiple comparisons recently, in part due to work that Nick and colleagues published on the topic. In this entry, we consider results from a randomized trial (Kypri et al., 2009) to reduce problem drinking ...

Read more »

Second Milano R net meeting to be held in September 27, 2012

May 7, 2012
By

We are organising the second Milano R net meeting in September 27, 2012.

Read more »

Bivariate linear mixed models using ASReml-R with multiple cores

May 7, 2012
By
Bivariate linear mixed models using ASReml-R with multiple cores

A while ago I wanted to run a quantitative genetic analysis where the performance of genotypes in each site was considered as a different trait. If you think about it, with 70 sites and thousands of genotypes one is trying … Continue reading →

Read more »

PubMed publications in 2011 by 202 world countries: who’s the winner?

May 7, 2012
By
PubMed publications in 2011 by 202 world countries: who’s the winner?

Which country had the most PubMed citations in 2011? To find out I used R statistical software to analyze the affiliation of 986 427 articles.

Read more »

Waterfall charts in style of The Economist with R

May 7, 2012
By
Waterfall charts in style of The Economist with R

Waterfall charts are sometimes quite helpful to illustrate the various moving parts in financial data, in particular where I have positive and negative values like a profit and loss statement (P&L). However, they can be a bit of a pain to produce in Ex...

Read more »

Presentations of the first Milano R net meeting

May 7, 2012
By

Welcome presentation Andrea Spanò, Partner at Quantide (download PDF, 0.2 MB) Past, present and future of R Stefano Iacus, Department of Economics, Business and Statistics, University of Milan and R Development Core Team (download PDF, 7.3 MB) R and Operational … Continue reading →

Read more »

Portfolio diversity

May 7, 2012
By
Portfolio diversity

How many baskets are your eggs in? Meucci diversity Attilio Meucci directly addresses the adage: Don’t put all your eggs in one basket. His idea is to think of your portfolio as a set of  subportfolios that are each uncorrelated with the rest.  If your portfolio can be configured to have a lot of roughly … Continue reading...

Read more »

playitbyr 0.2-1: data through sound, now with layers, facets, and more pleasure

May 6, 2012
By

The recent Sonification Handbook has a chapter devoted to exploratory data analysis using sound. With some help from Sam Ferguson, one of the chapter's authors, I've made it easy to implement those techniques using R with playitbyr.

Read more »

clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters

May 6, 2012
By

Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently,...

Read more »

ninety two-minute r tutorial videos

May 6, 2012
By

ninety two-minute r tutorial videos now posted.  three hours of ultra-concentrated r.new videos include topics like:setting your preferences (options) (Rprofile.site) random number generation (runif) (rnorm) (set.seed) (rbinom)creating and accessi...

Read more »

Online resources for handling big data and parallel computing in R

May 6, 2012
By
Online resources for handling big data and parallel computing in R

by Yanchang Zhao, RDataMining.com Compared with many other programming languages, such as C/C++ and Java, R is less efficient and consumes much more memory. Fortunately, there are some packages that enables parallel computing in R and also packages for processing … Continue reading →

Read more »

Improving script_001: “Monitor”

May 6, 2012
By
Improving script_001: “Monitor”

After having a look to this video: http://www.screenr.com/UxH8 from rtwotutorials, and reading some tutorials, I decided to modified the script from the previous post: Practicing Script with “ R”: Monitor , in order to make it more r...

Read more »

I’ve Converted to R Full-Time

May 6, 2012
By

It's been over four years that I've been using both R and Stata, but as of last week I've become an R convert. For several years I had conducted statistical analyses in R (since many complex models can only be programmed in R), but I used Stata before ...

Read more »

reshape (from base) Explained: Part II

May 5, 2012
By
reshape (from base) Explained: Part II

Part II Explains More Complex  Wide to Long With base reshape  In part I of this base reshape tutorial we went over the basics of reshaping data with reshape.  We learned two rules that help us to be more efficient … Continue reading →

Read more »

RcppArmadillo 0.3.0.3

Two days ago, Conrad Sanderson released another bug-fix version 3.0.3 for the 3.0.0 branch of his excellent Armadillo C++ template library for linear algebra. The new RcppArmadillo release 0.3.0.3 which contains it appeared on CRAN yesterday. Beside ...

Read more »

Making an Animated Heart GIF in R

May 5, 2012
By

Messing around with animations, I created this toy example to make a growing outline of a heart in R. I have put it up as a gist on GitHub and embedded it below. It uses the animation and the ggplot2 packages--both of which make it incredibly simple to...

Read more »

What’s Up with Albert Pujols?

May 5, 2012
By
What’s Up with Albert Pujols?

After signing a huge deal with the Angels, Pujols has been having a really bad year. He hasn't hit a home run this year, breaking a career long streak. So I thought it would be a good idea to use some statistics to tell how good or bad we think Pujols will actually be this year.Coming into the year,...

Read more »

Visualizing tables in ggplot2

May 5, 2012
By
Visualizing tables in ggplot2

Recently I wanted to recreate  assocplot  using  ggplot2. In the end I propose a simple way to visualize data arranged two-way tables using geom_tile.I used Titanic data set as an example combining age and sex dimensi...

Read more »