The Fun of Error Trapping: R Package Edition

February 25, 2013
By

For the last month or so I’ve been working on an R package to make accessing the Adobe (Omniture) Digital Marketing Suite Reporting API easier.  As part of this development effort, I’m at the point where I’m intentionally introducing errors into my function inputs, trying to guess some of the ways useRs might incorrectly input The Fun of...

Read more »

Plot 3D Wireframes from Morphologika

February 25, 2013
By

Hi morphometricians,A quick update: a function to plot Morphologika 3D wireframes using the package geomorph. This routine (or a version of it) will be included in our next package update under the "read.morphologika" function.Enjoy!Example:library(geo...

Read more »

Exporting nice plots in R

February 25, 2013
By
Exporting nice plots in R

It's not always easy getting the right size. The image is CC by Kristina Gill. A vital part of statistics is producing nice plots, an area where R is outstanding. The graphical ablility of R...

Read more »

Simplify frequency plots with ggplot in R #rstats

February 25, 2013
By
Simplify frequency plots with ggplot in R #rstats

Update March 5th All downloads are now accessible from my script page! Summary This posting shows how to plot frequency plots using the ggplot-package in R. Compared to SPSS standard outputs, you will learn how to create appealing diagrams ready … Weiterlesen →

Read more »

Top 6 reasons you need to be using RStudio

February 25, 2013
By
Top 6 reasons you need to be using RStudio

Rstudio provides you with tools to make your work more productive.New to R? -  check out these resources for getting staRted. RStudio is an awesome tool that can help you do your work better and faster.  In technical terms...

Read more »

Revolution Newsletter: February 2013

February 25, 2013
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full February edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Case study: Real-Time Marketing Analytics. Online advertising technology company Exelate uses predictive models to...

Read more »

Course: Statistical Practice in Epidemiology with R

February 25, 2013
By
Course: Statistical Practice in Epidemiology with R

Statistical Practice in Epidemiology with R is a one week course aimed at promoting the use of R among epidemiologists. This annual course has been running almost every year since 2004. This year, the course will take place from 23 … Continue reading →

Read more »

Reproducible research with R, Knitr, Pandoc and Word

February 25, 2013
By

Add references and a style sheet Below I briefly outline why Pandoc is an essential part of my research workflow, and demonstrate how to seamlessly integrate it with a bibliographic system and code written in R to produce high quality word or pdf documents. I also...

Read more »

Portfolio tests of predicted returns

February 25, 2013
By
Portfolio tests of predicted returns

Exploring the quality of predictions using random portfolios and optimization. Previously “Simple tests of predicted returns” showed a few ways to look at expected returns at the asset level.  Here we move to the portfolio level. The previous post focused on correlation.  Win Vector Blog points out that gauging prediction quality using correlation can be … Continue reading...

Read more »

Optimizing a multi-modal function with a two step anneal method.

February 24, 2013
By
Optimizing a multi-modal function with a two step anneal method.

I have been working on a reliable optimization method for this crazy function. f.egg<-function(x,y){ (2+cos(x)+cos(y))/(100+x^2+y^2) } I noticed that if I had a large variance in the random normal generator, the optimizer would jump all over t...

Read more »

Optimizing a multi-modal function with a two step anneal method.

February 24, 2013
By
Optimizing a multi-modal function with a two step anneal method.

I have been working on a reliable optimization method for this crazy function. f.egg<-function(x,y){ (2+cos(x)+cos(y))/(100+x^2+y^2) } I noticed that if I had a large variance in the random normal generator, the optimizer would jump all over t...

Read more »

MLB runs allowed by team

February 24, 2013
By

Or, How good were the Maddux/Glavine-era Braves?In this on-going series of posts about run scoring in Major League Baseball, for this installment I'll turn the equation around and look at runs allowed.  In order to account for the changing run scoring environments, the runs allowed by individual teams is compared to the league average for that season,...

Read more »

R Bootcamp Materials!

February 24, 2013
By
R Bootcamp Materials!

Learn about ColoRs in R!Analyze model results with custom functions.Good and Bad GraphicsTo train new employees at the Wisconsin Department of Public Instruction, I have developed a 2-3 day series of training modules on how to get work done in R. These...

Read more »

Rcpp class in Sat 9 Mar in NYC

February 24, 2013
By

Join Dirk Eddelbuettel for six hours of detailed and hands-on instructions and discussions around Rcpp, RInside, RcppArmadillo, RcppGSL and other packages . . . Rcpp has become the most widely-used language extension for R. Currently deployed by 103 CRAN packages and a further 10 BioConductor packages, it permits users and developers to pass “whole R The post Rcpp...

Read more »

Ten Things the Emacs Social Science Starter Kit gives you

February 24, 2013
By

I recently made some updates to the Emacs Social Science Starter Kit. I maintain the SSSK for my own convenience, but other people have found it useful as well. By now there are a lot of little bits and pieces in the kit, so I thought it might be usefu...

Read more »

Ten Things the Emacs Social Science Starter Kit gives you

February 24, 2013
By

I recently made some updates to the Emacs Social Science Starter Kit. I maintain the SSSK for my own convenience, but other people have found it useful as well. By now there are a lot of little bits and pieces in the kit, so I thought it might be usefu...

Read more »

the BUGS Book [guest post]

February 24, 2013
By
the BUGS Book [guest post]

(My colleague Jean-Louis Fouley, now at I3M, Montpellier, kindly agreed to write a review on the BUGS book for CHANCE. Here is the review, en avant-première! Watch out, it is fairly long and exhaustive! References will be available in the published version. The additions of book covers with BUGS in the title and of the corresponding

Read more »

Dynamic community occupancy modeling with R and JAGS

February 24, 2013
By
Dynamic community occupancy modeling with R and JAGS

This post is intended to provide a simple example of how to construct and make inferences on a multi-species multi-year occupancy model using R, JAGS, and the ‘rjags’ package. This is not intended to be a standalone tutorial on dynamic community occupancy modeling. Useful primary literature references include MacKenzie et al. (2002), Kery and Royle (2007), Royle and Kery...

Read more »

Workflow w/ reports package

February 24, 2013
By
Workflow w/ reports package

NOTE: THIS IS NOW A PACKAGE SEE THIS LINK FOR DETAILS Let me start with a video for people who just want to see what I’m demo-ing first: I’ve been interested in speeding up workflow lately and spending a lot … Continue reading →

Read more »

Simple Count Probability

February 24, 2013
By
Simple Count Probability

Data can take the form of counts: Compliments or complaints received Items returned Number of E. coli cases Data can also be expressed in rates: Percent of web traffic from a user permissions type Percent of businesses in a region passing a safety audit A random variable X has the Poisson distribution with parameter lambda

Read more »

Major update to the R-package geomorph

February 24, 2013
By

Hi Folks,We have just completed a major update to the R-package geomorph:software for geometric morphometric analyses in R.  Included are several new functions to  carry out additional GM analyses, as well as enhancements of existing function...

Read more »

Copying Data from Excel to R and Back

February 24, 2013
By
Copying Data from Excel to R and Back

A lot of times we are given a data set in Excel format and we want to run a quick analysis using R's functionality to look at advanced statistics or make better visualizations. There are packages for importing/exporting data from/to Excel, but I have f...

Read more »

Large correlation in parallel

February 24, 2013
By
Large correlation in parallel

A little improvement to the bigcor function proposed on Rmazing to compute huge correlation matrix in R, I made the function work in parallel using all the CPU cores available on the machine. The code is here.Here is a benchmark of the 2 func...

Read more »

The Wisdom of Crowds – Clustering Using Evidence Accumulation Clustering (EAC)

February 24, 2013
By
The Wisdom of Crowds – Clustering Using Evidence Accumulation Clustering (EAC)

Today’s blog post is about a problem known by most of the people using cluster algorithms on datasets without given true labels (unsupervised learning). The challenge here is the “freedom of choice” over a broad range of different cluster algorithms and how to determine the right parameter values. The difficulty is the following: Every clustering algorithm and even...

Read more »

Earthquakes in Netherlands

February 24, 2013
By
Earthquakes in Netherlands

In the Netherlands we have Natural Gas. Unfortunately winning this gas seems to cause some quakes. As quakes go, they are not strong. However, our buildings are not made to resist quakes, before 1986 they were unheard of, so there is some damage. It is now predicted they could get stronger and more frequent. This caused a bit of a...

Read more »

Simplify your R workflow with functions #rstats

February 24, 2013
By
Simplify your R workflow with functions #rstats

Update/ Thanks to Bernd I could improve the function of how to import the data, so here’s the updated script! /Update In R, you often may have scripts or code snippets that will be reused. In such cases, you can … Weiterlesen →

Read more »

Multi-species dynamic occupancy model with R and JAGS

February 24, 2013
By
Multi-species dynamic occupancy model with R and JAGS

This post is intended to provide a simple example of how to construct and make inferences on a multi-species multi-year occupancy model using R, JAGS, and the ‘rjags’ package. This is not intended to be a standalone tutorial on dynamic community occupancy modeling. Useful primary literature references include MacKenzie et al. (2002), Kery and Royle (2007), Royle and Kery...

Read more »

Copying Data from Excel to R and Back

February 23, 2013
By
Copying Data from Excel to R and Back

A lot of times we are given a data set in Excel format and we want to run a quick analysis using R's functionality to look at advanced statistics or make better visualizations. There are packages for importing/exporting data from/to Excel, but I have found them to be hard to work with or only work with old versions of...

Read more »

Pareto plot with ggplot2

February 23, 2013
By

A Pareto chart, named after Vilfredo Pareto, is a type of chart that contains both bars and a line graph, where individual values are represented in descending order by bars, and the cumulative total is represented by the line (quoted from Wikipedia). ...

Read more »

Sponsors