R is His Friend

May 9, 2013
By

Marcus Beck has a nice (& relatively new) blog called R is My Friend. You can guess that his posts relate to the use of R.I particularly liked his piece on the use of the XML package in R to mine data from the internet; and his post on using the integrate...

Read more »

Trevor Hastie presents glmnet: lasso and elastic-net regularization in R

May 9, 2013
By
Trevor Hastie presents glmnet: lasso and elastic-net regularization in R

by Joseph Rickert Even a casual glance at the R Community Calendar shows an impressive amount of R user group activity throughout the world: 45 events in April and 31 scheduled so far for May. New groups formed last month in Knoxville, Tennessee (The Knoxville R User Group: KRUG) and Sheffield in the UK (The Sheffield R Users). An...

Read more »

Omni test for statistical significance

May 9, 2013
By
Omni test for statistical significance

In survey research, our datasets nearly always comprise variables with mixed measurement levels – in particular, nominal, ordinal and continuous, or in R-speak, unordered factors, ordered factors and numeric variables. Sometimes it is useful to be able to do blanket tests of one set of variables (possibly of mixed level) against another without having to

Read more »

Reading an R file from GitHub

Reading an R file from GitHub

Lets say that I want to read in this R file from GitHub into R. The first thing you have to do is locate the raw file. You can do so by clicking on the Raw button in GitHub. In this case it’s https://raw.github.com/lcolladotor/ballgownR-devel/master/ballgownR/R/infoGene.R One would think that using source() would work, but it doesn’t as shown below: source("https://raw.github.com/lcolladotor/ballgownR-devel/master/ballgownR/R/infoGene.R") ##...

Read more »

Version 0.9 of timeline on CRAN

May 9, 2013
By
Version 0.9 of timeline on CRAN

The initial version of the timeline package has been released to CRAN. This package provides creates timeline plots using ggplot2 in a style similar to Preceden. I would considered this beta quality as there are more features I would like to add but has enough functionality to possibly be useful to others. install.packages('timeline',repos='http://cran.r-project.org') require(timeline) data(ww2) timeline(ww2, ww2.events, event.spots=2, event.label='', event.above=FALSE) Timeline...</p><p><a href=Read more »

A Shiny web app to find out how much medical procedures cost in your state.

May 8, 2013
By
A Shiny web app to find out how much medical procedures cost in your state.

Today the front page of the Huffington Post featured the new data available from the CMS that shows the cost of many popular procedures broken down by hospital. We here at Simply Statistics think you should be able to explore … Continue reading →

Read more »

What’s new in Revolution R Enterprise 6.2 (video)

May 8, 2013
By

If you missed last week's webinar, product manager Thomas Dinsmore shared details of the new features in Revolution R Enterprise 6.2 in the video below: You can also download slides of the presentation at the link below. Revolution Analytics webinars: What's New in Revolution R Enterprise 6.2

Read more »

Computed columns for dataframes

May 8, 2013
By
Computed columns for dataframes

Everyone loves to aggregate data. Everyone loves to create new columns based on other columns. Everyone hates to do the same thing twice. In my continuing work on multilevel view of loss reserving, I reached a point where I realized that I needed a robust mechanism to aggregate computed columns. SQL server and (I’m assuming)

Read more »

Feature Selection 2 – Genetic Boogaloo

May 8, 2013
By
Feature Selection 2 – Genetic Boogaloo

Previously, I talked about genetic algorithms (GA) for feature selection and illustrated the algorithm using a modified version of the GA R package and simulated data. The data were simulated with 200 non-informative predictors and 12 linear effects and three non-linear effects. Quadratic discriminant analysis (QDA) was used to model the data. The last set of...

Read more »

3D Mapping in R

May 8, 2013
By
3D Mapping in R

This tutorial has been kindly contributed by Robin Edwa

Read more »

SAS, SPSS, Stata Users: Learn R from Home June 17

May 8, 2013
By
SAS, SPSS, Stata Users: Learn R from Home June 17

Has learning R been driving you a bit crazy? If so, it may be that you’re “lost in translation.” On June 17 and 19, I’ll be teaching a webinar, R for SAS, SPSS and Stata Users. With each R concept, … Continue reading →

Read more »

Building a custom database of country time-series data using Quandl

May 8, 2013
By
Building a custom database of country time-series data using Quandl

Encouraged by this post I had another look at quandl for collecting datasets from different agencies. Right now I need to get data for four countries on a couple of dozen indicators. This graphic is just a quick example with only two indicators of what I am aiming to be able to do. The process

Read more »

An accept-reject sampler using RcppArmadillo::sample()

May 8, 2013
By
An accept-reject sampler using RcppArmadillo::sample()

The recently added RcppArmadillo::sample() functionality provides the same algorithm used in R’s sample() to Rcpp-level code. Because R’s own sample() is written in C with minimal work done in R, writing a wrapper around RcppArmadillo::sample() to then call in R won’t get you much of a performance boost. However, if you need to repeatedly call sample(), then calling a...

Read more »

An accept-reject sampler using RcppArmadillo::sample()

May 8, 2013
By
An accept-reject sampler using RcppArmadillo::sample()

The recently added RcppArmadillo::sample() functionality provides the same algorithm used in R’s sample() to Rcpp-level code. Because R’s own sample() is written in C with minimal work done in R, writing a wrapper around RcppArmadillo::sample() to then call in R won’t get you much of a performance boost. However, if you need to repeatedly call sample(), then calling a...

Read more »

Gambler’s Run With Shiny

May 8, 2013
By
Gambler’s Run With Shiny

I finally had an opportunity to play with Shiny, and I am very impressed. I have created a Github Project so head over there for the source code. There are a number of ways to distribute Shiny apps. If you are running R (and mostly likely you are if you are reading this), you can download and...

Read more »

heatmaps with p-values (2)… coloured according to odds ratio

May 7, 2013
By
heatmaps with p-values (2)… coloured according to odds ratio

I like heatplots with p-values -or frequencies, or whatever-. Not very conclusive, but pretty anyway. And when talking about graphs, pretty will make our neurons to fire in more interesting ways: neurons like “pretty” graphs. Moreover, observing your data can … Sigue leyendo →

Read more »

CAISN

May 7, 2013
By
CAISN

Reblogged from Zero to R Hero: Canadian Aquatic Invasive Species Networks Annual General Meeting in Kananaskis, Alberta. May 03, 3:25-5:30. This 2-hour workshop will focus on how and why we do numerical simulation in R. Time permitting, we will also look at how to build and fit likelihood based statistical models. We ask that you bring your

Read more »

New geomorph function to digitize multiple 2d images

New geomorph function to digitize multiple 2d images

Hi Morphometricians! We've enhanced geomorph's ability to continuously digitize multiple specimens' images in 2d, if these are within the same directory. This new function allows one to digitize 2d images without interruption. Thanks to Samuel Brown and Karl Fetter for suggesting the improvement. We will incorporate this function in our next package update. I'm including demonstration code...

Read more »

Poisson regression on non-integers

May 7, 2013
By
Poisson regression on non-integers

In the course on claims reserving techniques, I did mention the use of Poisson regression, even if incremental payments were not integers. For instance, we did consider incremental triangles > source("http://perso.univ-rennes1.fr/arthur.charpentier/bases.R") > INC=PAID > INC=PAID-PAID > INC 3209 1163 39 17 7 21 3367 1292 37 24 10 NA 3871...

Read more »

R in Insurance: Programme and Abstracts published

May 7, 2013
By
R in Insurance: Programme and Abstracts published

I am delighted to announce that the programme and abstracts for the first R in Insurance conference at Cass Business School in London, 15 July 2013, have been published. The conference committee received strong abstracts from academia and the industry,...

Read more »

SAS Big Data Analytics Benchmark (Part Two)

May 7, 2013
By

by Thomas Dinsmore On April 26, SAS published on its website an undated Technical Paper entitled Big Data Analytics: Benchmarking SAS, R and Mahout. In the paper, the authors (Allison J. Ames, Ralph Abbey and Wayne Thompson) describe a recent project to compare model quality, product completeness and ease of use for two SAS products together with open source...

Read more »

Eigen-analysis of Linear Model Behavior in R

May 7, 2013
By
Eigen-analysis of Linear Model Behavior in R

This post is actually about replicating the figures in Otto and Day: A Biologist’s Guide to Mathematical Modeling in Ecology and Evolution. The figures I’m interested in for this post are Figures 9.1 and 9.2 in the chapter ‘General Solutions … Continue reading →

Read more »

DataMind & The R Service Bus @ RBelgium

DataMind & The R Service Bus @ RBelgium

  Within 2 weeks on Friday, May 24, The RBelgium R user group is holding its next Regular meeting in Leuven for which this is the schedule: ** Jonathan Cornelissen - DataMind  Discover DataMind, a new online learning platform for d...

Read more »

Subsetting data

May 6, 2013
By
Subsetting data

At School we use R across many courses, because students are supposed to use statistics under a variety of contexts. Imagine their disappointment when they pass stats and discovered that R and statistics haven’t gone away! When students start working with real data sets one of their first stumbling blocks is subsetting data. We have

Read more »

Passing columns of a dataframe to a function without quotes

May 6, 2013
By
Passing columns of a dataframe to a function without quotes

I love the syntax of calls to lm and ggplot, wherein the dataframe is specified as a variable and specific columns are referenced as though they were separate variables. While developing some of my functions, I’d wanted to introduce something similar. I often find that I have a single large dataframe and want to execute

Read more »

xkcd: Visualized

May 6, 2013
By
xkcd: Visualized

IntroductionIt's been said that the ideal job is one you love enough to do for free but are good enough at that people will pay you for it. That if you do what you love no matter what others may say, and if you work at it hard enough, and long enough, eventually people will recognize it and...

Read more »

Explaining real-time predictive analytics with big data (video)

May 6, 2013
By

In my presentation to the Strata Santa Clara 2013 conference earlier this year, my goal was to give a succinct (under 20 minutes!) explanation of three terms that are two often used as mere buzzwords: predictive analytics, real time, and big data. You can download the slides for my presentation, Real-time Big Data Analytics: From Deployment to Production, from...

Read more »

Veterinary Epidemiologic Research: Count and Rate Data – Zero Counts

May 6, 2013
By
Veterinary Epidemiologic Research: Count and Rate Data – Zero Counts

Continuing on the examples from the book Veterinary Epidemiologic Research, we look today at modelling count when the count of zeros may be higher or lower than expected from a Poisson or negative binomial distribution. When there’s an excess of zero counts, you can fit either a zero-inflated model or a hurdle model. If zero

Read more »

When the “reorder” function just isn’t good enough…

May 6, 2013
By
When the “reorder” function just isn’t good enough…

The reorder function, in R 3.0.0, is behaving strangely (or I’m really not understanding something).  Take the following simple data frame: df = data.frame(a1 = c(4,1,1,3,2,4,2), a2 = c(“h”,”j”,”j”,”e”,”c”,”h”,”c”)) I expect that if I call the reorder function on the … Continue reading →

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de









ODSC

CRC R books series











Contact us if you wish to help support R-bloggers, and place your banner here.