Monthly Archives: July 2013

GGPlot2 #1: Employee Job Satisfaction at Top Tech Companies

July 13, 2013
By
GGPlot2 #1: Employee Job Satisfaction at Top Tech Companies

This is the first in a series of ongoing posts where I’ll take data on various topics and create simple visualizations of that data using the ggplot2 package in R. While my day job involves analyzing data, I rarely work on projects where I’m expected to produce “publication-worthy” graphics. Therefore, these posts are a way for

Read more »

Faster Multivariate Normal densities with RcppArmadillo and OpenMP

July 13, 2013
By
Faster Multivariate Normal densities with RcppArmadillo and OpenMP

The Multivariate Normal density function is used frequently in a number of problems. Especially for MCMC problems, fast evaluation is important. Multivariate Normal Likelihoods, Priors and mixtures of Multivariate Normals require numerous evaluations, thus speed of computation is vital. We show a twofold increase in speed by using RcppArmadillo, and some extra gain by using OpenMP. This project is based...

Read more »

Reflections on UseR! 2013

July 12, 2013
By

This week I’ve been at the R Users conference in Albacete, Spain. These conferences are a little unusual in that they are not really about research, unlike most conferences I attend. They provide a place for people to discuss and exchange ideas on how R can be used. Here are some thoughts and highlights of the conference, in no...

Read more »

Throw some, throw some STATS on that map…(Part 1)

July 12, 2013
By
Throw some, throw some STATS on that map…(Part 1)

R is a very powerful and free (and fun) software package that allows you to do, pretty much anything you could ever want. Someone told me that there’s even code that allows you to order pizza (spoiler alert: you actually cannot order pizza using R :( ). But if you’re not hungry, the statistical capabilities

Read more »

From Whale Calls to Dark Matter: Competitive Data Science with R and Python

July 12, 2013
By
From Whale Calls to Dark Matter: Competitive Data Science with R and Python

Back in June I gave a fun talk at Montreal Python on some of my dabbling in the competitive data science scene. The good people at Savior-fair Linux recorded the talk and have edited it all together into a pretty slick video. If you can spare twenty-minutes or so, have a look. If you want

Read more »

UseR! 2013: it’s a wrap!

July 12, 2013
By
UseR! 2013: it’s a wrap!

Steve Scott (Google) presents at useR! 2013, July 12 2013 The 2013 UseR! conference has drawn to a close in Albacete, Spain. The conference organizers did a fantastic job putting together a jam-packed presentation and social program for the 350+ R users in attendance. Here are just a few of my highlights from the last couple of days: Duncan...

Read more »

Calculate RMSE and MAE in R and SAS

July 12, 2013
By
Calculate RMSE and MAE in R and SAS

Here is code to calculate RMSE and MAE in R and SAS. RMSE (root mean squared error), also called RMSD (root mean squared deviation), and MAE (mean absolute error) are both used to evaluate models. MAE gives equal weight to all errors, while RMSE gives extra weight to large errors. Continue reading →

Read more »

Course Materials from useR! 2013 R/Bioconductor for Analyzing High-Throughput Genomic Data

July 12, 2013
By

At last week's 2013 useR! conference in Albacete, Spain, Martin Morgan and Marc Carlson led a course on using R/Bioconductor for analyzing next-gen sequencing data, covering alignment, RNA-seq, ChIP-seq, and sequence annotation using R. The course mate...

Read more »

An Introduction to Collaborative Filtering

July 12, 2013
By
An Introduction to Collaborative Filtering

A typical consumer today uses multiple devices to surf the web and interact in many ways with your eCommerce business. For most stores, maximizing conversion and increasing order size in this environment is not only an enormous challenge, but also an incredible opportunity. eCommerce stores also have a variety of marketing channels be it e-mail

Read more »

Popularity bigdata / large data packages in R and ffbase useR presentation

Popularity bigdata / large data packages in R and ffbase useR presentation

(This article was first published on BNOSAC - Belgium Network of Open Source Analytical Consultants, and kindly contributed to R-bloggers) A few weeks ago, Rstudio released it's download logs, showing who downloaded R packages through their CRAN mirror. More info: http://blog.rstudio.org/2013/06/10/rstudio-cran-mirror/ This is very nice information and it can be used to show the popularity of packages with R, which has...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)