How Much Can We Learn from Top Rankings using Nonnegative Matrix Factorization?

July 10, 2014
By
How Much Can We Learn from Top Rankings using Nonnegative Matrix Factorization?

Purchases are choices from available alternatives. Post-purchase, we know what is the most preferred, but all the other options score the same. Regardless of differences in appeal, all the remaining items received the same score of not chosen. A second...

Read more »

Syrian Refugee Density in Lebanon

July 10, 2014
By
Syrian Refugee Density in Lebanon

I've done a few posts on Syria and have used data provided by the UNHCR for different analysis or visualization.  There are several links on their Syrian refugee data portal that communicate the breadth of this crisis numerically and vis...

Read more »

Tips & Tricks 4: Reading In Data Files

July 10, 2014
By

Today's exercise is another nice and simple one, and allows you to get used to manipulating datasets in R.Exercise 4 - How to read a file of coordinate data into R and make sure it is numeric.Reading your data files into R  for anal...

Read more »

Reflections on John Chambers’ UserR! 2014 Keynote Address

July 10, 2014
By
Reflections on John Chambers’ UserR! 2014 Keynote Address

by Joseph Rickert John Chambers opened UseR! 2014 by describing how the R language grew out of early efforts to give statisticians easier access to high quality statistical software. In 1976 computational statistics was a very active field, but most algorithms were compiled as Fortran subroutines. Building models with this software was not a trivial process. First you had...

Read more »

Including GitHub and Bioconductor on Rdocumentation: Technical Details

July 10, 2014
By
Including GitHub and Bioconductor on Rdocumentation: Technical Details

In our last blog post we announced the addition of GitHub and  Bioconductor R packages to Rdocumentation. For the more technical amongst you, I’ll give a short, high-level description of what’s under the hood at Rdocumentation. Along with that I’ll zoom in on some of the challenges encountered while adding GitHub and Bioconductor repositories. Rdocumentation

Read more »

In case you missed it: June 2014 Roundup

July 9, 2014
By

In case you missed them, here are some articles from June of particular interest to R users: The useR! 2014 conference in Los Angeles opened with 16 tutorials. DataInformed published an article by David Smith on how various companies use R. Joe Rickert reviews the new book "Applied Predictive Modeling" by Max Kuhn and Kjell Johnson, which is rich...

Read more »

FRAMA Part IV: Continuing the Long/Short Filter Search

July 9, 2014
By
FRAMA Part IV: Continuing the Long/Short Filter Search

This post examines an n-day median filter for two desirable properties: robustness to outliers and an inherent trend-confirming lag. While … Continue reading →

Read more »

Buster – a new R package for bagging hierarchical clustering

July 9, 2014
By
Buster – a new R package for bagging hierarchical clustering

I recently found myself a bit stuck. I needed to cluster some data. The distances between the data points were not representable in Euclidean space so I had to use hierarchical clustering. But then I wanted stable clusters that would retain their shape as I updated the data set with new observations. This I could … Continue reading...

Read more »

Recording of OpenCPU talk at #useR2014

July 9, 2014
By
Recording of OpenCPU talk at #useR2014

A recording of the useR! 2014 prentation about OpenCPU is now available on Youtube. This talk gives a brief (20 minute) motivation and introduction to some of the high level concepts of the OpenCPU system. The video contains mostly screen recor...

Read more »

Can Rcpp fuse ?

July 8, 2014
By
Can Rcpp fuse ?

One of the features of Rcpp11 people seemed to like during useR is the fuse function. fuse is somewhat similar to the c function in R. For the purpose of this post, let's simplify what fuse does, and just say that it takes several compatible vector...

Read more »

Are Consumer Preferences Deep or Shallow?

July 8, 2014
By
Are Consumer Preferences Deep or Shallow?

John Hauser, because no one questions his expertise, is an excellent spokesperson for the viewpoint that consumer preferences are real, as presented in his article "Self-Reflection and Articulated Consumer Preferences." Simply stated, preferences are e...

Read more »

Speed Tests for Rolling/Running Functions

July 8, 2014
By

I use rolling and running functions almost daily with financial time series. In my post A Whole New World with Chains and Pipes, I made this statement I have noticed that rolling analysis with xts can sometimes be slow. as.matrix is my favorite way to ...

Read more »

R Day at Strata NYC

July 8, 2014
By
R Day at Strata NYC

RStudio will teach the new essentials for doing data science in R at this year’s Strata NYC conference, Oct 15 2014. R Day at Strata is a full day of tutorials that will cover some of the most useful topics in R. You’ll learn how to manipulate and visualize data with R, as well as

Read more »

DSC 2014, Day 2

July 8, 2014
By
DSC 2014, Day 2

This will be somewhat shorter summary of the second day of DSC 2014. There was a wider variety of presentations today, but I am only going to discuss those that touched on the main theme of the day, which was … Continue reading →

Read more »

Dependencies of popular R packages

July 8, 2014
By
Dependencies of popular R packages

With the growing popularity of R, there is an associated increase in the popularity of online forums to ask questions. One of the most popular sites is StackOverflow, where more than 60 thousand questions have been asked and tagged to be related to R. On the same page, you can also find related tags. Among the top 15 tags...

Read more »

meteoForecast, a package to obtain NWP-WRF forecasts in R

meteoForecast, a package to obtain NWP-WRF forecasts in R

The Weather Research and Forecasting (WRF) Model is a numerical weather prediction (NWP) system. NWP refers to the simulation and …Sigue leyendo →

Read more »

googleVis 0.5.3 released

July 8, 2014
By
googleVis 0.5.3 released

Recently we released googleVis 0.5.3 on CRAN. The package provides an interface between R and Google Charts, allowing you to create interactive web charts from R.Screen shot of some of the Google ChartsAlthough this is mainly a maintenance release, I'd...

Read more »

analyze the demographic and health surveys (dhs) with r

July 8, 2014
By

professors of public health 101 probably cite the results of the demographic and health surveys (dhs) more than all other data sources combined.  funded by the united states agency for international development (usaid) and administered by the tech...

Read more »

Reflections on useR! 2014

July 7, 2014
By
Reflections on useR! 2014

UseR! 2014, the R user conference held last week in LA, was the most successful yet. Around 700 R users from around the world converged on the UCLA campus to share their experiences with the R language and to socialize with other data scientists, statisticians and others using R. The week began with a series of 3-hour tutorials on...

Read more »

Chillin’ at UseR! 2014

July 7, 2014
By
Chillin’ at UseR! 2014

This year’s UseR! conference was held at the University of California in Los Angeles. Despite the great weather and a nearby beach, most of the conference was spent in front of projector screens in 18° c (64° f) rooms because there were so many interesting presentations and tutorials going on. I was lucky to present my R package...

Read more »

Sometimes Table is not the Answer – a Faster 2×2 Table

July 7, 2014
By
Sometimes Table is not the Answer – a Faster 2×2 Table

The table command is great in its simplicity for cross tabulations. I have run into some settings where it is slow and I wanted to demonstrate one simple example here of why you may want to use other functions or write your own tabler. This example is a specific case where, for some examples and

Read more »

What are the names of the school principals in Mexico?, If your name is Maria, probably this post will interest you. Trends and cool plots from the national education census of Mexico in 2013

What are the names of the school principals in Mexico?, If your name is Maria, probably this post will interest you. Trends and cool plots from the national education census of Mexico in 2013

I will start this post with a disclaimer:The main intention of the post is to show how is the distribution of the school principal names in Mexico, for example, to show basic trends regarding about what is the most common nation-wide first name and so ...

Read more »

DSC 2014. Day 1

July 7, 2014
By
DSC 2014. Day 1

This is a report of the first day of the Directions in Statistical Computing (DSC) conference that took place in Brixen, Italy (See here for an introduction). Performance enhancements were the main theme of the day, covering not just improvements … Continue reading →

Read more »

3 ways that functions can improve your R code

July 7, 2014
By

Read more »

How to pick up 3 numbers from a uniform distribution in a transparent manner?

July 7, 2014
By
How to pick up 3 numbers from a uniform distribution in a transparent manner?

Over in my previous post, I’m giving away 3 copies of my video course on ggplot2 and shiny. To win a copy, you just need to leave a comment and I will select 3 winners among the n participants at … Continue reading →

Read more »

Introduction to R for Life Scientists: Course Materials

July 7, 2014
By
Introduction to R for Life Scientists: Course Materials

Last week I taught a three-hour introduction to R workshop for life scientists at UVA's Health Sciences Library.I broke the workshop into three sections:In the first half hour or so I presented slides giving an overview of R and why R is so awesome. Du...

Read more »

Four Simple Turtle Graphs To Play With Kids

July 7, 2014
By
Four Simple Turtle Graphs To Play With Kids

Technology is just a tool: in terms of getting the kids working together and motivating them, the teacher is the most important (Bill Gates) Some days ago I read this article in R-bloggers and I discovered the TurtleGraphics package. I knew about turtle graphics long time ago and I was thinking of writing a post

Read more »

Identify Fantasy Football Sleepers with this Shiny App

July 6, 2014
By

This post describes a Shiny app that identifies fantasy football sleepers.  The app allows you to modify your league settings, and calculates robust averages of projections across numerous sources.  Best of all, The post Identify Fantasy Football Sleepers with this Shiny App appeared first on Fantasy Football Analytics.

Read more »

Competitive balance and home court advantage in the NBA

July 6, 2014
By
Competitive balance and home court advantage in the NBA

Two years ago, the entire NBA season went into lockout because of mostly financial reasons. However, one central point was also about keeping a competitive balance within the NBA, so that large and small-market teams alike would have a chance to compete for a championship. THis brings us to the obvious question “Is there competitive

Read more »