# Monthly Archives: August 2013

## Applications of Interactivity to Finance

August 28, 2013
Of the nearly infinite ways of using crossfilter and dc.js in finance, the two that immediately came to mind are signal analysis in system building and money manager analysis in due diligence. My first very basic experiment explores a commonly k...

## Bayesian Estimation of Correlation – Now Robust!

August 28, 2013
So in the last post I showed how to run the Bayesian counterpart of Pearson’s correlation test by estimating the parameters of a bivariate normal distribution. A problem with assuming normality is that the normal distribution isn’t robust against outliers. Let’s see what happens if we take the data from the last post with the finishing times...

## Calculating the Highest Value to Bid on Each Player in an Auction Draft: The Bid Up To Value

August 27, 2013
In this post, I calculate the highest value you should bid on each player in an auction draft—what I refer to as the "bid up to" value.  In a previous post, I showed how to determine the best starting lineup to draft using an optimizer tool.  The "bid up to" value is calculated by finding the highest cost...

## An R function like “order” from Stata

August 27, 2013
A while ago, there was a question on Stack Overflow: “Is there an equivalent R function to Stata ‘order’ command?” There isn’t really, and for the most part, you don’t really need one, but I decided that, for fun, I would write one anyway. Instead of operating directly on the data.frames, I decided to just work

## Visualizing the Forbes-CCAP University Rankings using ggplot2, rCharts, googleVis, and the shiny server

August 27, 2013
President Obama is pushing for higher education reform, and the development of a rating system for universities is a critical component of it. These ratings are likely to be based on several measures, such as graduation rates, earnings of graduates, and...

## Multicore (parallel) processing in R

August 27, 2013
Multicore (parallel) processing in R from Wallace Campbell on Vimeo. If you're not programming in parallel, you're only using a fraction of your computer's power! I demonstrate how to run "for" loops in parallel using the mclapply function from the multicore library. The code can be scaled to any number of available cores.

## In-Hadoop R-based Analytics coming to Cloudera

August 27, 2013
Revolution Analytics has teamed up with Cloudera to bring the scalable data manipulation and statistical modeling algorithms of Revolution R Enterprise to the massively parallel computing environments of CDH3 and CDH4 Hadoop clusters. As ZDNet reports: Specifically, the upcoming version 7.0 of the Revolution R Enterprise distribution and its ScaleR algorithms will run inside CDH3 and CDH4, eliminating the need...

## Presenting Conformance Statistics

August 27, 2013
A client came to me with some conformance data. She was having a hard time making sense of it in a spreadsheet. I had a look at a couple of ways of presenting it that would bring out the important points. The Data The data came as a spreadsheet with multiple sheets. Each of the

## Fantastic presentations from R using slidify and rCharts

August 27, 2013
Dr. Ramnath Vaidyanathan of McGill University gave an excellent presentation at a joint Data Visualization DC/Statistical Programming DC event on Monday, August 19 at nclud, on two R projects he leads — slidify and rCharts.