Blog Archives

Quantitative Finance Applications in R – 4: Using the Generalized Lambda Distribution to Simulate Market Returns

February 25, 2014
By
Quantitative Finance Applications in R – 4:  Using the Generalized Lambda Distribution to Simulate Market Returns

by Daniel Hanson, QA Data Scientist, Revolution Analytics Introduction As most readers are well aware, market return data tends to have heavier tails than that which can be captured by a normal distribution; furthermore, skewness will not be captured either. For this reason, a four parameter distribution such as the Generalized Lambda Distribution (GLD) can give us a more...

Read more »

Sampling from a torus

February 19, 2014
By
Sampling from a torus

by Joseph Rickert One of the key ideas in topological data analysis is to consider a data set to be a sample from a manifold in some high dimensional topological space and then to use the tools of algebraic topology to reconstruct the manifold. It turns out that the converse problem of taking a random sample from a given...

Read more »

Princeton vs. Facebook: modeling contagion

February 18, 2014
By
Princeton vs. Facebook: modeling contagion

by James Paul Peruvankal, Senior Program Manager at Revolution Analytics Three weeks ago, researchers at Princeton released a study on Epidemiological modeling of online social network dynamics that states Facebook might lose 80% of its users by 2015-2017. Facebook data scientists hilariously debunked the study stating that Princeton itself would lose all of its students by 2021, using the...

Read more »

3D Plots in R

February 13, 2014
By
3D Plots in R

by Joseph Rickert Recently, I was trying to remember how to make a 3D scatter plot in R when it occurred to me that the documentation on how to do this is scattered all over the place. Hence, this short organizational note that you may find useful. First of all, for the benefit of newcomers, I should mention that...

Read more »

Revolution R Enterprise in the Amazon Cloud

February 12, 2014
By

by Oliver Vagner, Cloud Solutions Lead Architect at Revolution Analytics Today, I am pleased to announce our new offering in the Amazon Web Services Big Data Marketplace – Revolution R Enterprise 7 for AWS. Of course, if you follow this blog, then you are quite familiar with Revolution R Enterprise (RRE) and what it brings to the table with...

Read more »

R and the Weather

February 6, 2014
By
R and the Weather

by Joseph Rickert The weather is on everybody's mind these days: too much ice and snow east of the Rockies and no rain to speak fo in California. Ram Narasimhan has made it a little easier for R users to keep track of what's going on and also get a historical perspective. His new R package weatherData makes it...

Read more »

Revolution Analytics announces $999 site licenses for universities and public service organizations

February 4, 2014
By

by Joseph Rickert Revolution Analytics is announcing three new programs today that we hope will be modest but positive contributions to data science education and public service analytics. The first new program, the Academic Institution Program (AIP) enables colleges, universities and other educational institutions to obtain a site license for Revolution Analytics' commercial distribution of the R Language, Revolution...

Read more »

A First Look at rxDForest()

January 30, 2014
By
A First Look at rxDForest()

by Joseph RIckert Last July, I blogged about rxDTree() the RevoScaleR function for building classification and regression trees on very large data sets. As I explaned then, this function is an implementation of the algorithm introduced by Ben-Haim and Yom-Tov in their 2010 paper that builds trees on histograms of data and not on the raw data itself. This...

Read more »

Quantitative Finance Applications in R – 3: Plotting xts Time Series

January 28, 2014
By
Quantitative Finance Applications in R – 3: Plotting xts Time Series

by Daniel Hanson, QA Data Scientist, Revolution Analytics Introduction and Data Setup Last time, we included a couple of examples of plotting a single xts time series using the plot(.) function (ie, said function included in the xts package). Today, we’ll look at some quick and easy methods for plotting overlays of multiple xts time series in a single...

Read more »

Book review: "Doing Data Science" by Rachel Schutt and Cathy O’Neil

January 23, 2014
By

by Joseph Rickert Every once in a while a single book comes to crystallize a new discipline. If books still have this power in the era of electronic media, "Doing Data Science, Straight Talk from the Frontline" by Rachel Schutt and Cathy O’Neil: O'Reilly, 2013 might just be the book that defines data science. "Doing Data Science", which is...

Read more »