David Smith at the Revolutions blog posted a nice presentation on “big data” (oh, how I dislike that term). It is a nice piece of work and the Revolution guys manage to process a large amount of records, starting with … Continue reading →

R news and tutorials contributed by (552) R bloggers

Converting HTML to plain text usually involves stripping out the HTML tags whilst preserving the most basic of formatting. I wrote a function to do this which works as follows (code can be found on github): The above uses an XPath approach to achieve it’s goal. Another approach would be to use a regular expression. These

The Black-Litterman Model was created by Fisher Black and Robert Litterman in 1992 to resolve shortcomings of traditional Markovitz mean-variance asset allocation model. It addresses following two items: Lack of diversification of portfolios on the mean-variance efficient frontier. Instability of portfolios on the mean-variance efficient frontier: small changes in the input assumptions often lead to

Here's a cool application of calendar heat maps: runner Andy used R to catalogue his daily running mileage over the last 2+ years: There are lots of ways to chart data like this (a simple time-series chart, for example), but sometimes looking at data in new ways offers fresh perspectives. For example, Andy notes: "Apparently I missed running on...

Small changes in the input assumptions often lead to very different efficient portfolios constructed with mean-variance optimization. I will discuss Resampling and Covariance Shrinkage Estimator – two common techniques to make portfolios in the mean-variance efficient frontier more diversified and immune to small changes in the input assumptions. Resampling was introduced by Michaud in Efficient

Time Series as calendar heat maps + All of my running data since April 1, 2009 = Generated by the following code: #Sample Code based on example program at: source(file = "calendarHeat.R") run<- read.csv("log.csv", header = TRUE, sep=",") sum(run$Distance) date <- c() for (i in 1: dim(run)){ if(run$DistanceUnit== 'Kilometer'){ miles <- c(miles,run$Distance * 0.62) }

What is important for an investor? The rate of return is at the top of the list. Does the expected rate of return shown on the mean-variance efficient frontier paints the full picture? If investor’s investment horizon is longer than one period, for example 5 years, than the true measure of portfolio performance is Geometric

The Seattle R User Group was kind enough to invite me to give a talk about R, C++ and Rcpp. So if you can make it to the Thomas building of the Fred Hutchinson Cancer Research Center in Seattle, WA, on December 7, I would love to see you there. I ha...