Functions for plotting and getting Greek in labels

October 8, 2012
By
Functions for plotting and getting Greek in labels

The problem: We often want to plot data and assign plot attributes based on characteristics of the data. For example, if we have a group of students with the following IQs, we might want to indicate who is an outlier in the statistical sense. I like...

Read more »

S&P 500 correlations up to date

October 8, 2012
By
S&P 500 correlations up to date

I haven’t heard much about correlation lately.  I was curious about what it’s been doing. Data The dataset is daily log returns on 464 large cap US stocks from the start of 2006 to 2012 October 5. The sector data were taken from Wikipedia. The correlation calculated here is the mean correlation of stocks among … Continue reading...

Read more »

GBIF biodiversity data from R – more functions

October 8, 2012
By
GBIF biodiversity data from R – more functions

We have been working on an R package to get GBIF data from R, with the stable version available through CRAN here, and the development version available on GitHub here. We had a Google Summer of code stuent work on the package this summer - you can se...

Read more »

Presidential Candidate Sentiment Analysis

October 7, 2012
By
Presidential Candidate Sentiment Analysis

After watching the Presidential debates and hearing all the opinions on how the candidates performed, I got the hair brained idea of creating a simple function that would do automate the pulling down of tweets for each candidate, analyze the positivity or negativity of tweets, and then graph them out. This project turned out to

Read more »

SPIDER makes the top 10 barcoding publications of 2012

October 7, 2012
By
SPIDER makes the top 10 barcoding publications of 2012

In the recent Barcode Bulletin published by iBoL, our humble paper announcing the R package spider: Species identity and evolution made second on their list of the top 10 publications of 2012. Not bad for a side project! Spider is available for downl...

Read more »

Splitting and Combining R pdf Graphics

October 7, 2012
By
Splitting and Combining R pdf Graphics

A question that often comes across various help lists is how to combine or split an output from an R graphics device. Maybe you have looped/combined multiple visuals into a single pdf to avoid cluttering your working directory and now … Continue reading →

Read more »

Sample Input Data.

October 7, 2012
By
Sample Input Data.

Just a couple quick examples. Starting with   30 meter impervious surface Followed by MODIS Land cover  ( “red” is urban ) And finally Day  LST   Google earth

Read more »

More Fun With Modis

October 7, 2012
By
More Fun With Modis

I’ve started a tutorial on using the MODIS package in R the first few steps are here.  While I wait on a release of the package I thought I would play around a bit with the MRT tool and see how it worked.  Let’s recall where we are going.  I have an inventory of around

Read more »

Zurich, Sep 2012 – Portfolio Selection

October 7, 2012
By

(This article was first published on Rmetrics blogs, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: Rmetrics blogs. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web...

Read more »

Cyber Summit 2012: a bit of big data and a lot of small tweets

October 7, 2012
By
Cyber Summit 2012: a bit of big data and a lot of small tweets

Last week (October 1-3) MPK Analytics attended the annual Cyber Summit in Banff. The theme for this year was; “Leading the Way in the Age of Big Data“. As might

Read more »

Fit and Visualize A MARS Model

October 7, 2012
By
Fit and Visualize A MARS Model

Read more »

Weekend Reading – Facebook’s P/E ratio

October 7, 2012
By
Weekend Reading – Facebook’s P/E ratio

The Barron’s article Still Too Pricey by Andrew Bary looks at the share price of the Facebook and based on the P/E ration valuation metrics concludes that even at the current prices, stock is overvalued. I want to show how to do this type of fundamental analysis using the Systematic Investor Toolbox. First let’s load

Read more »

Keeping track of my calories the R way

Keeping track of my calories the R way

So...I'm back with Your Shape: Fitness Evolved 2012 for XBox Kinect. Why? Because I want to loose some weight and get back in shape of course -;)The reason I stop playing the game is simple...I'm lazy...but this time, I have come back with a goal...bur...

Read more »

Footbal ordinal model: examination and predictions

October 7, 2012
By
Footbal ordinal model: examination and predictions

In the previous entry an ordinal model for football games was developed. It is now time to look a bit better at the model and use it. This means three sections; A look at likelihood and link function, a model interpretation part, which focuse...

Read more »

Mumbai/Bangalore, 2012/13 – Rmetrics Courses

October 7, 2012
By

(This article was first published on Rmetrics blogs, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: Rmetrics blogs. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web...

Read more »

Zurich, Aug 2012 – Swiss SBBI Data

October 7, 2012
By

(This article was first published on Rmetrics blogs, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: Rmetrics blogs. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web...

Read more »

EDA Before CDA

October 6, 2012
By
EDA Before CDA

One Paragraph Summary Always explore your data visually. Whatever specific hypothesis you have when you go out to collect data is likely to be worse than any of the hypotheses you’ll form after looking at just a few simple visualizations of that data. The most effective hypothesis testing framework in existence is the test of

Read more »

R-bloggers

October 6, 2012
By

R-bloggers provides a great service, aggregating a universe of blogs which contribute aRticles on R and using R (marked using an "R"-tag.This is a nice community service creating a one-stop shop for readers to learn about R, but also a great idea for a...

Read more »

A quick introduction to ggplot()

October 5, 2012
By
A quick introduction to ggplot()

I gave a short talk today to the about ggplot. This what I presented. Additional resources at the bottom of this post ggplot is an R package for data exploration and producing plots. It produces fantastic-looking graphics and allows one to slice and dice one’s data in many different ways. Comparing with base...

Read more »

Style your R charts like the Economist, Tableau … or XKCD

October 5, 2012
By
Style your R charts like the Economist, Tableau … or XKCD

As we noted last month, the new Themes feature in ggplot2 helps you customize the design of R charts to your liking. Now, R user Jeffrey Arnold has built on this feature to create standardized themes to make R graphics looks like those from major publications and other software systems. You can use his ggthemes package to make your...

Read more »

How to read BSMAP methylation ratio files into R via methylKit

October 5, 2012
By

BSMAP is an aligner for bisulfite sequencing reads. It outputs aligned reads as well as methylation ratios per base (via methratio.py script). The methylation ratios can be read into R via methylKit package and regular methylKit analysis can ...

Read more »

DIY ZeroAccess GeoIP Plots

October 5, 2012
By
DIY ZeroAccess GeoIP Plots

Since F-Secure was #spiffy enough to provide us with GeoIP data for mapping the scope of the ZeroAccess botnet, I thought that some aspiring infosec data scientists might want to see how to use something besides Google Maps & Google Earth to view the data. If you look at the CSV file, it’s formatted as

Read more »

Running motivation #An R amusement

October 5, 2012
By
Running motivation #An R amusement

Henry John-Alder told me once that in a marathon, twice as runners cross the line at 2h 59m than at 3h 00m. He pointed out that this anomaly in the distribution of finishers per minute (roughly normal shaped) is due … Continue reading →

Read more »

Calculating distances (across matrices)

October 5, 2012
By
Calculating distances (across matrices)

This Gist is mostly for my future self, as a reminder of how to find distances between each row in two different matrices. To create a distance matrix from a single matrix, the function dist(), from the stats package is sufficient. There are times, ho...

Read more »

How to upgrade R in Ubuntu 12.04

October 4, 2012
By

Open your sources.list file in geditsudo gedit /etc/apt/sources.listand add the following line:deb http://cran.cnr.berkeley.edu/bin/linux/ubuntu/ precise/Note that you don't have to use that mirror. You may use any mirror from the list here : http://cran.r-project.org/mirrors.htmlAdd the secure APT key to your system with one commandsudo apt-key adv --keyserver keyserver.ubuntu.com --recv-keys E084DAB9Update your sources and upgrade your installationsudo apt-get update...

Read more »

Permanent Portfolio – Simple Tools

October 4, 2012
By
Permanent Portfolio – Simple Tools

I have previously described and back-tested the Permanent Portfolio strategy based on the series of posts at the GestaltU blog. Today I want to show how we can improve the Permanent Portfolio strategy perfromance using following simple tools: Volatility targeting Risk allocation Tactical market filter First, let’s load the historical prices for the stocks(SPY), gold(GLD),

Read more »

RcppArmadillo 0.3.4.3

October 4, 2012
By

Another bug-fix release of Armadillo, now at version 3.4.3 whike the 3.4.* stabilizes, and with it a version 0.3.4.3 of RcppArmadillo, our wrapper for R and Armadillo. The new version is already on CRAN as of earlier today. Once again no R level or i...

Read more »

RProtoBuf 0.2.6

October 4, 2012
By

Release 0.2.6 of RProtoBuf arrived on CRAN earlier this morning. RProtoBuf provides GNU R bindings for the Google Protobuf data encoding library used and released by Google. This release was once more driven largely by Murray whom we have now add...

Read more »

its the Gramma an punctuashun wot its’ about, Rgiht?

October 4, 2012
By
its the Gramma an punctuashun wot its’ about, Rgiht?

This is another of those confluence style posts, where a handful of things I’ve read in quick succession seem to phase lock in my mind: (brought to mind in part via @downes a week or so ago: How to Synch 32 Metronomes) The first was a post by Alan Levine on Making Text Work, which

Read more »