3896 search results for "git"

Cyber Summit 2012: a bit of big data and a lot of small tweets

October 7, 2012

Last week (October 1-3) MPK Analytics attended the annual Cyber Summit in Banff. The theme for this year was “Leading the Way in the Age of Big Data”. As might


Weekend Reading – Facebook’s P/E ratio

October 7, 2012

The Barron’s article Still Too Pricey by Andrew Bary looks at Facebook’s share price and, based on P/E ratio valuation metrics, concludes that even at current prices the stock is overvalued. I want to show how to do this type of fundamental analysis using the Systematic Investor Toolbox. First let’s load


Football ordinal model: examination and predictions

October 7, 2012

In the previous entry an ordinal model for football games was developed. It is now time to look more closely at the model and use it. This means three sections: a look at likelihood and link function, a model interpretation part, which focuse...


EDA Before CDA

October 6, 2012

One Paragraph Summary: Always explore your data visually. Whatever specific hypothesis you have when you go out to collect data is likely to be worse than any of the hypotheses you’ll form after looking at just a few simple visualizations of that data. The most effective hypothesis testing framework in existence is the test of


A quick introduction to ggplot()

October 5, 2012

I gave a short talk today about ggplot. This is what I presented. Additional resources are at the bottom of this post. ggplot is an R package for data exploration and producing plots. It produces fantastic-looking graphics and allows one to slice and dice one’s data in many different ways. Comparing with base...


Style your R charts like the Economist, Tableau … or XKCD

October 5, 2012

As we noted last month, the new Themes feature in ggplot2 helps you customize the design of R charts to your liking. Now, R user Jeffrey Arnold has built on this feature to create standardized themes that make R graphics look like those from major publications and other software systems. You can use his ggthemes package to make your...


How to read BSMAP methylation ratio files into R via methylKit

October 5, 2012

BSMAP is an aligner for bisulfite sequencing reads. It outputs aligned reads as well as methylation ratios per base (via the methratio.py script). The methylation ratios can be read into R via the methylKit package and regular methylKit analysis can ...


DIY ZeroAccess GeoIP Plots

October 5, 2012

Since F-Secure was #spiffy enough to provide us with GeoIP data for mapping the scope of the ZeroAccess botnet, I thought that some aspiring infosec data scientists might want to see how to use something besides Google Maps & Google Earth to view the data. If you look at the CSV file, it’s formatted as


Running motivation #An R amusement

October 5, 2012

Henry John-Alder told me once that in a marathon, twice as many runners cross the line at 2h 59m as at 3h 00m. He pointed out that this anomaly in the distribution of finishers per minute (roughly normal shaped) is due …


Calculating distances (across matrices)

October 5, 2012

This Gist is mostly for my future self, as a reminder of how to find distances between each row in two different matrices. To create a distance matrix from a single matrix, the function dist() from the stats package is sufficient. There are times, ho...
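
The idea of row-by-row distances across two matrices can be sketched as follows. This is a minimal illustration in Python rather than R, not the code from the Gist itself, which is not shown in this excerpt. The helper name `cross_distances` is hypothetical, and Euclidean distance is assumed as the metric.

```python
import math


def cross_distances(a, b):
    """Return a len(a) x len(b) table of Euclidean distances.

    Entry [i][j] is the distance between row i of `a` and row j of `b`.
    `cross_distances` is a hypothetical helper name used for illustration.
    """
    rows = list(a) + list(b)
    if rows and any(len(row) != len(rows[0]) for row in rows):
        raise ValueError("all rows must have the same number of columns")
    # math.dist (Python 3.8+) computes the Euclidean distance between two points.
    return [[math.dist(row_a, row_b) for row_b in b] for row_a in a]


if __name__ == "__main__":
    first = [[0.0, 0.0], [3.0, 4.0]]
    second = [[0.0, 0.0], [6.0, 8.0]]
    for line in cross_distances(first, second):
        print(line)
```

Unlike a single-matrix distance matrix, the result here is generally not square or symmetric: it has one row per row of the first matrix and one column per row of the second.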
